Peptides and proteins fragment in the presence of 3-mercaptopropionic acid to afford thioesters which can be used in native chemical ligation reactions.
Abstract
Peptides and proteins fragment sequence-specifically in the presence of 3-mercaptopropionic acid to afford thioesters which can be used in native chemical ligation reactions.
Native chemical ligation (NCL) is an extremely useful method for the production of synthetic and semi-synthetic proteins.1 Since it is widely accepted that approximately fifty amino acid residues represents the limit for efficient automated solid phase peptide synthesis (SPPS) the use of recombinant methods for the production of the required thioester and cysteine-containing components has been widely exploited.2 However, while the number of available synthetic methods for the production of the thioester component has risen dramatically,3 microorganism-derived thioesters are most frequently generated by the commercially available intein-fusion expression system.4 Recently, several variations on a central theme of thioester production via an N to S acyl shift, that are mechanistically more similar to the intein system, have been reported (Scheme 1).5
Following the work of Kawakami and Aimoto5d on the use of cysteinylprolyl esters (CPEs) for the transient production of thioesters which can participate in NCL reactions, we had attempted to utilise CPEs in our studies geared towards the semi-synthesis of erythropoietin (EPO) and had managed to apply them with limited success (Scheme 2(a)).6 Consequently, we attempted to “rescue” CPE 1 employing 3-mercaptopropionic acid (MPA) to produce thioesters via a N → S acyl shift in aqueous MPA as reported by Hojo and co-workers.5b In doing so we observed formation of an undesired glycine thioester 3 at the expense of our desired alanine thioester. Interestingly the “isolated” cysteine residue appeared to facilitate conversion to a glycine-MPA thioester in the absence of any additional stimulus such as a CPE, a mercaptomethyl proline residue,5b or cysteine N-alkylation5e and led us to predict that some selectivity may be achievable in peptide and protein fragmentation leading to thioesters in a single step, using MPA as the sole thioesterification reagent.7 The desired alanine thioester was observed in trace quantities only and although it was possible that any Ala thioester could be further processed to the Gly-thioester, it was not yet clear if this was the case. To validate that the glycine thioester 3 had indeed formed it was isolated by semi-preparative HPLC and successfully ligated to model peptide 2 (Scheme 2(b)).8
To determine whether or not MPA could effect this transformation more widely we investigated the MPA mediated fragmentation of a recombinant 21 kDa protein sample, His10-WT human erythropoietin (EPO). This 166 residue protein contains four cysteine residues in the arrangement I7C, G29C, H33C and A161C and was treated with 20% MPA over time periods of 1–48 h and at temperatures ranging from 40 to 80 °C. The protein appeared to fragment in 20% v/v MPA at temperatures as low as 40 °C (Fig. 1(a)) and surprisingly there appeared to be an accumulation of the His10-EPO(1-28)-Gly-MPA thioester and His10-EPO(1-32)His-MPA thioester peptides, with the His-thioester peptide appearing the most abundant. The peptides corresponding to cleavage at the I7C junction or between A161C were not observed. While it proved difficult to separate this pair of thioesters, the mutation G28A was sufficient to suppress thioesterification at this site (Fig. 1(b)) and the His10EPO-1-32(G28A)-MPA thioester was isolated in 28% yield.
The presumed peptide thioesters were isolated by reverse-phase HPLC and also successfully ligated to model peptide 2 in the presence of 6 M guanidine·HCl, 300 mM Na phosphate buffer; pH 7.4, 25 mM 4-mercaptophenylacetic acid (MPAA),9 20 mM TCEP to afford the expected NCL products.8 This experiment further suggested that MPA facilitated protein fragmentation at cysteine residues and that the relative degree of fragmentation may vary depending on the nature of the amino acid adjacent to the cysteine residue. Combining these results led us to postulate an approximate reactivity ranking of H ≥ G > A > I, a trend similar to that observed in the reactivity of thioesters in regular NCL reactions, the reverse process.10 Consequently we predicted that useful thioesterification may be limited to NCL’s favoured three residues (Gly, His, Cys).
Seven peptides (Table 1) were prepared to probe this hypothesis in more detail. Each peptide was related to the recombinant EPO sample that had undergone fragmentation and contained two possible positions at which fragmentation could take place, though in each case the amino acid adjacent to the cysteine residue had been altered such that the outcome of the cleavage reaction could be predicted. In keeping with the observed reactivity profile of peptide thioesters in native chemical ligation reactions we allowed Gly-Cys, Cys-Cys, Ile-Cys and His-Cys “pairs” to compete with each other. Gly, Cys, and His thioesters are known to participate in native chemical ligation reactions with particularly fast kinetics whereas Ile is among the slower thioesters to react.10 Consequently, we expected that peptides which contained exclusively GC junctions or a GC junction combined with CC or HC junctions to yield mixtures of thioesters whereas peptides that contained IC or protected GC(S-Acm) to yield singly-cleaved compounds and this was indeed found to be the case. Thioester formation at HC junctions appeared to be the most rapid and most extensive followed by cleavage at GC and CC junctions. Interestingly the peptide containing the GC and HC pair (entry 1) appeared to fragment in a very similar manner to the recombinant protein fragment providing a mixture of His and Gly thioesters.
Table 1. Peptides designed to test for selectivity in MPA-mediated fragmentation.
X8C cleavageb |
X12C cleavageb |
||||||
Entry | Peptide sequencea | Calc. m/z | Obs. m/z | Calc. m/z | Obs. m/z | X12C : X8Cc | Isolated yieldd (%) |
1 | H-AENITTGCAEHC-NH2 | 793.3 | 793.5 | 1233.5 | 1233.6 | ≈1 : 1 | 28 |
2 | H-AENITTGC(Acm)AEHC-NH2 | 793.3 | n/o | 1304.5 | 1304.6 | >9 : 1 | 39 |
3 | H-AENITTICAEHC-NH2 | 849.4 | n/o | 1289.5 | 1289.7 | >9 : 1 | 60 |
4 | H-AENITTICAEGC-NH2 | 849.4 | n/o | 1209.5 | 1209.7 | >9 : 1 | n.d. |
5 | H-AENITTGCAEGC-NH2 | 793.3 | 793.5 | 1153.4 | 1153.6 | ≈1 : 1 | 26 |
6 | H-AENITTGCAECC-NH2 | 793.3 | 793.5 | 1199.4 | 1199.6 | ≈1 : 1 | 28 |
7 | H-AENITTGCAEIC-NH2 | 793.3 | 793.5 | 1209.5 | n/o | 1 : 9 | n.d. |
aPeptides prepared on Rink amide MBHA resin.
bEach 50 µL reaction contained approx. 8 mM peptide and was analysed by LC-MS after 36 h at 50 °C (unoptimised) in 20% MPA. The calculated mass corresponds to the MPA thioester.
cDetermined by LC-MS.
dUnoptimised yield of the full length (11 mer) thioester. n.d. = not determined and applies to peptides where isolation was hindered by overlapping MPA derived peaks.
Having demonstrated that the thioesterification reaction appeared to be selective, we conducted another small screen of test peptides to probe for particularly favourable or unfavourable thioesterification sites.8 Ten short peptides of sequence H-AENITTXC-NH2 (where X = A, D, E, F, G, K, P, S, V, W) were cleaved from the solid support. LC-MS of the crude products indicated that the peptides were highly pure though in some cases (with the exception of FC, PC, KC, WC and VC containing peptides) two resolved peaks with identical masses were observed which were tentatively assigned as the N- and the S- peptide.5f Each crude sample was isolated in near quantitative yield so each peptide was used directly in MPA-mediated thioesterification reactions. The peptides were exposed to forcing conditions of 20% MPA at 80 °C for 24 h, anticipating that Gly-thioester formation should be near complete and Val-thioester formation negligible. Analysis of the reaction mixtures after 20 h revealed that, as expected, Gly-thioester formation had progressed to near completion whereas Pro and Val showed negligible, though detectable, thioester formation. Notably the Asp-Cys containing peptide had been completely consumed yet did not give rise to a thioester product, the major species corresponding to a product which had lost both the Asp and Cys residues. In contrast the Glu-Cys containing peptide (like the Ala-Cys, Ser-Cys, and Phe-Cys containing peptides) gave rise to an approximately 50 : 50 mix of starting material and product. The Ser-Cys containing peptide (both the starting material and the product) appeared to have further condensed with MPA under the reaction conditions, presumably on the primary hydroxyl group of serine, to afford an MPA ester. This modification of serine, along with Ser-MPA thioester formation, was negligable when the reaction was repeated at 60 °C for 16 h with the vast majority of the starting material remaining unchanged. Besides Val and Pro, Lys and Trp appeared to have performed least efficiently in thioester formation at 80 °C. From these results we concluded that, of the nine additional fragmentation junctions tested, only Gly-Cys junctions and His-Cys are likely to give rise to thioesters under more desirable conditions (temperatures ranging from 50 to 60 °C). The model experiments with test peptides also provided access to multimilligram quantities of products allowing 1H and 13C NMR analysis of the proposed thioester products. The 1H NMR spectrum of H-AENITTHC-NH2 and H-AENITTGC-NH2 before and after exposure to MPA clearly indicate the loss of cysteine, the incorporation of MPA (Fig. 2) along with the associated downfield shift in the resonance of the α-hydrogens of the C-terminal residue.8 Thioester formation was further supported by the appearance of a signal at 205 ppm, typical for the carbonyl carbon of a thioester, in the 13C NMR spectrum of H-AENITTHC-NH2 after treatment with MPA. 13C labelling studies are currently underway to further confirm thioester formation.
In summary, through our investigations employing CPE terminated peptides, we observed an interesting one-step thioesterification reaction in which peptide and protein samples containing GC, CC and HC junctions preferentially fragment at these positions in the presence of MPA.11 While fragmentation apparently occurs most readily at histidine the extent of racemisation needs detailed investigation though a short thioester, Ac-Ala-His-SCH2CH2CO2H derived from peptide Ac-Ala-(l-)His-Cys-NH2 upon exposure to 20% MPA for 48 h at 55 °C surprisingly exhibits undetectable racemisation when compared (by HPLC) with Ac-Ala-(d-)His-Cys-NH2 exposed to identical conditions.8 Optimisation of the reaction conditions and exploration of alternative experimental protocols, e.g. use of microwave irradiation, in combination with site-directed mutagenesis, may ultimately yield a direct, small molecule-mediated, method for the production of bacterially expressed protein thioesters to compliment the intein-fusion system.
The authors acknowledge financial support from The Wellcome Trust and the EPSRC.
Footnotes
References
- (a) Dawson P. E., Muir T. W., Clark-Lewis I., Kent S. B. Science. 1994;266(5186):776. doi: 10.1126/science.7973629. [DOI] [PubMed] [Google Scholar]; (b) Dawson P. E., Kent S. B. H. Annu. Rev. Biochem. 2000;69:923. doi: 10.1146/annurev.biochem.69.1.923. [DOI] [PubMed] [Google Scholar]; (c) Macmillan D. Angew. Chem., Int. Ed. 2006;45(46):7668. doi: 10.1002/anie.200602945. [DOI] [PubMed] [Google Scholar]; (d) Haase C., Seitz O. Angew. Chem., Int. Ed. 2008;47(9):1553. doi: 10.1002/anie.200704886. [DOI] [PubMed] [Google Scholar]
- (a) Muralidharan V., Muir T. W. Nat. Methods. 2006;3(6):429. doi: 10.1038/nmeth886. [DOI] [PubMed] [Google Scholar]; (b) Muir T. W., Sondhi D., Cole P. A. Proc. Natl. Acad. Sci. USA. 1998;95(12):6705. doi: 10.1073/pnas.95.12.6705. [DOI] [PMC free article] [PubMed] [Google Scholar]; (c) Noren C. J., Wang J. M., Perler F. B. Angew. Chem., Int. Ed. 2000;39(3):451. [PubMed] [Google Scholar]; (d) McGinty R. K., Kim J., Chatterjee C., Roeder R. G., Muir T. W. Nature. 2008;453(7196):812. doi: 10.1038/nature06906. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (a) Shin Y., Winans K. A., Backes B. J., Kent S. B. H., Ellman J. A., Bertozzi C. R. J. Am. Chem. Soc. 1999;121(50):11684. [Google Scholar]; (b) Ingenito R., Bianchi E., Fattori D., Pessi A. J. Am. Chem. Soc. 1999;121(49):11369. [Google Scholar]; (c) Swinnen D., Hilvert D. Org. Lett. 2000;2(16):2439. doi: 10.1021/ol0060836. [DOI] [PubMed] [Google Scholar]; (d) Hojo H., Haginoya E., Matsumoto Y., Nakahara Y., Nabeshima K., Toole B. P., Watanabe Y. Tetrahedron Lett. 2003;44(14):2961. [Google Scholar]; (e) Camarero J. A., Hackel B. J., Yoreo J. J. D., Mitchell A. R. J. Org. Chem. 2004;69(12):4145. doi: 10.1021/jo040140h. [DOI] [PubMed] [Google Scholar]; (f) Franziska Mende O. S. Angew. Chem., Int. Ed. 2007;46(24):4577. doi: 10.1002/anie.200700356. [DOI] [PubMed] [Google Scholar]; (g) Manabe S., Sugioka T., Ito Y. Tetrahedron Lett. 2007;48(5):849. [Google Scholar]; (h) Hogenauer T. J., Wang Q., Sanki A. K., Gammon A. J., Chu C. H. L., Kaneshiro C. M., Kajihara Y., Michael K. Org. Biomol. Chem. 2007;5:759. doi: 10.1039/b618442a. [DOI] [PubMed] [Google Scholar]; (i) Ficht S., Payne R. J., Guy R. T., Wong C.-H. Chem. Eur. J. 2008;14(12):3620. doi: 10.1002/chem.200701978. [DOI] [PubMed] [Google Scholar]; (j) Blanco-Canosa J. B., Dawson P. E. Angew. Chem., Int. Ed. 2008;47:6851. doi: 10.1002/anie.200705471. [DOI] [PMC free article] [PubMed] [Google Scholar]
- www.neb.com.
- (a) Ollivier N., Behr J.-B., El-Mahdi Q., Blanpain A., Melnyk O. Org. Lett. 2005;7(13):2647. doi: 10.1021/ol050776a. [DOI] [PubMed] [Google Scholar]; (b) Nagaike F., Onuma Y., Kanazawa C., Hojo H., Ueki A., Nakahara Y., Nakahara Y. Org. Lett. 2006;8(20):4465. doi: 10.1021/ol0616034. [DOI] [PubMed] [Google Scholar]; (c) Hojo H., Onuma Y., Akimoto Y., Nakahara Y., Nakahara Y. Tetrahedron Lett. 2007;48(1):25. [Google Scholar]; (d) Kawakami T., Aimoto S. Tetrahedron Lett. 2007;48(11):1903. [Google Scholar]; (e) Hojo H., Murasawa Y., Katayama H., Ohira T., Nakahara Y., Nakahara Y. Org. Biomol. Chem. 2008;6:1808. doi: 10.1039/b800884a. [DOI] [PubMed] [Google Scholar]; (f) Nakamura K., Sumida M., Kawakami T., Vorherr T., Aimoto S. Bull. Chem. Soc. Jpn. 2006;79:1773. [Google Scholar]; (g) Wu B., Chen J., Warren J. D., Chen G., Hua Z., Danishefsky S. J. Angew. Chem., Int. Ed. 2006;45(25):4116. doi: 10.1002/anie.200600538. [DOI] [PubMed] [Google Scholar]
- Although the reaction was not successful, when the CPE peptide was later re-prepared with the cysteine residue adjacent to glycine protected with an S-Acm group the desired CPE ligation product was observed and the Ala-MPA thioester could be isolated (after treatment of the CPE peptide with MPA).
- N→S acyl transfer has been observed in a synthetic peptide containing a Gly-Cys junction, in ref. 5f, upon exposure to 71% TFA in CDCl3 as part of a two-step thioesterification procedure. The glycine thioester so produced was not isolated or used in ligation reactions due to difficulties encountered as a consequence of the two-step procedure adopted, (the S-peptide spontaneously reverted to the N-peptide upon attempted isolation). Neither the potential selectivity of the thioesterification, nor application to recombinant samples was subsequently explored
- See ESI for experimental details
- Johnson E. C. B., Kent S. B. H. J. Am. Chem. Soc. 2006;128(20):6640. doi: 10.1021/ja058344i. [DOI] [PubMed] [Google Scholar]
- Hackeng T. M., Griffin J. H., Dawson P. E. Proc. Natl. Acad. Sci. USA. 1999;96(18):10068. doi: 10.1073/pnas.96.18.10068. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Zhang L., Tam J. P. Tetrahedron Lett. 1997;38:3. [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.