Abstract
The macrocyclization of linear peptides is very often accompanied by significant improvements in their stability and biological activity. Many strategies are available for their chemical macrocyclization, however, enzyme‐mediated methods remain of great interest in terms of synthetic utility. To date, known macrocyclization enzymes have been shown to be active on both peptide and protein substrates. Here we show that the macrocyclization enzyme of the cyanobactin family, PatGmac, is capable of macrocyclizing substrates with one, two, or three 1,4‐substituted 1,2,3‐triazole moieties. The introduction of non‐peptidic scaffolds into macrocycles is highly desirable in tuning the activity and physical properties of peptidic macrocycles. We have isolated and fully characterized nine non‐natural triazole‐containing cyclic peptides, a further ten molecules are also synthesized. PatGmac has now been shown to be an effective and versatile tool for the ring closure by peptide bond formation.
Keywords: biotransformation, cyanobactin, cyclic peptides, peptidomimetics, triazole
Macrocycles in general and peptidic macrocycles in particular have recently gained more attention since they have been identified as potential sources for novel therapeutics.1 These compounds, abundant in nature, are known to possess a wide range of biological activities.1, 2 They owe their attractiveness to having higher structural rigidity and better chemical stability, especially against hydrolysis and peptidase activity in the case of peptide‐based macrocycles.3 The increased molecular weight of these compounds is usually balanced by their increased specificity and selectivity towards the targets compared to “small” molecules.4 Patellamides are natural cyclic octapeptides that belong to the large family of cyanobactins.5 Their biosynthesis consists of the modification of a long precursor peptide via the action of different tailoring enzymes to achieve heterocyclization of cysteines, or threonines and serines to thiazoline or oxazoline, proteolysis, macrocyclisation, epimerization of certain amino acids, oxidation of the heterocycles, and in some cases prenylation.6
Although many natural peptidic macrocycles have some side chain modifications, (e.g. sugars, prenyl, lipid), it would appear that it is relatively rare in nature, to have both amino acids and non‐peptidic molecules forming the macrocyclic backbone. Nonetheless, the synthesis of such compounds would be greatly beneficial for complementary SAR studies on different targets.2b A restriction to the natural amino acid building blocks in these cyclic peptides limits the diversity and thus the chemical properties and activities that such molecules can possess. A partial solution to this problem comes from the use of non‐natural amino acids in biological systems in vivo or in vitro,7 which have already been demonstrated for peptidic macrocycles8 (Figure 1). Hybrid molecules have gained significant attention through the use of the so called stapled peptides,9 but in this case, the non‐peptidic module tends to be chosen to lock the peptide in a specific conformation rather than being a key component for activity.
Figure 1.
PatGmac macrocylized compounds. a) The natural compound, patellamide D. b–d) Examples of previously synthesized non‐natural cyclic compounds.8a–8c e) This study; n represents the number of triazole units in the sequence; the molecule number represents the position of the triazole (Tz) moiety/moieties in the core peptide (i.e. Cyclo(‐VTzGAGIGFP) is CP1). The highlighted heterocycle in grey is C‐terminal in the core peptide. The bold parts of the molecules represent the non‐natural modifications to the core peptide. Ahp: 7‐aminoheptanoic acid. (Thz): thiazoline heterocycle.
The ring closure of peptidic macrocycles can be achieved either chemically2b, 10 or enzymatically with a number of distinct enzymes known including butelase,11 POPB,12 sortase,13 recombinant asparaginyl endopeptidase (AEP),14 thioesterase,15 and the macrocyclase enzyme (PatGmac) from the patellamide family.8a–8c, 16 PatGmac recognizes a C‐terminal AYD(G) motif located outside the core of the precursor peptide substrate and cleaves it to form a water shielded acyl‐enzyme intermediate using its catalytic Asp‐His‐Ser triad.16 The intermediate is then attacked by the free N‐terminal amine of the core peptide. The substrate must have a thiazoline, oxazoline or a proline immediately N‐terminal to the (cleaved off) AYD(G)16 motif with no other restrictions on the rest of the core peptide. This very broad substrate range has made PatGmac an attractive tool for biotransformation (Figure 1). We decided to test its ability to macrocyclize substrates in which non‐peptidic motifs are included in the sequence. We have chosen to investigate the incorporation of 1,4‐substituted 1,2,3‐triazole17 groups into macrocycles.18 The triazole ring can act as an isostere for a trans‐amide bond and its synthesis is facile.19 Further the triazole moiety has been incorporated in many biologically active compounds as it is thermodynamically and physiologically stable.20 We reasoned therefore this would be a suitable test of the versatility of PatGmac by systematically incorporating the triazole unit (Tz) at different positions of the precursor peptide while varying their number as well. We report herein the successful formation (catalyzed by PatGmac) of macrocycles containing one, two, and three 1,2,3‐triazoles.21
A series of 19 different linear precursor peptides P based on the core sequence VGAGIGFP were prepared by standard solid phase peptide synthesis using the Fmoc strategy (see Supporting Information, SI). The linear peptides VGAGIGFPAYD‐NH2 consisted of the core sequence modified with one, two, or three non‐consecutive triazoles and the PatGmac recognition motif AYD with an N‐terminal amide. The triazoles were generated directly on solid phase using a copper(I) catalyzed azide‐alkyne cycloaddition (CuAAC).18b Azido acids 1 a,b and 1 d were synthesized in one step by a diazo transfer reaction22 (Scheme S3).23 Fmoc protected amino alkynes 2 a–c were synthesized in five steps starting from the commercial Boc protected amino acid. First, the Weinreb amides were synthesized by a standard coupling reaction. The use of DIBAL‐H for the amide reduction in a one pot procedure with the Bestmann–Ohira reagent24 in our hands was found to yield very little conversion to the aldehyde. Separating the two steps with the reduction to aldehyde using LiAlH4 followed by the Bestmann‐Ohira reagent, successfully afforded the Boc protected amino alkynes. A final deprotection and protection step afforded the desired alkynes 2 a–c in moderate yields overall (Scheme S4). The N‐terminal azide was attached on the growing peptide by a standard coupling reaction then was subjected to the CuAAC reaction with the corresponding alkyne counterpart 2 on solid phase (Scheme 1; Scheme S1). Final precursor peptides P1‐6 have a single triazole (Tz) at a different position of the core sequence25 (the number denoting the position), precursors P13‐16, P24‐26, P35‐36 and P46 have two triazoles (i.e. at positions 1 and 3 of the peptide for P13; VTzGATzGIGFPAYD), and P135 and P246 have three triazoles (Table S1; Table S2). P0 has no triazoles and was synthesized to serve as a control for the macrocyclization reaction.
Scheme 1.
Synthesis of triazole‐containing cyclic peptides.
All precursor peptides P were subjected to the macrocylization reaction conditions in the presence of PatGmac in bicine buffer at pH 8.1 and 37 °C. The reaction progress was monitored by MALDI‐MS and some of the resulting spectra are shown in Figure 2.
Figure 2.
MALDI‐MS traces of the macrocyclization reactions of representative sequences with a) control peptide; VGAGIGFP, 0 triazole, 20 days; b) VTzGAGIGFP, 1 triazole, 21 days; c) VTzGATzGIGFP, 2 triazoles, 23 days; d) VTzGATzGITzGFP, 3 triazoles, 25 days.
All 19 precursor peptides converted to the desired macrocyclic structure and no significant linear product (AYD cleavage product) was detected in the reaction mixture. We observed that two triazole‐containing substrates were slower to macrocyclize than single triazole‐containing peptides and that both three triazole peptides were the slowest to convert. Nonetheless, because PatGmac is known to be a slow enzyme, the rate difference is insignificant (reactions were monitored 20 to 25 days for full conversion on large scale).16
Nine of the precursor peptides, P0, P1, P2, P3, P4, P5, P6, P13 and P135, were reacted at a larger scale (8 µmol, 9–10 mg) in the presence of PatGmac under the previously stated conditions in order to provide sufficient amounts of the macrocyclic product for full characterization. Cyclic peptides CP were extracted and HPLC purified to afford moderate yields of high purity samples. Results are summarized in Tables 1 and S3 (see Scheme 1 or SI, section I for structures). For some of these (CP3, CP13, CP135), two peaks on the HPLC had the expected mass of the cyclic peptide in the crude reaction mixture, as can be seen from the ratio column in Table 1 (SI, section VIII). When applicable, both peaks were collected and analyzed by HRMS and MS‐MS fragmentation patterns and both were determined to be the desired cyclic peptide (Table S3). However, the NMR experiments were only conducted for the major isolated compound.
Table 1.
PatGmac macrocylization reaction products.
Peptide | Core peptide [a] | Ratio [%][b] | Yield [%][c] | Purity [%][d] |
---|---|---|---|---|
CP0 | VGAGIGFP | 100 | 32 | 99 |
CP1 | VTzGAGIGFP | 100 | 43 | 99 |
CP2 | VGTzAGIGFP | 100 | 48 | 91 |
CP3 | VGATzGIGFP | 76/24 | 55 | 99 |
CP4 | VGAGTzIGFP | 100 | 34 | 98 |
CP5 | VGAGITzGFP | 100 | 58 | 93 |
CP6 | VGAGIGTzFP | 100 | 40 | 98 |
CP13 | VTzGATzGIGFP | 20/80 | 54 | 98 |
CP135 | VTzGATzGITzGFP | 15/85 | 39 | 97 |
[a] Tz=triazole replacing the amide bond. [b] Ratio of cyclic products in crude mixture (SI, section VIII). [c] Total yield. [d] Purity of major product determined by HPLC, UV absorption at 220 nm (SI, section IX).
The MS‐MS fragmentation data of the cyclic peptides gave a very complex series of ions consistent with their peptidic sequence, the newly formed valine to proline bond (macrocycle formation), and the triazole presence (SI, section VII).
NMR experiments were also conducted in order to further confirm their structures. The proton NMR data of the cyclic peptides showed at least two species present at different ratios depending on the compound. For CP0, CP1, CP2, CP3 CP4, CP6 and CP13 this was shown by Exchange Spectroscopy (EXSY) experiments to result from different conformers (SI, section VI). However, compound CP5 was found to have two distinct compounds in a 1:0.6 ratio. The NMR assignments of both species show they have the same peptidic sequence (cyclic VGAGITzGFP). Both HRMS and MS‐MS fragmentation data confirm the presence of one mass/one sequence. This led us to believe that the two species are most likely epimers but we cannot exclude structurally rigid conformers unable to interconvert.26
CP135 has a very complex proton and 2D NMR spectra that showed at least four different species (with two other minor species). General assignment has been achieved for most proton peaks (SI, section V) however, the individual sequence for each was not determined. EXSY showed that two of them are related conformers and the other two are distinct compounds. Similarly to compound CP5, these species would most probably arise from epimers or from conformationally rigid variants.
Biotransformation is now widely used in organic chemistry for the synthesis of biologically relevant compounds. The PatGmac enzyme is unusual in that it operates on substrates and disposes of almost the entire recognition site during the reaction. PatGmac macrocyclizes a wide range of natural peptidic substrates and tolerates non‐natural amino acids.8a,8b From a chemical diversity point of view, the ability to mix amino acid residues and non‐amino acid scaffolds within the backbone of macrocyles is highly desirable. We have now shown that it is possible to synthesize macrocycles with one, two, or three triazoles within a peptidic sequence. Thus PatGmac is able to process peptide hybrid substrates extending its utility. The ability to macrocyclise such hybrid molecules highlights the scope for creating highly diverse libraries through split and pool approaches.
Supporting information
As a service to our authors and readers, this journal provides supporting information supplied by the authors. Such materials are peer reviewed and may be re‐organized for online delivery, but are not copy‐edited or typeset. Technical support issues arising from supporting information (other than missing files) should be addressed to the authors.
Supplementary
Acknowledgements
This work was supported by the European Research Council (339367), UK Biotechnology and Biological Sciences Research Council (K015508/1), The Wellcome Trust (TripleTOF 5600 mass spectrometer (094476), the MALDI TOF‐TOF Analyser (079272AIA), 700 NMR) and the EPSRC UK National Mass Spectrometry Facility at Swansea University. J.H.N. is a Royal Society Wolfson Merit Award Holder and 1000 talent scholar at Sichuan University.
E. Oueis, M. Jaspars, N. J. Westwood, J. H. Naismith, Angew. Chem. Int. Ed. 2016, 55, 5842.
References
- 1.
- 1a. Driggers E. M., Hale S. P., Lee J., Terrett N. K., Nat. Rev. Drug Discovery 2008, 7, 608; [DOI] [PubMed] [Google Scholar]
- 1b. Newman D. J., Cragg G. M. in Macrocycles in Drug Discovery (Eds.: J. R. Morphy, C. J. Harris), The Royal Society of Chemistry, London, 2015, pp. 1–36. [Google Scholar]
- 2.
- 2a. Bockus A. T., McEwen C. M., Lokey R. S., Curr. Top. Med. Chem. 2013, 13, 821; [DOI] [PubMed] [Google Scholar]
- 2b. Yu X., Sun D., Molecules 2013, 18, 6230. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3. Joo S. H., Biomol. Ther. 2012, 20, 19. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4. Heinis C., Nat. Chem. Biol. 2014, 10, 696. [DOI] [PubMed] [Google Scholar]
- 5. Jaspars M., Chem. Commun. 2014, 50, 10174–10176. [DOI] [PubMed] [Google Scholar]
- 6.
- 6a. Koehnke J., Bent A. F., Houssen W. E., Mann G., Jaspars M., Naismith J. H., Curr. Opin. Struct. Biol. 2014, 29, 112–121; [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6b. Schmidt E. W., Nelson J. T., Rasko D. A., Sudek S., Eisen J. A., Haygood M. G., Ravel J., Proc. Natl. Acad. Sci. USA 2005, 102, 7315–7320. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7. Young T. S., Schultz P. G., J. Biol. Chem. 2010, 285, 11039–11044. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.
- 8a. McIntosh J. A., Robertson C. R., Agarwal V., Nair S. K., Bulaj G. W., Schmidt E. W., J. Am. Chem. Soc. 2010, 132, 15499–15501; [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8b. Oueis E., Adamson C., Mann G., Ludewig H., Redpath P., Migaud M., Westwood N. J., Naismith J. H., ChemBioChem 2015, 16, 2646–2650; [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8c. Sardar D., Lin Z., Schmidt E. W., Chem. Biol. 2015, 22, 907–916; [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8d. Kawakami T., Ohta A., Ohuchi M., Ashigai H., Murakami H., Suga H., Nat. Chem. Biol. 2009, 5, 888–890. [DOI] [PubMed] [Google Scholar]
- 9. Yudin A. K., Chem. Sci. 2015, 6, 30–49. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10. White C. J., Yudin A. K., Nat. Chem. 2011, 3, 509–524. [DOI] [PubMed] [Google Scholar]
- 11. Nguyen G. K. T., Kam A., Loo S., Jansson A. E., Pan L. X., Tam J. P., J. Am. Chem. Soc. 2015, 137, 15398–15401. [DOI] [PubMed] [Google Scholar]
- 12. Luo H., Hong S.-Y., Sgambelluri R. M., Angelos E., Li X., Walton J. D., Chem. Biol. 2014, 21, 1610–1617. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13. van't Hof W., Silvie H. M., Enno C. I. V., Jan G. M. B., Biol. Chem. 2015, 396, 283–293. [DOI] [PubMed] [Google Scholar]
- 14. Harris K. S., Durek T., Kaas Q., Poth A. G., Gilding E. K., Conlan B. F., Saska I., Daly N. L., van der Weerden N. L., Craik D. J., Anderson M. A., Nat. Commun. 2015, 6, 6370. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15. Kohli R. M., Trauger J. W., Schwarzer D., Marahiel M. A., Walsh C. T., Biochemistry 2001, 40, 7099–7108. [DOI] [PubMed] [Google Scholar]
- 16. Koehnke J., Bent A., Houssen W. E., Zollman D., Morawitz F., Shirran S., Vendome J., Nneoyiegbe A. F., Trembleau L., Botting C. H., Smith M. C. M., Jaspars M., Naismith J. H., Nat. Struct. Mol. Biol. 2012, 19, 767–772. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17.In the rest of the paper, the 1,4-substituted 1,2,3-triazole will be referred to as triazole.
- 18.
- 18a. Tornøe C. W., Christensen C., Meldal M., J. Org. Chem. 2002, 67, 3057–3064; [DOI] [PubMed] [Google Scholar]
- 18b. Meldal M., Tornøe C. W., Chem. Rev. 2008, 108, 2952–3015. [DOI] [PubMed] [Google Scholar]
- 19. Tron G. C., Pirali T., Billington R. A., Canonico P. L., Sorba G., Genazzani A. A., Med. Res. Rev. 2008, 28, 278–308. [DOI] [PubMed] [Google Scholar]
- 20. Pibiri I., Buscemi S., Curr. Bioact. Compd. 2010, 6, 208. [Google Scholar]
- 21.The replacement of an amide bond by a triazole is represented by a “Tz” between the corresponding two amino acids in the sequence.
- 22. Goddard-Borger E. D., Stick R. V., Org. Lett. 2007, 9, 3797. [DOI] [PubMed] [Google Scholar]
- 23.Reagant 1 e is commercially available. Reagant 2 e is easily prepared from commercial propargylamine.
- 24. Dickson H. D., Smith S. C., Hinkle K. W., Tetrahedron Lett. 2004, 45, 5597–5599. [Google Scholar]
- 25.The molecule number represents the position of the triazole (Tz) moiety/moieties in the core peptide (VTzG is position 1, GTzF is position 6).
- 26. Tabudravu J. N., Jaspars M., Morris L. A., Kettenes-van den Bosch J. J., Smith N., J. Org. Chem. 2002, 67, 8593–8601. [DOI] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
As a service to our authors and readers, this journal provides supporting information supplied by the authors. Such materials are peer reviewed and may be re‐organized for online delivery, but are not copy‐edited or typeset. Technical support issues arising from supporting information (other than missing files) should be addressed to the authors.
Supplementary