Skip to main content
ACS Medicinal Chemistry Letters logoLink to ACS Medicinal Chemistry Letters
. 2023 Apr 24;14(5):557–565. doi: 10.1021/acsmedchemlett.3c00037

Beyond 20 in the 21st Century: Prospects and Challenges of Non-canonical Amino Acids in Peptide Drug Discovery

Jennifer L Hickey , Dan Sindhikara , Susan L Zultanski §, Danielle M Schultz §,*
PMCID: PMC10184154  PMID: 37197469

Abstract

graphic file with name ml3c00037_0006.jpg

Life is constructed primarily using a toolbox of 20 canonical amino acids—relying upon these building blocks for the assembly of proteins and peptides that regulate nearly every cellular task, including cell structure, function, and maintenance. While Nature continues to be a source of inspiration for drug discovery, medicinal chemists are not beholden to only 20 canonical amino acids and have begun to explore non-canonical amino acids (ncAAs) for the construction of designer peptides with improved drug-like properties. However, as our toolbox of ncAAs expands, drug hunters are encountering new challenges in approaching the iterative peptide design–make–test–analyze cycle with a seemingly boundless set of building blocks. This Microperspective focuses on new technologies that are accelerating ncAA interrogation in peptide drug discovery (including HELM notation, late-stage functionalization, and biocatalysis) while shedding light on areas where further investment could not only accelerate the discovery of new medicines but also improve downstream development.

Keywords: Peptides, late-stage functionalization, non-canonical amino acids, drug discovery, biocatalysis, in silico prediction


Peptides have been referred to as the “Goldilocks” chemical modality due to their intermediate size which combines favorable attributes of both small molecules and biologics, such as high target specificity, absence of off-target effects, and differentiated pharmacokinetic profiles. Yet only modest gains have been observed in FDA approvals despite a rich legacy of peptides impacting human health for over 100 years, from insulin to vasopressin and, most recently, tirzepatide.1,2 One reason for this discrepancy is that the majority of approved peptide drugs are inspired by native peptide structures and are linear in nature with high canonical amino acid (AA) content, resulting in poor gastrointestinal stability and low permeability/bioavailability that often restricts the route of administration to intravenous or subcutaneous. However, a recent Perspective in ACS Med. Chem. Lett. from Blanco et al.3 highlighted the creativity and innovation of “new generation” macrocyclic peptides (MPs, 5–14mers) that are showing great promise in clinical studies owing to their improved biopharmaceutical properties, thereby enabling modulation of protein–protein interactions (PPIs), once considered undruggable.4 While macrocyclization addresses some of the liabilities associated with linear peptides, expanding to the vast space beyond canonical AAs provides a means to selectively access both intra- and extracellular targets while fine-tuning a range of properties such as solubility, metabolic stability, and importantly, oral bioavailability.5

The inspiration for applying non-canonical amino acids (ncAAs) to macrocyclic peptide drug discovery has origins in peptide natural products, specifically non-ribosomal peptides (NRPs) and ribosomally synthesized and post-translationally modified peptides (RiPPs).6 Notable examples include cyclosporin A (an NRP) and, most recently, darobactin A (a RiPP),7 where biological activity can be attributed to multifaceted, unique AA architectures (Figure 1A).8 While ncAAs are established in small molecule and peptidomimetic drug discovery,9 not until recently have advances in affinity-based hit-finding platforms, such as mRNA display, provided a means to generate >1012 unique de novo ncAA-containing macrocyclic peptides (ncAA-MPs) in a single round of screening.10 Pioneering efforts by the Suga and Szostak laboratories have enabled ncAA incorporation into mRNA display screening libraries;11,12 however, limitations on codon table modifications remain, providing rich opportunities for drug hunters to further optimize the hits to lead-like chemical matter. As one can imagine, the chemical space of ncAAs for medicinal chemists to explore can be limitless; nevertheless, inspiration from NRPs and RiPPs, such as cyclosporin A, darobactin A, and tryglysin A, have provided several common strategies to design and optimize peptidic structures (Figure 1B). For instance, d-configured building blocks, substitution at the N-, α-, or β-centers, or simply changing the side chain functionality can greatly enhance chemical diversity while improving the pharmaceutical properties.1315 It should be noted that other modifications, such as cross-links or amide bond surrogates, can also add structural diversity by influencing the resulting three-dimensional conformation.16 Recently, macrocyclic peptides with origins in mRNA display have emerged from such institutions as Chugai and Merck, leveraging the engineered introduction of ncAAs for accessing either intra- or extracellular targets, respectively (Figure 1C). For Chugai, their first clinical macrocyclic peptide for intracellular rat sarcoma virus (RAS) inhibition was identified through incorporation of multiple N-substituted ncAAs, reducing the polar surface area and improving membrane permeability.17 In turn, Merck recently disclosed the structure of MK-0616, an oral proprotein convertase subtilisin/kexin type 9 (PCSK9) inhibitor, which in a recent Phase 2b study, significantly reduced LDL-C in patients with hypercholesterolemia.18 Potency and protease stability were obtained through introduction of a fluorinated tryptophan, d-Ala (AA2), α-Me-Pro (AA8), and cross-linking, ultimately yielding an oral MP inhibitor at a fraction of the size of current monoclonal antibody injectables.

Figure 1.

Figure 1

Nature serving as inspiration for peptide drug discovery (A), with ncAA classes from NRPs and RiPPs now appearing in promising peptide candidates with improved pharmaceutical properties (B, C).

Several promising macrocyclic peptides are on track to be the first approved drugs derived from mRNA display,19 resulting in a renewed focus on understanding the roles and impacts of ncAAs in drug discovery and development.5 However, the “peptide toolbox” for drug discovery is still being defined, and its understanding remains extremely limited compared to that of small molecules and biologics which continue to dominate FDA approvals.20 For instance, both informatics and prediction software tend to focus primarily on either small molecule or biologics discovery, largely stemming from the historical lack of interest in peptides, coupled with their topological complexity and shortage of relevant training data for machine learning. And while ncAAs can be sourced commercially for initial structure–activity relationship (SAR) exploration, there are gaps in structural diversity, and availability becomes increasingly limited on larger scales, negatively impacting early development timelines. Moreover, current synthetic technologies, such as solid-phase peptide synthesis (SPPS), have historically employed excess AAs to improve synthetic efficiency, which exacerbates the cost of goods (COG) for ncAA-rich peptides. Thus, to seize the opportunity that peptides, both linear and cyclic, can bring to human health, we need to capitalize on ncAAs in peptide drug discovery, investing in informatics and synthesis to realize the full potential of this highly versatile modality.

Informatics and Prediction for ncAAs

Computational analysis and prediction are key aspects of drug discovery. While a wealth of tools exist for both small molecule and biologic modalities, ncAA-MPs do not necessarily fit into either category and thus require significant adaptations. For example, regarding textual representation, small molecules typically use simplified molecular-input line-entry system (SMILES) strings which use atoms and bonds as the primary units, whereas biologics are represented as a sequence of canonical AAs written in the standard single-letter codes (typically FAST-All (FASTA) format). Neither approach works well for ncAA-MPs, since the SMILES strings become far too long and complex, while FASTA single-letter AA codes only allow for linear structural representations using the standard 20 canonical AAs (Figure 2). However, the flexible hierarchical editing language for macromolecules (HELM) has proven to be an excellent fit.21 HELM can represent diverse ncAAs as simple, human-legible text symbols (analogous to canonical AA single-letter codes) as well as cross-linking or cyclization architectures. This simplified monomeric representation can reduce assembly errors commonly arising from manually misdrawing these large biomolecules that contain many atoms, stereocenters, and cross-links. Additionally, the monomeric representation is a convenient model for SAR analysis and representation. Despite these advances, there remains a need for further standardization of HELM in both symbol naming and processing.

Figure 2.

Figure 2

Textual representations for various chemical modalities. ncAA-MPs (center) are well-represented using HELM which concisely depicts ncAAs as well as cross-linking and cyclization architectures.

Sequence alignment is fundamental for protein analysis, as positional assignment is necessary to study SARs; however, performing these alignments is a non-trivial task for ncAA-MPs. For example, the similarity matrices, which are required for performing alignments, typically use well-established 20×20 monomer–monomer similarity matrices, such as BLOSUM (blocks substitution matrix) scores.22 However, for ncAAs, where the total count is limitless, the matrices become not only much larger but also highly dynamic. To address this issue, Merck has developed a method called Peptide Sequence Alignment (PepSeA) for ncAA-MPs using a dynamic monomer similarity matrix (Figure 3).23 This has enabled downstream peptide SAR analysis using alignment and visualization tools along with sequence-based descriptors for machine learning. Sequence-based analysis is particularly useful to directly interpret the impact of single AA mutations through direct linear coefficients, simple generative estimators such as Free-Wilson analyses,24,25 or alternatively, more complex machine-learning algorithms. One might conjecture that ncAA-MP sequence alignment could enable structure prediction, such as what is done in AlphaFold2 or RoseTTAFold,26,27 but the dearth of molecules topologically similar to ncAA-MPs in the Protein Data Bank prohibits practical training and deployment of such deep-learning models at this time. Thus, structure determination is relegated primarily to experimental techniques or less reliable physics-based predictors.2830

Figure 3.

Figure 3

Monomer similarity matrix using atom/distance pairs with chirality (APC/DPC) descriptors.31,32 Subset of a large similarity matrix, which must be generated dynamically, to perform sequence alignments for ncAA-MPs. For example, dW is more likely to align to W (0.85 similarity) than to F (0.55).

Physics-based modeling of peptides is necessary for predicting properties of novel ncAA-MPs which lack sufficient experimental data to fuel machine learning. For typical physics-based models using atomic representation, molecules are parameterized distinctly as either proteins or small molecules, by simple heuristics but which may not natively handle ncAAs. In silico residue scans consisting of three-dimensional conformational searches using molecular force fields typically handle side chain substitutions well but struggle with backbone modifications. Molecular dynamics (MD)-based free energy calculations (free energy perturbation, thermodynamic integration, etc.) are optimized for minimal changes in framework—either small molecules with limited cores or proteins with minimally moving backbones; however, the conformationally flexible backbones of peptides and macrocycles can be highly non-trivial to predict with these relatively short simulation timescales. Potentially, this problem can be partially addressed by combining the binding enthalpy estimates with separate conformational preference estimates by utilizing solution-state MD simulations (vanilla MD or enhanced sampling to study the conformational behavior).33,34 Still, there remains a need for robust unified physics-based binding free energy solutions for peptides.

These adaptations of biologic and small molecule informatics utilities, along with other custom informatics tools built for ncAA-MPs, are necessary to properly leverage the wealth of data generated by peptide drug discovery teams. However, peptide tools are often less mature; therefore, opportunities to maximize the unique aspects of informatics within this space are abundant.

Advances in Peptide Medicinal Chemistry: Leveraging Late-Stage Functionalization for ncAA Incorporation

The incorporation of ncAAs into peptides is typically approached via two methodologies: the first is through traditional SPPS, and the second is through late-stage functionalization (LSF) of fully assembled sequences. Fluorenylmethyloxycarbonyl (Fmoc)-based SPPS requires either commercial availability or preliminary synthesis of discrete Fmoc-protected AAs, which can be readily integrated through standard amide bond-forming protocols. However, commercial availability of diverse, readily accessible ncAAs is limited and often exorbitantly expensive. For instance, if we consider Fmoc-protected phenylalanine analogs, at the time of this Microperspective, there were ∼250 l-configured building blocks, ∼150 d-configured analogs, and ∼40 compounds containing homologated aryl side chains commercially available, with the number of available AAs inversely proportional to the price.35 In addition to accessing diverse ncAAs for SAR investigations, as medicinal chemistry teams look to scale-up promising peptide leads for safety and pharmacokinetics studies, securing bulk quantities of Fmoc-ncAAs can be a non-trivial, costly, and time-consuming endeavor.

Alternately, a ncAA bearing a reactive handle can be incorporated during linear chain assembly, serving as a point of further diversification through LSF (Figure 4). An established technology used in small molecule drug discovery, LSF enables rapid and extensive chemical/structural diversity to be accessed from a single precursor molecule.36 For peptide drug discovery, the advantages of pursuing LSF are even greater, as the precursor peptide can typically be obtained from commercially available building blocks. However, the biggest challenge remains in developing chemo-, regio-, and stereoselective LSF chemistries compatible with peptides and larger biopolymers both on-resin and in solution.

Figure 4.

Figure 4

Peptide LSF protocols to access ncAAs through diversification of aryl side chains (A, B) and installation of de novo functionality (C, D).

While native chemical ligation along with bioconjugation and cross-linking methodologies (click, ring-closing metathesis, etc.) have gained widespread adoption within the peptide community,5 LSF on peptides is still an emerging field. Recent developments highlight both on-resin and in solution methodologies and can be split into two general categories: those that incorporate a distal decoration on an existing AA side chain and those that allow for installation of de novo functionality in a customizable, modular fashion (Figure 4). Although the diversity of contributions in this space are vast, we will focus on highlighting a selection of representative LSF technologies reported over the past 12–24 months in this Microperspective. In 2022, Merck described the late-stage diversification of solid-phase-supported peptides containing p-bromophenylalanine residues using Suzuki–Miyaura cross-coupling chemistry (Figure 4A). This work nicely illustrated how leveraging the power of high-throughput experimentation (HTE) within SPPS can enable rapid access to multidimensional peptide libraries with expanded chemical space through distal modifications on aryl side chains.37 Similarly, the Ackermann lab expanded the tryptophan LSF toolbox through a manganese-catalyzed C–H hydroarylation at the indole C2 position, which provided access to both ligation and macrocyclic peptide products (Figure 4B).38 Lastly, Mallek et al. have described a palladium-mediated, chemoselective N-arylation of p-aminophenylalanine on a fully deprotected peptide in the presence of other aliphatic amino groups.39

As we broaden our field of view and expand beyond chemistries that modify existing aryl AAs, progress has also been made in synthetic methodologies that allow access to diverse, de novo side chain functionalities. For instance, the Thomas laboratory reported on a nucleophilic substitution reaction via on-resin iodination of homoserine, followed by subsequent functionalization with aliphatic and aromatic thiols, pyridines, and carboxylic acids, showing great promise for HTE and combinatorial approaches.40 More recently, the Xu lab described a visible-light-mediated deaminative hydroalkylation on a series of alkenes, including dehydroalanine (Dha), providing an efficient way to prepare β-alkyl-substituted ncAAs either as discrete building blocks or within an elaborated peptide under mild and catalyst-free conditions (Figure 4C).41 Common to many Dha modifications for ncAA synthesis is the lack of stereocontrol which can cause complex mixtures of epimers; however, new reports have shown that chiral catalysts can invoke higher stereoselectivity.42,43

Recently, alkyl Katritzky salts have emerged as stable and versatile radical precursors, providing an alternative way to access diverse and customizable side chain functionality in peptides.44 Conversion of lysine, and related AAs with primary amine-containing side chains, into the corresponding pyridinium salts involves straightforward, high-yielding, and robust chemistry. In addition, the resulting Fmoc-protected building blocks are often resistant to SN2 and epimerization reactions and can be seamlessly incorporated into peptides through standard SPPS and trifluoroacetic acid (TFA) deprotection workflows. Openy et al. showcased the first example of utilizing a side chain Katritzky salt for on-resin peptide LSF, accessing diverse ncAAs via a photochemical Giese reaction using commercially available Michael acceptors.45 With line of sight to readily transform abundant canonical AA feedstocks into unique ncAAs, an academic–industrial collaboration between Merck and the Watson group at the University of Delaware recently reported a cross-electrophile coupling between lysine-derived Katritzky salts and aryl halides to access aryl alanines and homologated derivatives (Figure 4D).46 Through this work, facile access to LSF substrates was achieved, ultimately providing diverse, boutique side chain functionality on fully elaborated peptidic structures.

As investment in this space continues, we believe the next wave of peptide LSF chemistries should not only strive to identify unique reactive handles but also leverage canonical AA side chain functionality to access complementary reactivity. One could envision an aspirational future state where the rapid buildup of complexity occurs from a single parent peptide in a multidimensional format from distinct side chain vectors to produce large peptide libraries of vast chemical diversity. Ultimately, these LSF chemistries should allow us to continue to mimic and expand upon the structural features and functional group diversity, both aryl and alkyl, that we observe in NRPs/RiPPs.

Prospects for ncAA Scale-Up: Streamlined Biocatalytic Processes to Access ncAA Classes of Interest

In comparison to conventional small molecules, peptides typically possess higher production costs due to their larger size (>1000 Da) and longer synthetic sequences. To this end, many improvements have been documented for peptide synthesis on a manufacturing scale, with SPPS remaining the most common method, where excess AAs are usually required.47 For ncAA-containing peptides, the cost of synthesizing each ncAA must also be considered, as delivering excess of synthetically complex ncAAs could significantly impact overall production costs. While there are efficient chemical processes to access some ncAAs, enzymatic processes are quintessential because they proceed with high stereocontrol and obviate protecting group manipulations, culminating in reduced step counts. Frances Arnold and co-workers recently shared a perspective on this topic, bringing attention to new biocatalytic methods for accessing ncAAs, including β-AAs, substituted prolines, and α-deuterated AAs.48 Herein, we highlight recent biocatalytic transformations and remaining opportunities toward accessing β-Me-, β-OH-, and α-Me-branched α-AAs (Figure 5). These represent several examples of ncAA classes which are known to stabilize peptides to protease-induced amide hydrolysis, a particularly challenging barrier to overcome for the development of oral drugs. At the same time, they maintain a relatively small profile, which helps to minimally impact linear peptide chain assembly. These ncAA classes can also improve potency, conformational rigidity, and other drug-like properties.

Figure 5.

Figure 5

Established routes and existing opportunities toward β-Me-, β-OH-, and α-Me-substituted AAs of interest.

In addition to having the capacity to confer peptide stability, β-Me-branched α-AAs are of interest as the presence of the β-Me group can increase binding affinity due to an entropic gain from restricted bond rotation between the α-carbon and the β-carbon.15 Highly stereoselective routes to access both (2S,3S) and (2S,3R) β-Me-branched diastereomers are sought, as the optimal diastereomer is case-dependent. Indole-substituted β-Me-tryptophans constitute one subclass, and engineered variants of tryptophan synthase (e.g., PfTrpB 7E6) coupled with pyridoxal phosphate (PLP) cofactor have been reported to deliver diverse (2S,3S) β-Me-tryptophan analogs in one step from threonine and the requisite indole (Figure 5A).49 Currently only a single example of accessing the complementary (2S,3R) β-Me-tryptophan exists, which proceeds through a PLP-dependent cascade from tryptophan and methyl iodide via α-AA transaminase (TA), α-keto acid methyltransferase (MT), halide methyltransferase (HMT), and S-adenosylhomocysteine (SAH).50 Considering aromatic α-AAs beyond tryptophan, the Renata lab has recently engineered PLP-dependent aromatic AA aminotransferases (ArATs) to access (2S,3R) β-Me-arylalanines from ketoacids with broad scope (Figure 5B).51 A complementary enzymatic process to access (2S,3S) β-Me-arylalanines would be desirable yet remains elusive. For aliphatic AAs, (2S,3R) β-Me aliphatic AAs (Figure 5C) can be accessed by the aforementioned enzymatic cascade used to make (2S,3R) β-Me-tryptophan;48 however, an enzymatic process to access (2S,3S) β-Me aliphatic AAs has yet to be reported.

While β-OH-branched α-AAs can impart benefits akin to the β-Me group, the hydroxy group can also participate in favorable intra- or intermolecular hydrogen-bonding interactions. PLP-dependent threonine aldolases (L-TA), threonine transaldolases (L-TTA), and serine hydroxymethyl transferases (SHMT) have each been utilized to access diverse β-OH aryl- and alkylalanines (Figure 5D).52 However, these reactions typically produce mixtures of (2S,3S) and (2S,3R) diastereomers. There have been recent engineering efforts to develop diastereoselective processes,53 yet many opportunities remain to access single diastereomers in high diastereomeric excess and conversion, a necessity for manufacturing processes. On a final note, α-Me-branched α-AAs can also improve peptide stability while potentially influencing peptide secondary structure due to the location of the α-Me group on the peptide backbone. Biocatalytic processes to access this subclass would therefore be of value but are currently elusive.

Conclusion

As a scientific community, our understanding of how best to leverage ncAAs to access a seemingly endless source of peptide chemical diversity is still in its infancy. Computational tools can be employed to accelerate the design of ncAA-containing peptides, but significant adaptation and innovation are required in terms of both informatics and physics-based modeling. However, with investment in novel synthetic methodologies and manufacturing platforms for both ncAAs and peptides, we aspire to a future state in which accessing chemical diversity is not rate-limiting and peptide scale-ups proceed with costs and timelines akin to small molecules. Consequently, we must come together to truly capitalize on the opportunities that ncAA-rich peptides present to human health, utilizing industrial consortia and academic–industrial collaborations to address the challenges outlined in this Microperspective. Only time will tell how influential ncAAs will be in the discovery of new peptide therapeutics; nevertheless, we look forward to playing an active role on this exciting journey, with the ultimate goal of positively impacting human health well beyond the 21st century.

Acknowledgments

We would like to thank Kaustav Biswas, Nicolas Boyer, L.-C. Campeau, Jennifer Johnston, Chris Plummer, and Ben Sherry for productive discussions and comments.

Glossary

Abbreviations

AA

amino acid

APC/DPC

atom/distance pairs with chirality

ArATs

aromatic AA aminotransferases

BLOSUM

blocks substitution matrix

COG

cost of goods

Dha

dehydroalanine

FASTA

FAST-All

Fmoc

fluorenylmethyloxycarbonyl

HELM

hierarchical editing language for macromolecules

HMT

halide methyltransferase

HTE

high-throughput experimentation

LSF

late-stage functionalization

L-TA

l-threonine aldolase

L-TTA

l-threonine transaldolase

MD

molecular dynamics

MPs

macrocyclic peptides

MT

methyltransferase

ncAA

non-canonical amino acid

ncAA-MP

non-canonical amino acid-containing macrocyclic peptide

NRP

non-ribosomal peptide

PCSK9

proprotein convertase subtilisin/kexin type 9

PepSeA

Peptide Sequence Alignment

PLP

pyridoxal phosphate

PPIs

protein–protein interactions

RAS

rat sarcoma virus

RiPP

ribosomally synthesized and post-translationally modified peptide

SAH

S-adenosylhomocysteine

SAR

structure–activity relationship

SHMT

serine hydroxymethyl transferase

SMILES

simplified molecular-input line-entry system

SPPS

solid-phase peptide synthesis

TA

transaminase

TFA

trifluoroacetic acid

The authors declare no competing financial interest.

Special Issue

Published as part of the ACS Medicinal Chemistry Letters virtual special issue “New Enabling Drug Discovery Technologies - Recent Progress”.

References

  1. Muttenthaler M.; King G. F.; Adams D. J.; Alewood P. F. Trends in peptide drug discovery. Nat. Rev. Drug Disc. 2021, 20, 309–325. 10.1038/s41573-020-00135-8. [DOI] [PubMed] [Google Scholar]
  2. Chavda V. P.; Ajabiya J.; Teli D.; Bojarska J.; Apostolopoulos V. Tirzepatide, a new era of dual-targeted treatment for diabetes and obesity: a mini-review. Molecules 2022, 27, 4315–4324. 10.3390/molecules27134315. [DOI] [PMC free article] [PubMed] [Google Scholar]
  3. Blanco M.-J.; Gardinier K. M.; Namchuk M. N. Advancing new chemical modalities into clinical studies. ACS Med. Chem. Lett. 2022, 13, 1691–1698. 10.1021/acsmedchemlett.2c00375. [DOI] [PMC free article] [PubMed] [Google Scholar]
  4. Zhang H.; Chen S. Cyclic peptide drugs approved in the last two decades (2001–2021). RSC Chem. Biol. 2022, 3, 18–31. 10.1039/D1CB00154J. [DOI] [PMC free article] [PubMed] [Google Scholar]
  5. Vinogradov A. A.; Yin Y.; Suga H. Macrocyclic peptides as drug candidates: recent progress and remaining challenges. J. Am. Chem. Soc. 2019, 141, 4167–4181. 10.1021/jacs.8b13178. [DOI] [PubMed] [Google Scholar]
  6. Mordhorst S.; Ruijne F.; Vagstad A. L.; Kuipers O. P.; Piel J. Emulating nonribosomal peptides with ribosomal biosynthetic strategies. RSC Chem. Biol. 2023, 4, 7–36. 10.1039/D2CB00169A. [DOI] [PMC free article] [PubMed] [Google Scholar]
  7. Imai Y.; Meyer K. J.; Iinishi A.; Favre-Godal Q.; Green R.; Manuse S.; Caboni M.; Mori M.; Niles S.; Ghiglieri M.; Honrao C.; Ma X.; Guo J. J.; Makriyannis A.; Linares-Otoya L.; Böhringer N.; Wuisan Z. G.; Kaur H.; Wu R.; Mateus A.; Typas A.; Savitski M. M.; Espinoza J. L.; O’Rourke A.; Nelson K. E.; Hiller S.; Noinaj N.; Schäberle T. F.; D’Onofrio A.; Lewis K. A new antibiotic selectively kills Gram-negative pathogens. Nature 2019, 576, 459–464. 10.1038/s41586-019-1791-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
  8. Lee D.; Lee S.; Choi J.; Song Y.-K.; Kim M. J.; Shin D.-S.; Bae M. A.; Kim Y.-C.; Park C.-J.; Lee K.-R.; Choi J.-H.; Seo J. Interplay among conformation, intramolecular hydrogen bonds, and chameleonicity in the membrane permeability and cyclophilin A binding of macrocyclic peptide cyclosporin O derivatives. J. Med. Chem. 2021, 64, 8272–8286. 10.1021/acs.jmedchem.1c00211. [DOI] [PubMed] [Google Scholar]
  9. Blaskovich M. A. T. Unusual amino acids in medicinal chemistry. J. Med. Chem. 2016, 59, 10807–10836. 10.1021/acs.jmedchem.6b00319. [DOI] [PubMed] [Google Scholar]
  10. Li X.; Craven T. W.; Levine P. M. Cyclic peptide screening methods for preclinical drug discovery. J. Med. Chem. 2022, 65, 11913–11926. 10.1021/acs.jmedchem.2c01077. [DOI] [PubMed] [Google Scholar]
  11. Goto Y.; Suga H. The RaPID platform for the discovery of pseudo-natural macrocyclic peptides. Acc. Chem. Res. 2021, 54, 3604–3617. 10.1021/acs.accounts.1c00391. [DOI] [PubMed] [Google Scholar]
  12. Josephson K.; Hartman M. C. T.; Szostak J. W. Ribosomal synthesis of unnatural peptides. J. Am. Chem. Soc. 2005, 127, 11727–11735. 10.1021/ja0515809. [DOI] [PubMed] [Google Scholar]
  13. Werner H. M.; Cabalteja C. C.; Horne W. S. Peptide backbone composition and protease susceptibility: impact of modification type, position, and tandem substitution. Chembiochem 2016, 17, 712–718. 10.1002/cbic.201500312. [DOI] [PMC free article] [PubMed] [Google Scholar]
  14. Yamaguchi H.; Kodama H.; Osada S.; Kato F.; Jelokhani-Niaraki M.; Kondo M. Effect of α-α-dialkyl amino acids on the protease resistance of peptides. Biosci. Biotechnol. Biochem. 2003, 67, 2269–2272. 10.1271/bbb.67.2269. [DOI] [PubMed] [Google Scholar]
  15. Hruby V. J. Peptide science: exploring the use of chemical principles and interdisciplinary collaboration for understanding life processes. J. Med. Chem. 2003, 46, 4215–4231. 10.1021/jm0303103. [DOI] [PubMed] [Google Scholar]
  16. McTiernan T. J.; Diaz D. B.; Saunders G. J.; Sprang F.; Yudin A. K. Navigating complex peptide structures using macrocycle conformational maps. RSC Chem. Biol. 2022, 3, 739–747. 10.1039/D2CB00016D. [DOI] [PMC free article] [PubMed] [Google Scholar]
  17. Nomura K.; Hashimoto S.; Takeyama R.; Tamiya M.; Kato T.; Muraoka T.; Kage M.; Nii K.; Kotake K.; Iida S.; Emura T.; Tanada M.; Iikura H. Broadly applicable and comprehensive synthetic method for N-alkyl-rich drug-like cyclic peptides. J. Med. Chem. 2022, 65, 13401–13412. 10.1021/acs.jmedchem.2c01296. [DOI] [PubMed] [Google Scholar]
  18. Johns D. G.; Campeau L.-C.; Banka P.; Bautmans A.; Bueters T.; Bianchi E.; Branca D.; Bulger P. G.; Crevecoeur I.; Ding F.-X.; Garbaccio R. M.; Guetschow E. D.; Guo Y.; Ha S. N.; Johnston J. M.; Josian H.; Kauh E. A.; Koeplinger K. A.; Kuethe J. T.; Lai E.; Lanning C. L.; Lee A. Y. H.; Li L.; Nair A. G.; O’Neill E. A.; Stoch S. A.; Thaisrivongs D. A.; Tucker T. J.; Vachal P.; van Dyck K.; Vanhoutte F. P.; Volckaert B.; Wolford D. G.; Xu A.; Zhao T.; Zhou D.; Zhou S.; Zhu X.; Zokian H. J.; Walji A.; Wood H. B.. An orally bioavailable macrocyclic peptide that inhibits binding of PCSK9 to the LDL receptor. Circulation 2023, 10.1161/CIRCULATIONAHA.122.063372, in press. [DOI] [PMC free article] [PubMed] [Google Scholar]
  19. Iskandar S. E.; Bowers A. A. mRNA display reaches for the clinic with new PCSK9 inhibitor. ACS Med. Chem. Lett. 2022, 13, 1379–1383. 10.1021/acsmedchemlett.2c00319. [DOI] [PMC free article] [PubMed] [Google Scholar]
  20. Henninot A.; Collins J. C.; Nuss J. M. The current state of peptide drug discovery: back to the future?. J. Med. Chem. 2018, 61, 1382–1414. 10.1021/acs.jmedchem.7b00318. [DOI] [PubMed] [Google Scholar]
  21. Milton J.; Zhang T.; Bellamy C.; Swayze E.; Hart C.; Weisser M.; Hecht S.; Rotstein S. HELM software for biopolymers. J. Chem. Inf. Model. 2017, 57, 1233–1239. 10.1021/acs.jcim.6b00442. [DOI] [PubMed] [Google Scholar]
  22. Henikoff S.; Henikoff J. G. Amino acid substitution matrices from protein blocks. Proc. Natl. Acad. Sci. U.S.A. 1992, 89, 10915–10919. 10.1073/pnas.89.22.10915. [DOI] [PMC free article] [PubMed] [Google Scholar]
  23. Baylon J. L.; Ursu O.; Muzdalo A.; Wassermann A. M.; Adams G. L.; Spale M.; Mejzlik P.; Gromek A.; Pisarenko V.; Hancharyk D.; Jenkins E.; Bednar D.; Chang C.; Clarova K.; Glick M.; Bitton D. A. PepSeA: peptide sequence alignment and visualization tools to enable lead optimization. J. Chem. Inf. Model. 2022, 62, 1259–1267. 10.1021/acs.jcim.1c01360. [DOI] [PubMed] [Google Scholar]
  24. Free S. M.; Wilson J. W. A mathematical contribution to structure-activity studies. J. Med. Chem. 1964, 7, 395–399. 10.1021/jm00334a001. [DOI] [PubMed] [Google Scholar]
  25. Kubinyi H. Free Wilson analysis. Theory, applications and its relationship to Hansch analysis. Quant. Struct. Relationships 1988, 7, 121–133. 10.1002/qsar.19880070303. [DOI] [Google Scholar]
  26. Jumper J.; Evans R.; Pritzel A.; Green T.; Figurnov M.; Ronneberger O.; Tunyasuvunakool K.; Bates R.; Žídek A.; Potapenko A.; Bridgland A.; Meyer C.; Kohl S. A. A.; Ballard A. J.; Cowie A.; Romera-Paredes B.; Nikolov S.; Jain R.; Adler J.; Back T.; Petersen S.; Reiman D.; Clancy E.; Zielinski M.; Steinegger M.; Pacholska; Berghammer M.; Bodenstein T.; Silver D.; Vinyals O.; Senior A. W.; Kavukcuoglu K.; Kohli P.; Hassabis D. Highly accurate protein structure prediction with AlphaFold. Nature 2021, 596, 583–589. 10.1038/s41586-021-03819-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
  27. Baek M.; DiMaio F.; Anishchenko I.; Dauparas J.; Ovchinnikov S.; Lee G. R.; Wang J.; Cong Q.; Kinch L. N.; Schaeffer R. D.; Millán C.; Park H.; Adams C.; Glassman C. R.; DeGiovanni A.; Pereira J. H.; Rodrigues A. V.; van Dijk A. A.; Ebrecht A. C.; Opperman D. J.; Sagmeister T.; Buhlheller C.; Pavkov-Keller T.; Rathinaswamy M. K.; Dalwadi U.; Yip C. K.; Burke J. E.; Garcia K. C.; Grishin N. V.; Adams P. D.; Read R. J.; Baker D. Accurate prediction of protein structures and interactions using a three-track neural network. Science 2021, 373, 871–876. 10.1126/science.abj8754. [DOI] [PMC free article] [PubMed] [Google Scholar]
  28. Coutsias E. A.; Lexa K. W.; Wester M. J.; Pollock S. N.; Jacobson M. P. Exhaustive conformational sampling of complex fused ring macrocycles using inverse kinematics. J. Chem. Theory Comput. 2016, 12, 4674–4687. 10.1021/acs.jctc.6b00250. [DOI] [PMC free article] [PubMed] [Google Scholar]
  29. Wang J.; Miao Y. Peptide Gaussian accelerated molecular dynamics (Pep-GaMD): Enhanced sampling and free energy and kinetics calculations of peptide binding. J. Chem. Phys. 2020, 153, 154109. 10.1063/5.0021399. [DOI] [PMC free article] [PubMed] [Google Scholar]
  30. Sindhikara D.; Spronk S. A.; Day T.; Borrelli K.; Cheney D. L.; Posy S. L. Improving accuracy, diversity, and speed with prime macrocycle conformational sampling. J. Chem. Inf. Model. 2017, 57, 1881–1894. 10.1021/acs.jcim.7b00052. [DOI] [PubMed] [Google Scholar]
  31. Carhart R. E.; Smith D. H.; Venkataraghavan R. Atom pairs as molecular features in structure-activity studies: definition and applications. J. Chem. Inf. Comput. Sci. 1985, 25, 64–73. 10.1021/ci00046a002. [DOI] [Google Scholar]
  32. Kearsley S. K.; Sallamack S.; Fluder E. M.; Andose J. D.; Mosley R. T.; Sheridan R. P. Chemical similarity using physiochemical property descriptors. J. Chem. Inf. Comput. Sci. 1996, 36, 118–127. 10.1021/ci950274j. [DOI] [Google Scholar]
  33. Wang L.; Friesner R. A.; Berne B. J. Replica exchange with solute scaling: a more efficient version of replica exchange with solute tempering (REST2). J. Phys. Chem. B 2011, 115, 9431–9438. 10.1021/jp204407d. [DOI] [PMC free article] [PubMed] [Google Scholar]
  34. Kannan S.; Zacharias M. Enhanced sampling of peptide and protein conformations using replica exchange simulations with a peptide backbone biasing-potential. Proteins 2007, 66, 697–706. 10.1002/prot.21258. [DOI] [PubMed] [Google Scholar]
  35. Reaxys commercial availability search conducted on 30-Jan-2023 for 1 g quantities of Fmoc-OH AAs with a generic aryl group: 251 L-AAs, 158 D-AAs, and 42 homologated AAs.
  36. Börgel J.; Ritter T. Late-stage functionalization. Chem 2020, 6, 1877–1887. 10.1016/j.chempr.2020.07.007. [DOI] [Google Scholar]
  37. Li S.; Pissarnitski D.; Nowak T.; Wleklinski M.; Krska S. W. Merging late-stage diversification with solid-phase peptide synthesis enabled by high-throughput on-resin reaction screening. ACS Catal. 2022, 12, 3201–3210. 10.1021/acscatal.1c05502. [DOI] [Google Scholar]
  38. Kaplaneris N.; Kaltenhäuser F.; Sirvinskaite G.; Fan S.; De Oliveira T.; Conradi L.-C.; Ackermann L. Late-stage stitching enabled by manganese-catalyzed C–H activation: peptide ligation and access to cyclopeptides. Sci. Adv. 2021, 7, eabe6202 10.1126/sciadv.abe6202. [DOI] [PMC free article] [PubMed] [Google Scholar]
  39. Mallek A. J.; Pentelute B. L.; Buchwald S. L. Selective N-arylation of p-aminophenylalanine in unprotected peptides with organometallic palladium reagents. Angew. Chem., Int. Ed. 2021, 60, 16928–16931. 10.1002/anie.202104780. [DOI] [PMC free article] [PubMed] [Google Scholar]
  40. Werner M.; Pampel J.; Pham T. L.; Thomas F. Late-stage functionalisation of peptides on the solid phase by an iodination-substitution approach. Chem.—Eur. J. 2022, 28, e202201339 10.1002/chem.202201339. [DOI] [PMC free article] [PubMed] [Google Scholar]
  41. Wang M.; Wang C.; Huo Y.; Dang X.; Xue H.; Liu L.; Chai H.; Xie X.; Li Z.; Lu D.; Xu Z. Visible-light-mediated catalyst-free synthesis of unnatural α-amino acids and peptide macrocycles. Nat. Commun. 2021, 12, 6873–6881. 10.1038/s41467-021-27086-x. [DOI] [PMC free article] [PubMed] [Google Scholar]
  42. Qi X.; Jambu S.; Ji Y.; Belyk K. M.; Panigrahi N. R.; Arora P. S.; Strotman N. A.; Diao T. Late-stage modification of oligopeptides by nickel-catalyzed stereoselective radical addition to dehydroalanine. Angew. Chem., Int. Ed. 2022, 61, e202213315 10.1002/anie.202213315. [DOI] [PMC free article] [PubMed] [Google Scholar]
  43. Key H. M.; Miller S. J. Site- and stereoselective chemical editing of thiostrepton by Rh-catalyzed conjugate arylation: new analogues and collateral enantioselective synthesis of amino acids. J. Am. Chem. Soc. 2017, 139, 15460–15466. 10.1021/jacs.7b08775. [DOI] [PMC free article] [PubMed] [Google Scholar]
  44. Yousif A. M.; Colarusso S.; Bianchi E. Katritzky salts for the synthesis of unnatural amino acids and late-stage functionalization of peptides. Eur. J. Org. Chem. 2023, 26, e202201274 10.1002/ejoc.202201274. [DOI] [Google Scholar]
  45. Openy J.; Amrahova G.; Chang J.-Y.; Noisier A.; Hart P. Solid-phase peptide modification via deaminative photochemical Csp3-Csp3 bond formation using katritzky salts. Chem.—Eur. J. 2022, 28, e202201121 10.1002/chem.202201121. [DOI] [PMC free article] [PubMed] [Google Scholar]
  46. Twitty J. C.; Hong Y.; Garcia B.; Tsang S.; Liao J.; Schultz D. M.; Hanisak J.; Zultanski S. L.; Dion A.; Kalyani D.; Watson M. Diversifying amino acids and peptides via deaminative reductive cross-couplings leveraging high-throughput experimentation. J. Am. Chem. Soc. 2023, 145, 5684. 10.1021/jacs.2c11451. [DOI] [PMC free article] [PubMed] [Google Scholar]
  47. Pennington M. W.; Zell B.; Bai C. J. Commercial manufacturing of good manufacturing practice peptides spanning the gamut from neoantigen to commercial large-scale products. Med. Drug Disc. 2021, 9, 100071. 10.1016/j.medidd.2020.100071. [DOI] [Google Scholar]
  48. Alfonzo E.; Das A.; Arnold F. H. New additions to the arsenal of biocatalysts for noncanonical amino acid synthesis. Curr. Opin. Green Sustain. Chem. 2022, 38, 100701. 10.1016/j.cogsc.2022.100701. [DOI] [PMC free article] [PubMed] [Google Scholar]
  49. Boville C. E.; Scheele R. A.; Koch P.; Brinkmann-Chen S.; Buller A. R.; Arnold F. H. Engineered biosynthesis of β-alkyl tryptophan analogues. Angew. Chem., Int. Ed. 2018, 57, 14764–14768. 10.1002/anie.201807998. [DOI] [PubMed] [Google Scholar]
  50. Liao C.; Seebeck F. P. Asymmetric β-Methylation of l- and d-α-amino acids by a self-contained enzyme cascade. Angew. Chem., Int. Ed. 2020, 59, 7184–7187. 10.1002/anie.201916025. [DOI] [PubMed] [Google Scholar]
  51. Li F.; Yang L.-C.; Zhang J.; Chen J. S.; Renata H. Stereoselective synthesis of β-branched aromatic α-amino acids by biocatalytic dynamic kinetic resolution. Angew. Chem., Int. Ed. 2021, 60, 17680–17685. 10.1002/anie.202105656. [DOI] [PMC free article] [PubMed] [Google Scholar]
  52. For an early example, see:; Gutierrez M. L.; Garrabou X.; Agosta E.; Servi S.; Parella T.; Joglar J.; Clapés P. Serine hydroxymethyl transferase from Streptococcus thermophilus and l-threonine aldolase from Escherichia coli as stereocomplementary biocatalysts for the synthesis of β-hydroxy-α,ω-diamino acid derivatives. Chem.—Eur. J. 2008, 14, 4647–4656. 10.1002/chem.200800031. [DOI] [PubMed] [Google Scholar]
  53. For a recent high dr/high conversion example, see:; Xu L.; Wang L.-C.; Su B.-M.; Xu X.-Q.; Lin J. Multi-enzyme cascade for improving β-hydroxy-α-amino acids production by engineering L-threonine transaldolase and combining acetaldehyde elimination system. Bioresour. Technol. 2020, 310, 123439. 10.1016/j.biortech.2020.123439. [DOI] [PubMed] [Google Scholar]

Articles from ACS Medicinal Chemistry Letters are provided here courtesy of American Chemical Society

RESOURCES