1. Introduction
Microbes have a remarkable capacity to build amino acid frameworks that are not incorporated into proteins. It has been estimated that about 500 naturally occurring amino acids have been identified to date, leaving the 20 proteinogenic amino acids as the 4% minority. Many of these nonproteinogenic amino acids have been discovered and biological context established since the last time this subject was reviewed in this journal in 1983.[1] While some of the nonproteinogenic amino acids are utilized as intermediates in primary metabolic pathways (e.g. homoserine, ornithine), most of the unheralded 96% majority serve as building blocks for small bioactive peptide scaffolds. They may represent an underutilized inventory of building blocks for protein engineers, medicinal chemists, and materials scientists.
On the one hand the existence of many hundreds of natural peptides with one or more nonproteinogenic amino acid residues reflects the ability of side chains of these particular monomers to impart some useful functional property not available in the 20 proteinogenic building blocks. On the other hand these natural peptide frameworks highlight the existence of biologic machineries for selection, activation, and incorporation of nonproteinogenic amino acid monomers into nonribosomal peptides (NRPs) as well as chain elongation of the resulting intermediates.[2,3] Perhaps many of those machineries could be moved, evolved, and re-engineered to work as building blocks in ribosomal protein biosynthesis. The development of pairs of evolved tRNAs and aminoacyl-tRNA synthetases to recognize synthetic unnatural amino acids, their introduction into growing cells, and the ability of the resulting cells to efficiently synthesize proteins harboring the corresponding unnatural amino acids augur well for the utility of these endogenously generated nonproteinogenic amino acids to the protein engineer.[4,5] To do so requires knowledge of the metabolic pathways to these noncanonical building blocks and the kinds of enzymatic transformations involved.
This review summarizes the biosynthetic routes to and metabolic logic for the major classes of the noncanonical amino acid building blocks that end up in both nonribosomal peptide frameworks and in hybrid nonribosomal peptide-polyketide scaffolds noted below.
In addition to microbial peptides of 2–22 residues built nonribosomally and therefore independent of mRNA messages, there are hundreds of naturally occurring scaffolds that are hybrids of peptide and polyketide (PK) frameworks,[6] many containing nonproteinogenic amino acid monomers. The rapamycin and FK506 family members,[7,8] sanglifehrin,[9] and the epothilones are examples of therapeutically interesting categories of such NRP-PK hybrid molecular frameworks that harbor nonproteinogenic amino acid units.[10,11]
2. Amino Acid Selection, Activation, and Incorporation for Ribosomal versus Nonribosomal Peptide Biosynthesis
In both NRP and hybrid NRP-PK scaffolds, the amino acid monomers, proteinogenic and nonproteinogenic, are selected and activated by 50 kDa proteins, sometimes freestanding, but most often joined in multidomain proteins. These activation domains are termed adenylation (A) domains for the chemistry they perform on their amino acid substrates.[2] Each A domain selects an amino acid. Once bound to the active site, the enzyme utilizes the amino acid’s carboxylate group to attack the α-phosphate of a bound ATP cosubstrate, yielding a bound aminoacyl-adenylate (aminoacyl-AMP) (Scheme 1). This is exactly the mechanism for the first step of aminoacyl-tRNA synthetases that activate amino acids used in protein biosynthesis: they are also amino acid adenylating enzymes.[12] However, structural analysis suggests, notwithstanding a few exceptions,[13] in most instances A domains and aminoacyl-tRNA synthetases evolved independently. Therefore, they are not readily interchangeable, for example in protein biosynthesis replacement studies.
Scheme 1.
Biosynthesis of ribosomal versus nonribosomal peptides. ATP = adenosine triphosphate, AMP = adenosine monophosphate, PPi = pyrophosphate, A domain = adenylation domain, T domain = thiolation (peptidyl carrier protein) domain, C domain = condensation domain.
In the second step the two enzyme classes diverge with respect to the nature of the cosubstrate to which the activated aminoacyl group is now transferred (Scheme 1). For the aminoacyl-tRNA synthetases the cosubstrate is a cognate tRNA and the attacking nucleophilic group is the 3’ or 2’-OH of the ribose ring of the terminal adenine in the CCA tail of each tRNA.[14] The resultant aminoacyl-tRNA is released and is subsequently chaperoned to the large subunit of the ribosome for mRNA-instructed protein biosynthesis.[14] In contrast, for the NRPS adenylation domains, the cosubstrate in the second step is the HS-pantetheinyl arm of the holo form of an 8–10 kDa thiolation (T) domain.[15] The T domains are often in cis with the A domains, as a contiguous downstream region of 80–100 residues following the ~500 residue A domains (A–T didomains by themselves or as parts of larger proteins). Transfer to the nucleophilic thiolate of the pantetheinyl arm yields T domain-tethered aminoacyl thioesters.[16]
Chain elongation on the ribosome and on NRPS assembly lines also has parallels and distinctions (Scheme 1). On the ribosome, an incoming aminoacyl-tRNA sits next to the growing peptidyl-tRNA, with occupancy of the incoming monomer determined by triplet coding between the mRNA codon and the tRNA anticodon. Once the proper aminoacyl-tRNA has been recognized and properly situated by action of conditional GTPases, peptide bond formation can occur in the peptidyl transferase center of the 50S ribosomal subunit, mediated by the 23S rRNA. The nucleophile is the N-terminal amino group of the peptidyl-tRNA. The electrophile is the activated carboxyl oxygen ester of the aminoacyl-tRNA. Peptide bond formation generates an elongated peptidyl-tRNA and the deacylated tRNA that had delivered the incoming aminoacyl group as coproduct. In contrast, in the mRNA-independent NRPS assembly lines, peptide bond formation is driven by a ~50 kDa condensation (C) domain. Again these domains are most often encoded in cis such that a typical NRPS elongation module has the composition C-A-T and would be about 110 kDa, consisting of three (semi-) autonomously folded domains.[2,17] Specificity of monomer incorporation into the growing peptide chain is not determined by mRNA triplet code but rather by which NRPS modules interact with each other sequentially.
A prototypical NRPS responsible for tyrocidine (1) biosynthesis is shown in Scheme 2.[18] As is evident from this example, in many cases 3–5 modules are encoded in a single protein and chain elongation proceeds from N-C direction.
Scheme 2.
Tyrocidine and its NRPS. A = adenylation domain, T = thiolation (peptidyl carrier protein) domain, C = condensation domain, E = epimerase domain, TE = thioesterase domain.
For hybrid NRP-PK natural products, particular NRPS modules have evolved to interface with polyketide synthase assembly lines.[17,19] One can find both possible orientations of NRPs and PKS modules in naturally occurring assembly lines. When an NRPS module is upstream of a PKS module, as is the case in epothilone C (2, Scheme 3A) biosynthesis, the aminoacyl-S-T domain must be recognized by and transferred to the first domain of the downstream PKS module.[20] Typically these are ketosynthase (KS) domains, which make C-C bonds by decarboxylative Claisen condensations. In the alternative configuration where an NRPS module is downstream of a PKS module, as in rapamycin (3, Scheme 3B) and FK506 assembly,[8] the C domain of that NRPS module must be able to recognize the immediately upstream polyketidyl-S-T domain.[21] Now the attacking nucleophile is the amino group of the aminoacyl-S-T domain in the NRPS module. Bond formation creates an amide linkage as the polyketidyl chain is transferred onto the NRPS module.[20,22]
Scheme 3.
Epothilone C and rapamycin biosynthesis. A. NRPS-PKS pathway. B. PKS-NRPS pathway. Nonproteinogenic amino acids: red = 2-methylthiazole-4-carboxylate, blue = L-pipecolate.
A. epoA: http://www.ncbi.nlm.nih.gov/protein/AAF62880.1
epoB: http://www.ncbi.nlm.nih.gov/protein/AAF62881.1
B. Hyperlink 1: L-pipecolate formation in rapamycin biosynthesis.
There are examples of hybrid assembly lines where NRPS domains are flanked both upstream and downstream by PKS domains (e.g., epothilone)[20] and the reverse (e.g., bleomycin,[23] yersiniabactin).[24] One of the virtues of hybrid assembly lines is that they can build complex scaffold architectures and mix a variety of oxygen and nitrogen functionalities into the frameworks of the natural products. Advances in structural determination of domains and whole modules in PKS and NRPS assembly lines has begun to give insights into engineering new PKS-NRPS connectivities and altered specificities of building block utilization.[25,26]
2.1 Para-Amino-Phe Biosynthesis in Streptomyces pristinaespiralis
The noncanonical para-dimethylamino-phenylalanine occurs in pristinamycin IA, an antibiotic produced by S. pristinaespiralis.[27] This building block arises from p-NH2-Phe (by S-adenosyl methionine (SAM)-mediated dimethylation), which is also a building block for chloramphenicol assembly in Streptomyces venezuelae.[28] Phe is not the direct precursor of p-NH2-Phe; instead it arises by diversion of metabolic flux from chorismate away from prephenate (Scheme 4). In keeping with NRPS assembly logic four pap genes are found in tandem next to the NRPS and PKS genes that build pristinamycin IA (5, Scheme 4) scaffolds in the S. pristinaespiralis genome.[29,30]
Scheme 4.
p-Aminophenylalanine biosynthesis in the pristinamycin IA pathway. Nonproteinogenic amino acids: red = L-(p-amino)-Phe (9) and L-((p-(N,N-dimethylamino))-Phe (10).
papA: http://www.ncbi.nlm.nih.gov/protein/AAC44866.1
papB: http://www.ncbi.nlm.nih.gov/protein/AAC44868.1
PapA is a glutamine-dependent chorismate amidotransferase, converting chorismate (6, Scheme 4) into 4-amino-4-deoxychorismate (7, Scheme 4) that is the key intermediate formed in p-aminobenzoic acid (PABA) biosynthesis.[31] However, in this case PapB intervenes and the enolpyruvyl side chain is not eliminated. Instead PapB is an aminodeoxychorismate mutase, generating 4-amino-4-deoxyprephenate (8, Scheme 4) presumably by a 3,3-electrocyclic rearrangement. This undergoes aromatizing decarboxylation/dehydrogenation by PapC to produce 4-aminophenylpyruvate, which in turn undergoes transamination to yield 4-amino-L-Phe (9). PapM can methylate the amino group once or twice to yield the mono- or dimethylamino-L-Phe (10) monomer, either of which can be incorporated into pristinamycins.
2.2 Incorporation of Nonproteinogenic Amino Acids into Proteins
Schultz and colleagues moved the papABC gene cluster from S. venezuelae into an Escherichia coli strain into which they had inserted an evolved tRNA/tRNA synthetase pair that would recognize p-NH2-Phe and incorporate it via amber-suppressor codon technology site-specifically in myoglobin.[32] In principle, the strategy for p-NH2-Phe incorporation could be generalized to any of the other nonproteinogenic amino acid building blocks described in this review, as long as the genes are known and the proteins are expressed in active form in a bacterial host (e.g. E. coli) harboring the engineered tRNA/tRNA synthetase pair to recognize the corresponding building block.[33] In practice, as discussed in the following sections, many nonproteinogenic amino acids required for NRP or mixed PK/NRP biosynthesis are generated as aminoacyl thioesters tethered to a protein-bound pantetheine arm. Once synthesized, these building blocks are directly utilized by the NRPS. Hijacking these unnatural amino acids for ribosomal protein synthesis will therefore require an engineered thioesterase. Fortunately, nature harbors a number of these aminoacyl thioesterases: for example, the nonproteinogenic amino acid coronamate is produced as a protein-bound thioester, which is hydrolyzed prior to incorporation into the natural product coronatine (Scheme 5).[34]
Scheme 5.
Coronamate biosynthesis and incorporation in the coronatine pathway. Nonproteinogenic amino acids: red = (2R)-2-amino-3-(chloromethyl)pentanoic acid, blue = coronamate (11).
3. Biosynthesis of Nonproteinogenic Amino Acids
3.1 ORFs within NRPS Clusters
In this review we focus on the unusual amino acid building blocks, the nonproteinogenic ones, that are fashioned by producer microbes for specific incorporation into the above classes of molecular scaffolds. Some of these nonproteinogenic amino acids are fashioned from the canonical 20 proteinogenic amino acids while others are biosynthesized de novo. For any nonproteinogenic amino acid with a known biosynthetic pathway, a hyperlink is provided within each scheme that shows the biosynthetic pathway with links to the relevant GenBank entry. In cases where the biosynthetic pathway is unknown, a hyperlink is included that shows a hypothetical biosynthetic pathway for pedagogical purposes.
In most cases the genes encoding these dedicated amino acid building blocks are adjacent to genes for the NRPS assembly line modules, enabling coordinate regulation to make the building blocks as they are needed. This feature greatly simplifies the task of finding open reading frames (ORFs) that encode enzymes dedicated to nonproteinogenic amino acids and then moving them to production organisms. For example, as discussed above, the genes encoding p-dimethylamino-phenylalanine are clustered in the pristinamycin IA producer (Scheme 4).[29]
3.2 Major Classes of Reactions that Build Nonproteinogenic Amino Acids
Most of the almost 500 of the nonproteinogenic amino acids are produced by a small number of enzymatic reactions, either from preexisting proteinogenic amino acids or by de novo construction. Examples discussed below include: 1) racemases that make D-amino acids from L-enantiomers;[35,36] 2) mutases that convert α-amino acids into β-amino acids;[37,38] 3) reactions that produce β-methyl substituted amino acids and hydroxylases that install -OH groups, often as β-OH substituents;[39,40] 4) oxygen-dependent halogenases that install chlorines on both activated and unactivated carbon centers;[41] 5) de novo construction of phenylglycines with both 4-OH- and 3,5-(OH)2 substitution patterns are shown to arise by entirely different molecular logic;[42,43] 6) biosynthesis of α,ω-diamino acids, such as 2,3-diaminopropionate (DAP) and 2,4-diaminobutyrate (DAB);[44,45,46] 7) several distinct types of cyclic nonproteinogenic amino acids are discussed, from three membered cyclopropanes (e.g. coronomate, Scheme 5), four membered tabtoxin β-lactam,[47] to five membered enduracididine,[48] to six membered capreomycidine,[49] pipecolate, and piperazate rings; and 8) the transformation of tryptophan to kynurenine,[50] and of tyrosine to propyl- and propenyl prolines is analyzed.[51]
Some of the carbon skeletons of noncanonical amino acids, such as 2-amino-3-hydroxy-4-methyl-6-octenoate, 2-amino-8-oxo-9,10-decanoate and 3-amino-9-methoxy-2,6,8-trimethyl-10-phenyl-4,6-decadienoate, are fashioned by PKSs as free monomers, which undergo late stage enzymatic reductive amination.[52] Other amino acids, such as statine and isostatine, are synthesized on hybrid NRPS-PKS assembly lines and directly incorporated into the growing chains.[53] Comparable hybrid logic and enzymatic machinery pertains for building vinyl-Arg and vinyl-Tyr units.[54]
To illustrate particular molecular contexts for major nonproteinogenic amino acid classes, in the next section we introduce 15 NRP scaffolds and an additional 4 hybrid NRP-PK scaffolds. Together these 19 frameworks contain about 50 of the most common nonproteinogenic building blocks, representative of almost all the major classes of amino acid scaffolds. We then review the details of how those and related nonproteinogenic building blocks are generated within the producer microbes.
4. Nonproteinogenic Amino Acid Residues in NRP frameworks
4.1 Linear Peptide Frameworks
Nonproteinogenic building blocks are found in linear NRP scaffolds, such as in the sponge-associated miraziridine A (13, Figure 1), probably produced by bacteria in the consortium.[55] This linear pentapeptide has four nonproteinogenic residues out of five total. In addition to aminobutyrate (Aba),[56] the next longer homolog of Ala, there are three remarkable building blocks: an aziridine dicarboxylate (Azd),[57] statine,[53] and vinyl-Arg.[54] The three-membered aziridine ring is unusual in biology and likely a reactive chemical functionality.[58,59] Statine and vinyl-Arg are synthesized by hybrid NRPS-PKS assembly lines, as explained in section 6.1.
Figure 1.
Miraziridine A. Nonproteinogenic amino acids: red = (3S, 4S)-statine, blue = (2R, 3R)-aziridine-2,3-dicarboxylate, green = L-α-aminobutyrate, purple = (S)-vinyl-Arg.
Hyperlink 2: Hypothetical biosynthesis of miraziridine A.
Hyperlink 3: Biosynthesis of α-aminobutyric acid (Aba) in E. coli K-12.
avtA: http://www.ncbi.nlm.nih.gov/protein/YP_491862.1
ilvA: http://www.ncbi.nlm.nih.gov/protein/CAA28577.1
Hyperlink 4: Biosynthesis of statine moiety of Thailandepsin E.
Hyperlink 5: Biosynthesis of vinylogous alanine moiety of Jamaicamide B
4.2 Cyclic Peptide Frameworks
There is sustained interest in cyclic frameworks whose conformational constraints generate fixed three dimensional molecular architectures that can produce high affinity ligands for biological targets.[60] Here we discuss fifteen cyclic peptide frameworks with 3–15 residues in their macrocyclic frameworks. Ring closing of such cyclic peptides can be achieved by macrolactam (cyclic amides), macrolactone (cyclic esters), cyclic amidine (as in the case of bottromycin A2 (14), Figure 2),[61] or aryl ether bridge formation. As seen in Scheme 2, the decapeptidic macrolactam of tyrocidine (1) is formed by head-to-tail condensation of Phe1 and Leu10,[62] whereas the polymyxin nonapeptides (such as polymyxin B2 (15), Figure 3) form stem-loop macrolactam structures in which a side chain amine from 2,4-DAB is the nucleophile.[46,63] Macrolactone linkages (also known as depsipeptides) can arise from the side chain hydroxyl of Thr, as in callipeltin B (16, Figure 4)[64,65] and daptomycin (17, Figure 5),[50,66] or Ser, as in enterobactin.[67]
Figure 2.
Bottromycin A2. Nonproteinogenic amino acids: red = (2S,3R)-3-methyl-Pro, blue = L-t-butyl-Gly, green = (2S,3S)-β-methyl-Phe, purple = (R)-methyl 3-amino-3-(thiazol-2-yl)propanoate.
Figure 3.
Polymyxin B2, a stem-loop macrolactam. Nonproteinogenic amino acid: red = L-2,4-diaminobutyrate (2,4-DAB).
Hyperlink 6: Biosynthesis of 2,4-diaminobutyrate (2,4-DAB) from aspartyl phosphate in Pseudomonas aeruginosa PAO1
Figure 4.
Callipeltin B, a Thr linked depsipeptide. Nonproteinogenic amino acids: red = (2R,3R)-β-methoxy-Tyr, blue = (2S,3S,4R)-3,4-dimethyl-5-oxo-Pro.
Hyperlink 7: Anti-HIV marine cyclicdepsipeptides containing a β-methoxytyrosine moiety.
Figure 5.
Daptomycin. Nonproteinogenic amino acids: red = L-kynurenine, blue = (3S,4S)-b-methylglutamate, green = L-ornithine.
Hyperlink 8: Biosynthesis of kyneurinine.
Alternatively an α-hydroxy acid building block incorporated in place of one of the amino acids can form a macrolactone, as in the kutznerides (e.g. kutzneride 1 (18), Figure 6).[68] A variant occurs in the N-acylated lipopeptide surfactin (19, Figure 7), where a β-OH fatty acyl chain participates in macrolactone formation.[69]
Figure 6.
Kutzneride 1, a cyclic hexa(depsi)peptide. Nonproteinogenic amino acids: red = dichloropyrroloindoline carboxylate, blue = L-O-methyl-Ser, green = (2R,3S)-β-hydroxy-Glu, purple = methylcyclopropyl-Gly (MeCPGly), orange = piperazate.
Hyperlink 9: MecPGly formation in kutzneride biosynthesis.
KtzA: http://www.ncbi.nlm.nih.gov/protein/157429059
KtzB: http://www.ncbi.nlm.nih.gov/protein/157429060
KtzC: http://www.ncbi.nlm.nih.gov/protein/157429061
KtzD: http://www.ncbi.nlm.nih.gov/protein/157429062
KtzF: http://www.ncbi.nlm.nih.gov/protein/157429064
Hyperlink 10: Piperazate formation in kutzneride biosynthesis.
KtzI: http://www.ncbi.nlm.nih.gov/protein/157429067
KtzQ: http://www.ncbi.nlm.nih.gov/protein/157429075
KtzR: http://www.ncbi.nlm.nih.gov/protein/157429076
Hyperlink 11: Hypothetical SAM mediated O-methylation biosynthesis.
Figure 7.
Surfactin, a β-hydroxycarboxylic lactone
Among the smallest natural peptide macrocycles is the twelve-atom macrolactam found in the proteasome inhibitor syringolin A (20, Figure 8).[70,71] It has a stem-loop structure with a macrocyclic lactam derived from only two residues, both noncanonical. One is a Δ3-lysene that may arise from dehydration of a 3-OH-Lys residue.[70] The other is a vinyl-Val formed by an NRPS-PKS hybrid module (see section 6.1 below). The conjugated enamide derived from vinyl-Val is the electrophilic moiety of syringolin that irreversibly inactivates the proteasome by capture of the active site Thr1-NH2.[72]
Figure 8.
Syringolin A. Nonproteinogenic amino acids: red = L-vinyl-Val, blue = L-Δ-3-Lysene.
Hyperlink 12: Biosynthesis of vinylvaline and 3,4-dehydrolysine on the syringolin (syl) synthase.
Eurypamide A (21, Figure 9) is a three-residue peptide macrocycle, isolated from a presumed bacterial community in Palauan sponges,[73] containing a Tyr-O-Tyr crosslink similar to the analogous Tyr-C-Tyr crosslink in arylomycin antibiotics (e.g. arylomycin B-C16 (22), Figure 9).[74,75] The side chain aryl ether in 21 is thought to arise from one electron phenoxy radical chemistry via iron-based oxygenase action on the two tyrosine side chains of a tripeptide precursor.[76] These crosslinks set the constrained architecture in 21 and 22 (Figure 9).[75,77] Eurypamide A (21) also harbors the unusual 3R,4S-dihydroxy-L-Arg,[77] although the timing of bis-hydroxylation is unknown. It could occur on the free amino acid, the presumed aminoacyl-S-T NRPS assembly line intermediate, or the released peptide.
Figure 9.
Aryl ethers and aryl-aryl linkages in cyclic peptides. Nonproteinogenic amino acids: red = (2S,3R,4S)-3,4-dihydroxy-Arg, blue = Aryl ether (L-Tyr/(R)-β-Tyr), green = Aryl ether (L-(p-hydroxy)phenyl-Gyl/L-m-nitro-Tyr).
Hyperlink 13: Hypothetical pathway for eurypamide biosynthesis.
Macrocyclic tetrapeptides are also found in the histone deactylase (HDAC) inhibitors of the trapoxin (21) and HC toxin class (22, Figure 10).[78,79] This compact architecture presents the epoxyketone side chain of the unusual 2-amino-8-oxo-9,10-decanoate (Aeo) epoxy residue to the active site of HDACs for irreversible capture of the active site nucleophile.[80] The strategy for construction of the Aeo monomer is discussed in section 6.2 below.
Figure 10.
Natural products containing L-Aeo. Nonproteinogenic amino acids: red = L-2-amino-8-oxo-9,10-decanoate (Aeo).
Hyperlink 14: Hypothetical pathway for Aeo biosynthesis.
Cyclic pentapeptide scaffolds are found in the antitubercular drugs, capreomycin (23, Figure 11A) and viomycin (24, Figure 11b).[49] They are notable for the presence of a 2,3-DAP as the macrolactam-forming residue,[44] a β-lys residue in the stem of the stem-loop structure, the cyclic arginine derivative capreomycidine, and a ureidodehydroAla residue. These unusual amino acids play important roles in the ability of both antibiotics to bind across the 30S–50S subunit interface on bacterial ribosomes.[81] The biosynthesis of the β-Lys and capreomycidine building blocks are discussed in sections 5.1.2 and 5.6.3, respectively.
Figure 11.
Natural products containing a capreomycidine moiety. Nonproteinogenic amino acids: red = β-Lys, blue = L-2,3-DAP, green = (E)-ureidodehydro-Ala, purple = capreomycidine and δ-hydroxycapreomycidine.
Hyperlink 15: Formation of L-2,3-diaminopropionate (2,3-DAP) from L-serine in zwittermycin biosynthesis
Cyclic hexa(depsi)peptides include the kutzneride family of hexapeptidolactones (e.g. 18, Figure 6)[82] with antifungal activity and the echinocandins (e.g. echinocandin B (25), Figure 12),[83,84] which target the fungal 1,3-glucan synthases. Kutznerides have five nonproteinogenic amino acid building blocks and an unusual hydroxy-acid (t-butylglycolate) that participates in forming the macrolactone framework.[68,68] The echinocandins have a hexapeptide scaffold with a metastable hemiaminal linkage between N5 of an N1-acylated ornithine and the carbonyl of a 3-methyl-4-hydroxyproline residue. Both the ornithine side chain and an unusual homotyrosine side chain are dihydroxylated; the timing of the hydroxylations is unknown.
Figure 12.
Echinocandin B. Nonproteinogenic amino acids: red = (2S,3S,4S)-3,4-dihydroxyhomo-Tyr, blue = (2S,4R)-4-hydroxy-Pro, green = (2R,4R,5R)-4,5-dihydroxy-Orn, purple = (2S,3S,4S)-3-hydroxy-4-methyl-Pro.
Hyperlink 16: Two possible routes for the biosynthesis of L-3-methyl-4-hydroxy-Pro from L-leucine
Hormaomycin (26, Figure 13) is another hexapeptidolactone that is also replete with noncanonical amino acid units, including two β-methyl-Phe residues, a propenyl-proline, and a remarkable nitrocyclopropyl-alanine residue.[85] The OH group of a Thr residue participates in the formation of the hexapeptidolactone macrocycle; its amino group is acylated with a dipeptide moiety containing another nitrocyclopropyl-Ala and a proline-derived N-OH-2-chloropyrrole carboxyl residue. A schematic for chain elongation of this remarkable natural product is shown in hyperlinked form in Figure 13.
Figure 13.
Hormaomycin. Nonproteinogenic amino acids: red = Nitrocyclopropyl-Ala, blue = 5-chloro-1-hydroxypyrrole-2-carboxylate, green = (2S,3R)-β-methyl-Phe, purple = (2R,4R)-4-((Z)-propenyl)-Pro.
Hyperlink 17: Hormaomycin biosynthesis.
Hyperlink 18: Propenylproline formation in hormaomycin biosynthesis.
HrmD: http://www.ncbi.nlm.nih.gov/protein/335346673
HrmE: http://www.ncbi.nlm.nih.gov/protein/AEH41783.1
HrmF: http://www.ncbi.nlm.nih.gov/protein/335346656
Hyperlink 19: Proposed 5-chloropyrrole-2-carboxylate formation in hormaomycin biosynthesis.
HrmK: http://www.ncbi.nlm.nih.gov/protein/AEH41789.1
HrmL: http://www.ncbi.nlm.nih.gov/protein/335346662
HrmM: http://www.ncbi.nlm.nih.gov/protein/335346663
HrmN: http://www.ncbi.nlm.nih.gov/protein/335346664
HrmQ: http://www.ncbi.nlm.nih.gov/protein/190610819
Hyperlink 20: Two possible pathways for nitrocyclopropylalanine formation in hormaomycin biosynthesis.
Figure 13. Heptapeptidyl frameworks are found in several natural products, including the N-acylated mycosubtilin (27, Figure 14),[86] and surfactin lipopeptides from Bacillus spp (see Figure 7).[69] The β-hydroxy substituent of the fatty acyl chain engages in macrolactonization in surfactin, while the fatty acyl chain in mycosubtilin instead bears a β-amino group that makes a macrolactam. Callipeltin B (16, Figure 4) is a sponge-derived heptadepsipeptide with a D-allo-Thr residue and also a β-methoxy-Tyr residue.[64] The antibacterial polymyxins (e.g. 15, Figure 3) have multiple 2,4-DAB residues, one of which participates in the heptapeptidyl macrolactam loop.[63] These side chains are cationic under physiologic conditions and may serve to sequester this antibiotic by interaction with anionic lipopolysaccharides on Gram-negative bacterial pathogens.[87] Cyanobacteria make a variety of toxic cyclic peptides via a nonribosomal strategy;[88] among them are the microcystins (e.g. microcystin LR (28), Figure 15), cyclic heptapeptides that target protein phosphatase I for inhibition.[89] More than 70 structural variants are known, reflecting promiscuity in the heptamodule NRPS assembly lines. Among its unusual residues are dehydroalanine, β-Me-D-isoaspartate, the polyketide-derived Adda, and isoglutamate. The didemnins (e.g. didemnin B (29), Figure 16) are cyclic heptadepsipeptides isolated from bacterial symbionts of sea squirts,[90] and have been of interest for their immunosuppressive properties.[91] Among the building blocks of special interest is the isostatine residue, with a 3-OH, 4-NH2 grouping, assembled by a hybrid NRPS-PKS module discussed in section 6.1.
Figure 14.
Mycosubtilin. Nonproteinogenic amino acid: red = (S)-β-aminostearate.
Figure 15.
Microcystin LR. Nonproteinogenic amino acids: red = vinylogous β-Ala, blue = (2R,3S)-β-methylaspartate, green= dehydro-Ala.
Figure 16.
Didemnin B. Nonproteinogenic amino acids: red = L-O-Me-Tyr, blue = (3S,4R,5S)-isostatine.
A distinct mode of converting a linear heptapeptidyl framework into a constrained architecture occurs in the maturation of the T7-domain tethered linear heptapeptidyl intermediate in assembly of the antibiotic vancomycin (30, Figure 17).[92,93] Three heme iron-containing monooxygenases act to generate phenoxy radical intermediates to couple residues 2–4, 4–6, and 5–7.[94,95] Each of these five participating residues is nonproteinogenic, and has an accessible redox potential for a radical coupling reaction. Residues 2 and 6 are β-OH-Tyr. Residues 4 and 5 are 4- OH-phenylglycines, and residue 7 is a 3,5-dihydroxyphenylglycine. In teicoplanin (31, Figure 17) a fourth crosslink occurs between Tyr1 and 3,5-(OH)2-Phegly3, such that all seven residues participate in a total of four crosslinks. The formation of these unusual aryl amino acids is discussed below.
Figure 17.
Vancomycin and Teicoplanin. Nonproteinogenic amino acids: red = (2R,3R)-m-chloro-(or β-hydroxy-m-chloro)Tyr aryl ether, blue = 3,5-dihydroxy-Tyr aryl ether, green = p-hydroxyphenyl-Gly aryl-aryl, purple =3,5-dihydroxyphenyl-Gly arly-aryl or aryl ether, orange = (2S,3R)-β-hydroxy-m-chloro-Tyr aryl ether.
Hyperlink 21: Aryl-aryl and aryl ether crosslinking in vancomycin.
Hyperlink 22: Aryl amino acid hydroxylation.
Hyperlink 23: Chlorination of amino acids
Daptomycin is a clinically used N-acylated 13-residue NRP antibiotic (17, Figure 5).[96] It has a 10-residue macrolactone ring, fashioned from the OH group of Thr4 and the carbonyl of residue 13 that is the nonproteinogenic kynurenine, which arises from oxidative metabolism of tryptophan.[97] It also harbors a nonproteinogenic 3-methyl-Glu residue 11 as one of several side chain carboxylates that coordinate Ca2+ ions and change conformation to a membrane active form.
Friulimicin is another nonribosomally produced lipopeptide (32, Figure 18),[45] with a decapeptidyl macrolactam between a 2,3-diaminobutyryl residue at position 2 and Pro11. Fruilimicin binds the lipid II carrier required for peptidoglycan formation in bacterial cell wall assembly.[98] Friulimicin also has additional nonproteinogenic amino acid building blocks, including pipecolate3, β-Me-Asp4 and another 2,3-DAB.
Figure 18.
Friulimicin. Nonproteinogenic amino acids: red = D-pipecolate, blue = β-methyl-Asp, green = 2,3-DAB.
An 11-residue macrolactam is formed in the last step of biosynthesis of the immunosuppressant undecapeptide cyclosporin A (33, Figure 19), when D-Ala1 is coupled head-to-tail to the thioesterified carbonyl of L-Ala11 by the megadalton NRPS cyclosporin synthetase.[99] Seven of the 11 residues of cyclosporin are N-methylated. There is also a nonproteinogenic 2- aminobutyryl residue and a methylbutenylthreonine residue that is generated by a PKS assembly line with subsequent transamination.[100]
Figure 19.
Cyclosporin A. Nonproteinogenic amino acids: red = (2S,3R,4R)-(E)-butenylmethyl-Thr, blue = L-α-aminobutyrate.
Among the largest nonribosomally generated peptide macrocycles are the ramoplanin A2(34),[101] and related enduracidine (35, Figure 20) antibiotics.[48] In both of these lipid II binders residues 2 and 16 are macrocyclized to yield a 15 residue cyclic framework.[102] Among the noncanonical building blocks are 4-hydroxyphenylglycines and in enduracidine, two cyclic arginine-derived enduracididine residues.
Figure 20.
Ramoplanin and Enduracidine. Nonproteinogenic amino acids: red = enduracididine, blue = D or L p-hydroxyphenyl-Gly, green = L-3,5-dichloro-4-hydroxyphenyl-Gly or 3-chloro-4-hydroxyphenyl-Gly (in 34), purple = D-Orn, orange = (2S,3S)-β-hydroxy-Asp.
4.3. Macrocyclic hybrid nonribosomal peptide-polyketide frameworks
There are also variable macrocyclic ring sizes in hybrid NRP-PK scaffolds. The PKS machinery can generate fully reduced CH2 groups, CHOH, HC=CH, or C=O groups from partial processing of tethered intermediates on the PKS assembly lines.[17,103] These three functional groups can mix and match with the amino acid building blocks, as exemplified by the macrocyclic portion of syringolin A (20, Figure 8), epothilone B (2, Scheme 3) and rapamycin (3, Scheme 3). The naturally occurring immunosuppressant sanglifehrin A (36, Figure 21) has 22 atoms in the macrocycle, spanning 3 amino acids (Val-(m-Tyr)-piperazate) and the rest is polyketide.[104] The phytotoxin coronatine (12, Scheme 5) is produced by hybrid PKS-NRPS machinery in the plant pathogenic Pseudomonas syringae through ligation of a cyclopropyl amino acid to the polyketide fragment coronafacic acid.[34,105,106]
Figure 21.
Sanglifehrin A. Nonproteinogenic amino acids: red = L-piperazate, blue = L-meta-Tyr.
5. Generation of Nonproteinogenic Amino Acid Building Blocks
5.1 D-Amino Acids and β-Amino Acids
As noted above, microcystin LR (28, Figure 15) has a D-isomer of β-methyl isoaspartate, and vancomcyin and teicoplanin (30 and 31, Figure 17) have D-hydroxyphenylglycine units, which engage in the crosslinking of the rigid scaffold. Cyclosporin has one D-Ala residue (33, figure 24). More generally, many dozens of D-amino acid residues are found in nonribosomal peptide scaffolds. Also there are β-amino acids, as in the side chain β-Lys residue in the antitubercular agents viomycin and capreomycin (25 and 26, Figure 11), as well as β-Phe in andrimid and β-Tyr in ene-diyne antitumor natural products.[107,108]
Figure 24.
Aldehyde, imine, ketone, and diazo-containing amino acids. Nonproteinogenic amino acids: red = formyl-Gly, blue = 2-amino-3-ketobutyrate, green= α-diazoketone or α-diazoester, purple = acivicin.
5.1.1 Racemases and Epimerases
The most common route to D-amino acids is by action of amino acid racemases and epimerases to convert L-isomers to D-isomers. For example, D-Ala in bacterial peptidoglycan layers and cyclosporine are generated by prototypic pyridoxal-phosphate (PLP)-dependent racemases that generate the stabilized C2 carbanion from Ala=PLP aldimine followed by stereorandom reprotonation (Scheme 6A).[35,36] On the other hand, the glutamate, proline and diaminopimelate racemases are PLP-independent and use two active site bases – one protonated as conjugate acid, the other deprotonated as initiating active site base – to catalyze Cα stereochemical equilibration (Scheme 6B).[109] Broad specificity PLP racemases acting on a range of proteinogenic amino acids have been isolated from Pseudomonas putida,[110] as has a threonine epimerase which is representative of diastereomer interconversion (L-Thr to D-allo Thr and D-Thr to L-allo-Thr.[111]
Scheme 6.
Racemase and epimerase activity. A. PLP-dependent catalysis for racemization. B. acid-base catalysis for racemization. C. template-bound catalysis for racemization.
In many NRPS assembly lines, including those for vancomycin and teicoplanin, there are ~50 kDa epimerization domains which act not on free amino acids, but only after an L-amino acid has been activated and tethered as the L-aminoacyl thioester.[112] The C2-H is then kinetically acidic, since the carbanion is stabilized as the thioester enolate (Scheme 6C). Stereorandom return of the hydrogen yields a mix of L- and D-aminoacyl thioester for subsequent chain elongation by downstream D-specific condensation domains.[113,114,115]
5.1.2 Aminomutases Interconvert α- and β-Amino Acids
The most general route to β-amino acids is by action of a family of amino acid mutases on the corresponding α-amino acids. Two types of microbial enzymes have been observed to have such mutase activity.[116,117] One involves conversion of Phe or Tyr to the corresponding β-amino acids. In the case of tyrosine, both 3R- and 3S-β-tyrosine are produced by mutases from various sources.[37,38] This subfamily of enzymes contain a covalently attached cofactor 4-methylidene-5-imidazole-5-one (MIO), which arises during automodifying maturation of an Ala-Ser-Gly tripeptide loop in the inactive proenzyme (Scheme 7A). The MIO prosthetic group acts as an electrophilic center for attack by the amino group of bound amino acid substrate. The resulting adduct can undergo Cβ-H bond cleavage to yield a transient noncovalent olefinic species (e.g. cinnamate when starting from Phe) with the itinerant amino group tethered to the MIO moiety. The amino group is transferred back to the cinnamate by attack at Cβ rather than Cα,[118] followed by elimination of β-Phe and restoration of the starting MIO prosthetic group. All this is catalyzed by a single enzyme.
Scheme 7.
Interconversion of α- and β-amino acids. A. MIO-mediated catalysis. B. Radical intermediate catalysis.
The second subfamily of amino acid mutases is found in Clostridium spp, and encompasses lysine- 2,3-aminomutase, glutamate 2,3-aminomutase, and lysine 5,6-aminomutase.[117] This class of enzymes utilizes both pyridoxal-P and adenosyl-B12 as required coenzymes (Scheme 7B). The PLP forms its typical aldimine with the substrates, and the B12 coenzyme mediates radical intermediates in the -H for -NH2 swaps. Also in this enzyme class are D-ornithine 4,5-aminomutase (yielding (2R,4S)-diaminopentanoate) and lysine-5,6-aminomutase, each of which act on the distal rather than the proximal amine of these dibasic amino acids. The last enzyme will also process 3S-β-Lys to 3S,5S-diaminohexanoate. Mutant forms of lysine 2,3-aminomutase can produce β-alanine from Lys as a biotechnological alternative to the aspartate decarboxylase route. The β-Lys in capreomycin and viomycin may arise by this kind of enzymatic action (25 and 26, Figure 11).
5.2 C-Methyl-O-Methyl-, and N-Methyl Amino Acids
5.2.1 Methylations at Cβ
Friulimicin contains a β-methyl-Asp residue (32, Figure 18), daptomycin a β-methyl-Glu (17, Figure 5), and hormaomycin two β-Me-Phe residues (26, Figure 13). In the cases that have been investigated for timing and mechanism to date, Cβ methylations on residues incorporated by NRPS assembly lines occur at the α-keto acid stage.[119] This strategy allows use of the abundant methyl donor SAM to transfer its “CH3+” equivalent to a Cβ carbanion equivalent. Subsequent diastereoselective transamination of the 3-methyl-2-keto acids yields the β-branched amino acids (Scheme 8A). These transformations occur prior to amino acid activation by the corresponding NRPS. Multiple methylations on pyruvate can generate the dimethyl and trimethyl versions of the keto acid and yield either the corresponding amino acid (t-butylglycine) or, via reduction, t-butylglycolate, as found in kutznerides (e.g. 18, Figure 6). In contrast to de novo assembly of the t-butyl moiety on an amino acid building block, such as the t-butylglycines in the ribosomal peptide polytheonamide B and bottromycin A2 (14, Figure 2),[120,121] appear to be introduced by post-translational C-methylation of valine side chains at the β carbon, via radical SAM mediated transfer of CH3• equivalents (Scheme 8B). An analogous C-methylation of an Ile side chain also occurs to give the corresponding branched side chain. Likewise the S-β-methyl-Phe residue in bottromycin is proposed to arise by such radical SAM chemistry. Even more remarkable is a related net sequential addition of four one carbon units to build a -CH2-C(CH3)3 moiety, presumably by the same radical SAM machinery (this provides t-butylglycine, which is found in 14, Figure 2).[61] Some C-methylations of the indole ring of tryptophan involve unreactive carbon sites; those SAM-dependent methyl transfers may also proceed via CH3• radical equivalents, related to polytheonamide B biosynthesis.
Scheme 8.
C-methylation of amino acids at the β-position. A. SAM-mediated C-methylation. B. radical SAM-mediated C-methylation.
Hyperlink 24: Polytheonamide B.
5.2.2 O- and N-Methylations
For O-methylations as in the O-Me-Ser in kutznerides (Figure 6) and the O-Me-Tyr in didemnin B (29, Figure 16), the side chain-OH groups are the proximal nucleophiles to attack the activated methyl group of cosubstrate SAM in a more conventional transfer of a “CH3+” equivalent.[68,90] The timing of O-methylation, in the free amino acid or after incorporation into a peptide framework is not known.
A number of NRPs have one or more peptide bonds that have undergone N-methylation. For example, 7 of the 11 peptide bonds in cyclosporin A (33, Figure 19) and the N-terminus of vancomycin (30, Figure 17) have N-methyl substituents. N-methylation also arises via SAM as cosubstrate and involves direct capture of the itinerant methyl group by the basic amine nitrogen atom acting as a nucleophile.[2] Most of these reactions are catalyzed by methyltransferase (MT) domains that are inserted into A domains in the relevant chain-elongating NRPS modules.[2] Preliminary studies have shown that some N-methylamino acids will function in ribosomal protein biosynthesis in E. coli.[122, 123]
5.3 Side Chain Hydroxylations
In the 15 nonribosomal peptide scaffolds noted in the previous section there are 11 hydroxylated noncanonical amino acid residues. Two of the hydroxyl groups, in the statine residue of miraziridine A (13, Figure 1) and the isostatine residue of didemnin B (29, Figure 16), do not derive from molecular oxygen (O2), but instead stem from the carboxyl of Val and Ile, respectively, as will be discussed in section 6.1 below on hybrid NRPS-PKS modules. All the others arise from O2 by action of hydroxylases that come in two varieties, depending on the cofactor involved. They either harbor an inorganic redox active metal iron center, or they use the organic coenzyme FAD. The former generates a high valent oxoiron intermediate (Scheme 9A), whereas the latter forms a less reactive FAD-OOH as oxygen transfer agent (Scheme 9B).[124] The choice between these two alternative mechanisms is tuned to the reactivity of the amino acid receiving the electrophilic oxygen atom.[125] The six amino acid units in the particular examples (Tyr, Arg, Pro, Glu, Orn, Homo-Tyr) hydroxylated at specific carbon sites all require iron as cofactor, either as heme iron in cytochrome P450 superfamily monooxygenases or as mononuclear nonheme ferrous iron.[126,127] Both forms of these iron-containing oxygenases generate high valent oxoiron species which can cleave unactivated C-H bonds homolytically and deliver back an OH• equivalent.
Scheme 9.
Hydroxylation of amino acids. A. Metal-dependent hydroxylation. B. FAD-dependent hydroxylation.
The proteinogenic building block p-Tyr is normally generated by the nonheme iron-containing phenylalanine hydroxylase, which oxygenates the phenyl ring of Phe regiospecifically. In sanglifehrin biosynthesis (36, Figure 21)[9] and also in the peptidyl nucleoside pacidiamycin antibiotic family,[128] an orthologous Phe hydroxylase acts instead with specificity at the meta position to generate nonproteinogenic meta-tyrosine. Enzymatic hydroxylation at the benzylic (C3) position of Tyr residues yields the β-OH-Tyr that is found in positions 2 and 6 of vancomycin and teicoplanin.[39,40] The three oxidative crosslinkings in the vancomcycin scaffold (and four in teicoplanin) occur by sequential action of three hemeprotein oxygenases encoded in the biosynthetic gene cluster and link β-OH-Tyr2 to 4-OH-Phegly4, 4-OH-Phegly4 to β-OH-Tyr6, and 4-OH-Phegly5 to dihydroxyPhegly7 (see hyperlink in Figure 17).[129] Importantly, whereas β-OH-Tyr is synthesized as a free amino acid, oxidative crosslinking occurs on the NRPS assembly line.
The glutamate-3-hydroxylase in kutzneride maturation (18, Figure 6) is a mononuclear nonheme iron oxygenase,[15] while the oxygenase acting on both C4 and C5 of the Orn side chain in the echinocandin scaffold (25, Figure 12) is a heme-containing oxygenase.[130] It is not yet clear how to predict whether a given iron-based hydroxylase will modify the free amino acid (a feature that would be most useful for their incorporation into proteins) or instead work only on that residue displayed on the deshydroxy peptide scaffold.
Flavin-dependent monooxygenases utilize FL-OOH as the transfer agent for an electrophilic (“OH+”) equivalent.[125] For example, formation of the 2-Cl-N-OH-pyrrole moiety in hormaomycin biosynthesis (26, Figure 13)[51] and hydroxylation of a 3-Cl-β-Tyr monomer to produce 3-Cl-5-OH-β-Tyr in C-1027 biosynthesis require the activity of FAD-containing monooxygenases.[131] Both oxidative reactions occur on aminoacyl substrates tethered to T domains via thioester linkages. Analogously, for N-hydroxylations at basic side chain amino groups (e.g. Orn, Lys in siderophore biosynthesis and at indole NH of Trp-based scaffolds), the nucleophilicity of the nitrogen is sufficiently high that a less robust oxygenation reagent can suffice. Many such amino acid N-oxygenases are flavin-containing enzymes and act on free-standing amino acid substrates.[125] Further oxygenation of N-OH to nitroso and even nitro groups occurs in some NRP biosynthetic pathways, as proposed in formation of the nitrocyclopropyl-alanine residue in hormaomycin (26, Figure 13).[51]
5.4 Chlorination as a Variant of Hydroxylations
Amino acid chlorinases can be viewed as variants of the above oxygenase families.[41] In cases where a formal replacement of C-H by C-Cl occurs at unactivated carbon atoms in amino acid side chains, the responsible catalysts are again mononuclear iron enzymes, consuming one equivalent of O2 and α-ketoglutarate in close analogy to their oxygenase counterparts. Such halogenases for Pro, Leu, and Ile act on aminoacyl-S-T substrates.[132]
In contrast, the electron rich phenol side chain of Tyr and the indole ring of Trp residues again require a weaker chlorinating agent, in analogy to the hydroxylation chemistry cited above that is effected by flavoenzymes. Here too, FADH2-containing halogenases are used, this time to deliver a “Cl+” equivalent.[133] Examples include the 3-Cl-β-OH-Tyr residues of glycopeptide antibiotics (Figure 17),[134] ClmS in chloramphenicol biosynthesis,[135] the 6,7-dichloro pyrroloindole residue in the kutznerides (Figure 6),[136] the biosynthesis of Dicyostelium differentiating factor Dif-1 and in the biosynthesis of the 2-Cl-N-OH-pyrrole carboxy residue of hormaomycin (Figure 13).[51,137] This is also the likely route to 3,5-dichloro-4-OH-Phegly found in enduracidine (35, Figure 20).[48] Intriguingly, the known iron-based halogenases recognize aminoacyl-S-T substrates, while the known flavin-dependent ones act upon free amino acids.
5.5 Diamino Acids
While the six-carbon dibasic (α,ω) amino acid lysine is a proteinogenic building block, the three-, four-, and five-carbon chain dibasic amino acids are nonproteinogenic building blocks. For example, the three carbon 2,3-DAP residue is found in capreomycin (25, Figure 11), five 2,4-DAB residues are incorporated into the cationic polymyxins (15, Figure 3), and the 2,5-diaminopentanoate (a.k.a. ornithine) occurs in echinocandin (25, Figure 12) (where it is doubly hydroxylated) and daptomycin (17, Figure 5) as well as in many siderophores.[138] In addition to the linear 2,4-DAB, the branched 2,3-DAB is also found, as the 2S,3S-diastereomer, in friulimcin (32, Figure 18).[55]
The biosynthesis of ornithine has been well studied, as it is in fact a component of primary amino acid metabolism. 2,3-DAP arises from pyridoxal-P-dependent enzymatic dehydration of serine to a dehydroAla-PLP intermediate which is captured by NH3 (see hyperlink for Figure 11).[44] The branched 2,3-DAB would arise correspondingly from Thr dehydration and regiospecific C3 amination, whereas the 2,4-DAB regioisomer presumably arises from enzymatic transamination of the primary metabolite aspartate semialdehyde (see the hyperlink in Figure 3).[45,46]
5.6 Cyclization of Amino Acid Units
5.6.1 Three Membered Rings
One of the important functional attributes of nonproteinogenic/noncanonical amino acids are the conformational constraints and rigidifications that are imparted into the structural scaffolds where they are incorporated. One such constraint is ring size. In the monocyclic amino acid category, one can observe three membered rings, such as 1-amino-1-carboxy-2-ethyl cyclopropane in the coronamate moiety of coronatine (12, Scheme 5),[105] the aziridine dicarboxylate in miraziridine A (13, Figure 1)[55] and in the nitrocyclopropyl ring embedded in the lysine-derived framework of hormaomycin (26, Figure 13).[51] The cyclopropane in coronamic acid derives from L-allo-Ile (whose direct precursor is unknown, but could be D-Ile). L-allo-Ile is activated and tethered on the S-pantetheinyl arm of a thiolation domain of the CmaA enzyme.[34] There it undergoes regiospecific Cγ-chlorination, prior to intramolecular attack of Cα on that chlorine-substituted Cγ to construct the cyclopropane framework of coronamic acid, still tethered via the thioester linkage. Thioesterase action releases the free coronamic acid,[139] which on subsequent ligation to the bicyclic polyketide coronofacic acid, yields the toxin coronatine (12, Scheme 5) a mimic of jasmonate plant hormones. Processing of Val in place of Ile through this chlorination/cyclization process yields the methyl-substituted cyclopropane known as norcoronamic acid. The cryptic chlorination chemistry is mediated by an O2-requiring iron-containing halogenase as discussed above; instead of transferring an OH•, CmaB transfers a Cl• from the active site high valent iron intermediate.[34]
Less obvious is the enzymatic pathway from Lys to nitrocyclopropyl alanine found in the hormaomycin (26, Figure 13); the C6-amino group of Lys has been oxidized to the nitro group and a cyclopropane has formed by a C-C bond-forming step between C4 and C6. Two possible variations of ring opening of an internal lactone following N6 and C4–oxygenation have been proposed but not yet validated.[51]
5.6.2 Orn and Lys Cyclizations
Six ring cyclic amino acids include the proline analog pipecolate, found in rapamycin (3, Scheme 3) and the 1,2-hydrazo analog piperazate, seen in sanglifehrin (36, Figure 21) and the kutznerides (18, Figure 6). Pipecolate is synthesized by lysine cyclodeaminase;[140] this enzyme class has tightly bound NAD and begins catalysis by oxidation of the α-amine to the imine, yielding NADH as a transient coproduct still bound to the enzyme. Intramolecular attack by the C6-NH2 generates a cyclic imine after elimination of NH3. That Δ1-piperidine carboxylate is finally reduced by hydride transfer from the bound NADH, regenerating the starting oxidation state of the bound coenzyme (NAD) and releasing pipecolate (Scheme 10A).
Scheme 10.
Pipecolate and piperazate biosynthesis. A. Proposed mechanism of pipecolate formation. B. Proposed mechanism of piperazate formation.
The piperazate ring found in kutznerides and sanglifehrin is also a cyclic six-atom framework but has two nitrogens in a hydrazo (N1-N2) linkage. It derives from ornithine and the first step is N5-hydroxylation by a dedicated flavoprotein.[141] It has been proposed that further oxygenation occurs to the nitroso-Orn to set up intramolecular attack of the α-NH2 as nucleophile on such an electrophilic Nδ to form the unusual N-N bond (Scheme 10B).
5.6.3 Cyclization of the Guanidino Moiety of Arg
Five- and six-membered, Arg-derived cyclic guanidines are observed in enduracidine (35, Figure 20) and capreomycin (25, Figure 11), respectively. The six member capreomycidine ring is proposed to arise following hydroxylase action to create a 3-OH-Arg.[142,143] Subsequent processing by a pyridoxal-P dependent enzyme presumably leads to water elimination and the formation of an eneamino-PLP adduct. Intramolecular capture by the terminal guanidino nitrogen and release from the PLP-enzyme active site yields the cyclic six-membered ring (Scheme 11A). In viomycin (26, Figure 11) the same ring system has undergone an additional hydroxylation to generate a tuberacidine residue. The route to the five membered cyclic guanidine in enduracidine is not as well worked out (the route shown in Scheme 11B is hypothetical), although a similar ring system in guadinomine is formed via intramolecular attack of the guanidine group terminal nitrogen on a C3–C4 unsaturated intermediate.[144] In mannopeptimycins (e.g. 37, Scheme 11B), a subsequent C3 hydroxylation yields β-OH enduracididine.[145]
Scheme 11.
Biosynthesis of cyclic guanidines. A. Biosynthesis of six-membered cyclic guanidines. B. Biosynthesis of five-membered cyclic guanidines. Nonproteinogenic amino acids: red = capreomycidine, blue = tubericidine, green = (2S,3S)-β-methyl-Phe, purple = β-hydroxyenduracididine, orange = enduracididine.
5.6.4 Pyrroloindole tricycle formation
Tryptophan residues can be converted to tricyclic pyrroloindole cores by addition of two types of electrophiles across the 2,3-double bond of the pyrrole ring. In generation of the Bacillus competence factor peptides, a geranyl group is added to C3 while the downstream amide NH attacks C2.[146] Alternatively, delivery of electrophilic oxygen “OH+” by oxygenases (to generate a transient indole epoxide/hydroxyiminium species) can induce capture by the intramolecular amide to build a pyrroloindole, as in kutzneride biosynthesis (Figure 6).[147] These modifications must occur either on the elongating linear chain or after the macocyclic peptide scaffold has formed, and are not possible on free Trp. A comparable indole epoxygenation is observed during fungal biosynthesis of fumiquinazoline A.[148]
5.7 Kynurenine and Alkyl/Alkenyl Proline Biosynthesis
In daptomycin (17, Figure 5), the indole ring of tryptophan is oxidatively cleaved by tryptophan 2,3-dioxygenase action to yield N-formylkynurenine. Enzymatic deformylation yields free kynurenine which is selected, activated, and incorporated by the adenylation domain of NRPS module 13 to become the terminal residue in daptomycin.
Analogous oxidative cleavage of the aromatic ring of tyrosine occurs in the formation of the propenyl proline monomer of hormaomycin (26, Figure 13) as well as in anthramycin, suburomycin, and tomayomycin biosynthesis.[149] The pathway starts by conventional ortho-hydroxylation of Tyr to L-DOPA and subsequent oxidation of the diphenol to the orthoquinone.[150] Intramolecular attack by the amine on the quinone yields a dihydroxyindole carboxylate that is substrate for oxygenative cleavage of the ring by an intradiol-type dioxygenase, yielding a diene-diacid-substituted proline. Removal of the C2 enoyl acid moiety must occur as well as addition of a C1 fragment, probably by radical SAM enzymology, to construct the C3-substituted propenyl group.[51] The propenyl Pro is presumed to be a free amino acid building block for the hormaomycin NRPS assembly line, with a total of six enzymes required to morph Tyr into propenyl proline.[51]
5.8 Hydroxylated Phenylglycines
The 4-OH-Phegly monomer of vancomycin (30, Figure 17), teicoplanin (31, Figure 17) and enduracidine (35, Figure 20) is synthesized by four enzymes.[42] The first enzyme, a homolog of prephenate decarboxylase, utilizes the primary metabolite prephenate to produce p-hydroxyphenylpyruvate. A second decarboxylase oxygenates this intermediate to form p-hydroxymandelate. Oxidation of the hydroxy group to the ketone yields p-hydroxybenzoylformate; the last enzyme transaminates this keto acid to L-p-OH-Phegly. Phenylglycine in pristinamycin (Scheme 4) is also derived from an analogous pathway.[43]
By contrast, 3,5-dihydroxyphenylycine building block in glycopeptides is assembled via a distinct four enzyme pathway (Figure 17). The pathway commences with a type III PKS that uses 4 molecules of malonyl CoA and via iterative decarboxylative Claisen reaction to build the eight carbon framework of 3,5-dihydroxyphenylacetyl CoA.[151,152] The next enzyme performs a remarkable chain-shortening oxidation, dependent on O2, to generate the 3,5-dihydroxybenzoylformate. The last enzyme is a transaminase to produce the 2S (L-) form of 3,5-(OH)2-Phegly.
6 Amino Acids Synthesized by Enzymatic Assembly lines
A variety of nonproteinogenic amino acids, or peptides containing them, have been isolated in which the amino group is at C4 rather than at C2. In many of these cases it appears a two-carbon extender, derived from a CH2-COOH equivalent has been inserted into a proteinogenic amino acid scaffold and now comprises C2 and C1, respectively. C3 in these scaffolds is often an OH or an O-CH3 group, as seen in the amino acids statine in miraziridine A (13, Figure 1) and isostatine in didemnin B (29, Figure 16). This is also the case for dolaisoleucine in dolastatin 10.[153] The statine residue with its tetrahedral C3-OH has been a useful inhibitory mimic of tetrahedral intermediates in the action of several aspartyl proteases.[154] Correspondingly miraziridine has a vinyl-Arg residue at its C terminus (Figure 1) and syringolin A has a vinyl-Val (20, Figure 8).
These residues are also extended by an acetate-derived two-carbon unit, but now C2 is an sp2 rather than an sp3 carbon.
6.1 NRPS-PKS Hybrids
These amino acid connectivity observations could be fully rationalized when biosynthetic gene clusters became available for natural products containing such 3-OH-4-NH2 acid frameworks, such as didemnin B (29, Figure 16) and andrimid.[155,107,156] The didemnin cluster encodes a PKS module interspersed between NRPS modules. In that case the growing N-acyl-Pro-D-MeLeu-Thr-Ile-S-tetrapeptide is tethered as a thioester to the S-pantetheinyl arm of the peptidyl carrier protein domain of the fourth NRPS module. Immediately downstream is a PKS module, which tethers a malonyl unit also in S-pantetheinyl linkage, on its own acyl carrier protein. Decarboxylation of the malonyl moiety generates the C2 carbanion for capture of the upstream peptidyl thioester. The C-C bond-forming step moves the peptidyl moiety onto the two-carbon unit and sets the peptidyl amino carbon as C3.The PKS module has a ketoreductase domain so the β-keto group is then reduced to the β-OH, yielding the isostatyl residue as a ketide-extended N-acyl-pentapeptidyl thioester. This intermediate is then subjected to further chain elongation by the downstream NRPS modules, with macrocyclizing release, embedding the isostatyl residue within the macrocyclic framework. The mechanism of an isostatyl biosynthesis is shown in Scheme 12A.
Scheme 12.
Hybrid NRPS-PKS assembly lines. A. Isostatine assembly into didemnin B. Mechanism of statine biosynthesis. C. Vinylogous amino acid biosynthesis. Nonproteinogenic amino acids: (A) red = O-Me-Tyr, blue = isostatine; (B) red = statine; (C): red = vinyl-Val, blue = vinyl-Arg, green = vinyl-Tyr.
Analogously, if Leu is activated by such an NRPS-PKS hybrid, the resulting chain-extended hybrid building block would be a statyl residue (Scheme 12B) as in miraziridine. In the andrimid biosynthetic pathway, the comparable NRPS-PKS pair lacks a reductase domain within the PKS module, so the β-ketone functionality is retained.[156] If the chain-elongated intermediate in the statine pathway with its β-OH group were further subjected to typical α,β-dehydration by a dehydratase domain, the resulting product would be vinyl-Leu (Scheme 12C). The comparable vinyl-Val residue is known (Figure 8). Alternatively, starting from Arg, comparable logic in an NRPS-PKS bimodule would generate vinyl-Arg for the miraziridine assembly line (Figure 1).[54] Similarly, a vinyl-Tyr can be found in cyclotheonamide.[157] Finally, if the PKS module has an enoyl reductase (ER) domain, the vinyl group would undergo hydrogenation, resulting in an amino acid extended by a CH2COOH group. Such extended Tyr and Phe residues are found in the tubulin inhibitors tubulysins B (39) and E (40, Scheme 13A), where they are known as tubu-tyrosine and tubu-phenylalanine.[158]
Scheme 13.
Hoiamide and tubulysin biosynthesis. A. Tubu-tyrosine and tubu-phenylalanine biosynthesis. B. Mechanism of 4-amino-3-hydroxy-2,5-dimethylheptanoate biosynthesis Nonproteinogenic amino acids: red = 4-amino-3-hydroxy-2,5-dimethylheptanoate, blue = tubu-Tyr or tubu-Phe.
Many PKS assembly lines use methylmalonyl-CoA instead of malonyl-CoA as chain extender units. Indeed the hoiamides (e.g. hoiamide A (41), Scheme 13B) harbor a 4-amino-3-hydroxy-2,5-dimethyl heptanoate residue that results from an Ile -and methylmalonyl-utilizing NRPS/PKS hybrid module.[159] Of course one can have more than one PKS-mediated chain extension cycle to build longer alkyl chains. For example, the 3-amino-2,5,7,8-tetrahydroxy-10-methylundecanoate in cyanobacterial pahayokolides may arise from leucine and three malonyl-CoA extender units with additional adjustments of redox states.[160] Here, the β-amino acid would be generated by reductive amination of the β-keto acid produced by the PKS machinery.
It should be noted that the opposite arrangement of modules (i.e. PKS-NRPS) also exists in nature’s repertoire of hybrid assembly lines, as in the case of rapamycin (3, Scheme 3). However, these configurations typically do not yield an amino acid product; instead, they catalyze amide bond formation.
6.2. Amino Acids Synthesized by PKSs Followed by Reductive Amination
The nonproteinogenic amino acids Aeo in the cyclic tetrapeptide HDAC inhibitors trapoxin and HC toxin (23 and 24, Figure 10), Adda (3-amino-9-methoxy-2,6,8-trimethyl-10-phenyl-4,6-decadienoate) in microcystin LR (28, Figure 15),[79,161] and butenylmethylthreonine in cyclosporin A (33, Figure 19)[100] are clearly reminiscent of PKS assembly line origins. For Adda in particular, a benzoyl CoA starter unit gets extended by three methylmalonyl CoA and 2 malonyl CoA extender units, yielding a β-keto acid that undergoes transamination (Scheme 14A).[79,162] A similar route and PKS machinery may be involved in the formation of the Atmu (3-amino-2,5,7,8- tetrahydroxy-10-methyl-undecanoate) unit in pahayokolides,[160160] this time with additional hydroxylation events at C2 and C6. The long chain (C15–17) β-amino acid starter units in mycosubtilin (27, Figure 14) and iturin are also derived by a similar mechanism. In this instance a β-keto acyl-CoA, from fatty acid biosynthesis, undergoes transthiolation on the NRPS heptapeptide assembly lines and then undergoes enzymatic transamination in situ as preamble to building the acylated peptide chain.[86b] That N-terminal β-amino fatty acyl group is subsequently the nucleophile in the last step of mycosubtilin assembly to create the heptapeptidyl macrolactam scaffold. Note that this route to β-amino acids is distinct from the action of amino acid mutases on β-amino acids, such as the conversion of α-Phe to β-Phe carried out by MIO-containing mutases noted in section 5.1.2.
Scheme 14.
Biosynthesis of polyketide precursors to amino acids. A. Biosynthesis of 3-amino-9-methoxy-2,6,8-trimethyl-10-phenyl-4,6-decadienoate (Adda). B. Biosynthesis of butenylmethyl-Thr.
The butenylmethylthreonine residue (2S,3R,4R,6E-2-amino3-hydroxy-4-methyl-6-octenoate) in the immunosuppressant undecapeptide cyclosporin A (33, Figure 19) is an α-rather than a β-amino acid but similarly has a PKS origin (Scheme 14B).[163] The nine carbon thioester is assumed to undergo oxygenation and oxidation of the CH2 at Cα, followed by hydrolysis and transamination to give the α amino acid. An analogous route is envisioned for the 2-amino-8-keto-9,10-epoxy-decanoate (Aeo) residue in the HC-toxin and trapoxin family of cyclic tetrapeptide antagonists of histone deacetylases (Figure 10).[79] The epoxide is thought to arise from a Δ9 olefin. The saturated 2-amino-8-ketodecanoate residue homologous to Aeo is found in apicidin,[164] and the 9-OH analog in another member of the cyclic tetrapeptide fungal metabolite family, consistent with differential tailoring of the polyketide precursors.[165]
7. HomoTyr and Other Homologated Amino Acids
HomoPhe and homoTyr, where an additional methylene has been inserted into the chain, arise from the scaffolds of Phe and Tyr, respectively. Homotyr residues are found in some cyanopeptolins and 3,4-dihydroxy-homoTyr is found in echinocandins.[85,166] Early labeling studies had shown that the dihydroxy homoTyr in echinocandins (e.g. 25, Figure 12) arises from Tyr and the two carbons of acetate. It is presumed that the carboxylate moiety of the Tyr precursor is lost (two carbons added, one carbon lost). Precedent for such net CH2 homologation occurs during leucine biosynthesis.[167] By analogy, for homoTyr, the C2 carbanion of acetyl CoA could add into the ketone of p-OH-phenylpyruvate (arising from Tyr transamination) to yield a β-OH adduct, which can isomerize to the α-OH, β-COOH adduct via reversible dehydration and rehydration in the opposite regiochemical sense (Scheme 15). Oxidation of the hydroxyl to the ketone and resultant facile decarboxylation of the β-carboxylate yields the homologated α-keto acid. Transamination then gives homoTyr. Dihydroxylation at C3 and C4 to complete the framework found in echinocandins is presumed to occur via iron-based oxygenase action. An analogous pathway from Phe and acetyl CoA would lead to homoPhe, found in some nonribosomal peptides including in pahayokolides as the D-isomer.[160]
Scheme 15.
Biosynthesis of homo-Tyr.
In norvaline biosynthesis, α-ketobutyrate can substitute for p-OH-phenylpyruvate in the initial condensation with the acetyl CoA carbanion. If instead α-keto-β-methyl valerate (derived from transamination of Ile) is the keto acid condensing partner with acetyl-CoA for isopropylmalate synthase, homoisoleucine will arise at the final transamination step. The simplest homologated nonproteinogenic amino acid is 2-aminobutyrate, (homologated alanine), available from transamination of the same α-ketobutyrate.
8. Amino Acids with Potentially Reactive Functional Groups
Proteinogenic amino acids do not contain any reactive or potentially reactive electrophilic side chain functionalities such as olefins, aldehydes or ketones, ruling out unwanted side reactions of these groups once installed in proteins. On the other hand natural nonproteinogenic amino acids do contain some potentially reactive functional groups such as olefins (and very rarely alkynes), epoxides (and very rarely aziridines), and, exceedingly rarely, the β-lactam acylation warheads.[168]
8.1 Olefins and Alkynes
α,β-Olefinic amino acids are enamines and will not be stable, due to spontaneous isomerization to the imines, followed by hydrolysis to ammonia and the keto acid. One can observe posttranslational dehydration of Ser and Thr residues in small proteins to yield the α,β-unsaturated amides (dehydroAla and also dehydrobutyrine and also in pahayokolide).[160,169] While these are kinetically stable they are electrophilic at Cβ due to conjugation to the amide and undergo capture by nucleophiles: this is the basis for lanthionine and methylanthionine residue formation, notable in the lantipeptides. In capreomycin the dehydroalanine moiety is further modified with a ureido group (25, Figure 11).
In contrast, β,γ-olefinic amino acids are stable.[170] The parent in this category, vinyl-Gly (42, Figure 22A) is a known natural metabolite,[171] although its biosynthesis is not fully clear: a byproduct of PLP-containing γ-elimination enzymes would be one route. A related β,γ-olefinic amino acid is dehydro-Val (43, Figure 22A), which occurs in the hexapeptide mycotoxin phomopsin A.[172] The β-alkoxy substituted vinylglycines, including methoxyvinylglycine (44) and rhizobitoxine (45, Figure 22A), are less reactive than vinyl-Gly as free metabolites.[173,174] Once they undergo elimination of the -OR group in the active site of PLP-enzymes they are subject to capture by other nucleophiles including oxygen and sulfur side chains of amino acids in the enzyme. Notably, these amino acids are synthesized as free metabolites (i.e., not tethered to T domains of NRPS modules).
Figure 22.
Alkyne and alkene containing amino acids. A. red = β,γ-olefins, B. blue = γ,δ and δ,ε-olefins. C. green = alkynes and an allene. D. purple = dienes.
The simplest γ,δ-olefinic amino acid allylglycine (46, Figure 22B) is also a known metabolite, in mushrooms, but its biosynthetic pathway has not been delineated.[175,176] It is a stable molecule but the olefin can be isomerized into conjugation and become electrophilic by the subset of PLP-enzymes that carry out γ-elimination and replacement reactions.[177] Several other olefinic amino acids have also been isolated from a variety of plants, including β,γ-unsaturated (methylenecyclopropyl)-Gly (47) which occurs in the seeds of Litchi chinensis;[178] δ,ε-unsaturated amino acid Hypoglycin A (48), which occurs in the seeds of Blighia sapida (ackee) fruit and is known to cause Jamaican vomiting sickness if ingested;[179] and cyclopentenyl-Gly, which is known to inhibit isoleucine utilization in E. coli (49, Figure 22B).[180] The alkyne congener of vinyl-Gly, L-propargylglycine (50, Figure 22C) has the same stability and latent reactivity with the same set of PLP enzymes.[181], Several other naturally occurring alkyne containing amino acids are known, such as 2-amino-4-methyl-5-hexynoic acid (51), 2-amino-4-hydroxy-5-hexynoic acid (52), and 2-amino-4-hydroxy-6-heptynoic acid (53, Figure 22C), all isolated from the seeds of Euphoria longan.[182] Additionally, an allene containing amino acid (2-amino-4,5-hexadienoic acid (54), Figure 22C) was isolated from the mushroom Amanita solitaria.[183]
2,5-Dihydrophenylalanine (55, Figure 22D) is another γ,δ-olefinic amino acid that can act a bacterial antimetabolite interfering with Phe metabolism.[184] Another amino acid of particular interest in the natural dienoic dihydro amino acid category is gabaculine (56, Figure 22D).[185] Its biosynthesis is not yet resolved but its mechanism of action with certain PLP-dependent enzymes has been studied.[186] Formation of the gabaculine=PLP aldimine, followed by Cα-H abstraction generates an intermediate that can aromatize and in so doing capture the PLP coenzyme as a nonhydrolyzable amine adduct.[185]
8.2 Epoxides
While prevalent in certain terpene and polyketide classes, epoxides are relatively rare in amino acid-derived natural products.[168] One amino acid noted above is the 2-amino-8-keto-9,10-epoxydecanoate, Aeo, responsible for the covalent modification of histone deacetylases when embedded in the HC-toxin/trapoxin class of cyclic tetrapeptides (Figure 10).[79] Almost assuredly the epoxide arises from the T-domain bound 9,10-olefin precursor by action of an iron-based monooxygenase, as recently proven for dapdiamide C.[187] When bound to active sites of their target enzymes, such epoxides require an acid catalyst to transfer a proton to the oxygen, to lower the transition state energy for C-O bond cleavage as some nucleophile attacks the carbon.
8.3 Aldehydes and Ketones
Aldehyde and ketone functional groups are absent in proteins, as pointed out by Schultz and colleagues in their successful efforts to expand the genetic code and incorporate ketone-containing unnatural amino acids into engineered proteins to increase functional group capacity.[33]
However, aldehydic amino acids are central intermediates in the metabolism of the amino acids aspartate and glutamate. Each of the side chain carboxylates can be phosphorylated enzymatically to yield aspartyl-β-phosphate (57) and glutamyl-γ-phosphates (58, Figure 23A), respectively. These are then enzymatically reduced by hydride transfer from dihydronicotinamide cofactors.ref The Asp aldehyde is a key intermediate in microbial and fungal biosynthesis of lysine, methionine isoleucine, and threonine.[167] The glutamyl-γ-semialdehyde is a corresponding intermediate by transamination to ornithine or by intramolecular imine formation and NADH-mediated reduction to proline. A cyclized form of the glutamyl semialdehyde is found in some nonribosomal peptides where it has been termed an Ahp residue (3-amino-6-hydroxypiperidone (59), Figure 23A).[188] The nitrogen of the immediately downstream amide in the peptide backbone acts as a nucleophile to form the cyclic hemiaminal linkage, a conformationally restricting cyclization event. Examples of such natural products include stigonemapeptin,[188] symplocamide,[189] and cyanopeptolins.[190]
Figure 23.
Amino acids accessed through aldehyde intermediates. A. glutamyl and aspartyl phosphates. B. Paraherquamide A biosynthesis. Nonproteinogenic amino acids: blue = 5-oxo-Ile, red = β-Me-Pro, green= Ahp.
In contrast to the reduction of acid groups to aldehydes for Asp and Glu, the terminal δ-CH3 of Ile is oxidized to the corresponding aldehyde (5-oxo-Ile, 60), presumably through the alcohol intermediate, both in the free amino acid on the way to form β-methylPro (61) which is incorporated into paraherquamide A (62, Figure 23B).[191]A similar residue is also embedded in the peptide macrocycle of GE37468 to yield the β-methyl-δ-OH proline residue in that scaffold.[192] Cysteine residues in some proteins, such as sulfatases, can be oxidized posttranslationally to the side chain aldehyde, yielding formylglycine (63, Figure 24) residues which participate as electrophiles in sulfate ester substrate hydrolysis.[118] These unique formylglycine residues can be captured by external nucleophiles to engineer site-specifically modified proteins.
Ketones are more thermodynamically stable carbonyl groups than aldehydes. A few cases of ketone-containing nonproteinogenic amino acids are known. The most prominent is 2-amino-3- ketobutyrate (64, Figure 24), arising from enzymatic oxidation of the side chain alcohol of L-threonine.[193] This four carbon α-amino-β-keto acid is subject to retro aldol cleavage in the presence of CoASH as cosubstrate to yield acetyl CoA from C3,4 and glycine from C1,2. Retro aldol fragmentation would be a continuing but presumably muted liability if 2-amino-3-keto amino acids were incorporated into protein backbones.
6-Diazo-5-oxonorleucine (DON (65), Figure 24) provides a diazomethyl ketone in its compact scaffold.[194] Its biosynthesis has not been reported even though it was isolated 60 years ago. A related α-diazoester, azaserine (66, Figure 23), is also naturally occurring and has antitumor and antibiotic properties.[194,195] The diazoketone group is set up for loss of dinitrogen (N2) on attack by nucleophiles, so an engineered tRNA/tRNA synthetase pair for DON might be of use for selective reactivity.
The cyclic amino acid acivicin (67, Figure 24) is another example of a heterocyclic electrophile that mimics glutamine residues.[196] Its biosynthesis remains unknown.
8.4. α,β-Unsaturated Amides, Epoxyamides, and Epoxyketones
When olefins are α,β- to carbonyl groups they become activated as electrophiles for conjugate addition. We noted the vinyl-Val residue in syringolin (20, Figure 8) as the reactive site for covalent modification of the proteasome.[197] Another example is in the dapdiamide family of antibiotics, where the nonproteinogenic amino acid 2,3-DAP has a fumaramoyl group appended to the β-amino functionality.[198] When dapdiamide A (69, Figure 25) binds to the active site of the glutaminase domain of a bacterial glucosamine-6-P synthase, its fumaramoyl-DAP moiety mimics a “stretched” glutamine whose electrophilic group can covalently inactivate the active site Cys.[199] In this instance the conjugated olefinic group is not built de novo into the amino acid but brought in by enzymatic acylation (of a DAP-Val dipeptide intermediate) with fumarate and then converted from the fumaroyl- to the fumaramoyl-DAP (Figure 25).
Figure 25.
Dapdiamides and anticapsin. Nonproteinogenic amino acids: red = L-2,3-DAP, blue = anticapsin.
In some members of the dapdiamide antibiotic family noted above, such as dapdiamide E (70, Figure 25), the fumaramoyl group is converted by a mononuclear iron enzyme to the epoxysuccinamoyl group, yielding the epoxysuccinamoyl-DAP as the glutamine analog which again captures the active site Cys thiolate in the glutaminase domain of glucosamine-6-P synthase.[187]
Another amino acid with a epoxycabonyl side chain is anticapsin (71, Figure 25), the warhead in bacilysin.[200] The keto group ultimately derives from the 7-OH group of prephenate, by enzymatic oxidation at a late stage in diversion of prephenate through to anticapsin.[201] Similar to dapdiamide E, the epoxycyclohexanone side chain can capture the same active site cysteine of glucosamine-6-P synthase.[202]
9. Challenges and Opportunities for Utilization of Nonproteinogenic Amino Acids as Protein Building Blocks
In this review, we have introduced the reader to a broad range of nonproteinogenic amino acids. A number of these can be readily biosynthesized as freestanding building blocks, and simply await exploitation by the protein engineer. However, several levels of potential challenges can be envisioned for the ready utilization of some of the other amino acids. Consider, for example, five examples. The olefinic allylglycine (46, Figure 22B) and the alkynoic propargylglycine (50, Figure 22C) and congeners could provide useful in providing bioorthogonally reactive functional groups in proteins. Likewise, the diazoketone amino acids DON (67, Figure 24) and azaserine (68, Figure 24) have specific reactivity features, as does aziridine dicarboxylate (Figure 1). A third set of unusual building blocks is represented by the β-oxy, γ-amino dolaisoleucine.[153]
A first level of challenge is whether the biosynthetic genes are known from the producer organism. For none of these five building blocks is that currently the case. This problem may be readily addressable for the DON producer Streptomyces ambofaciens which has been known for decades and whose genome could be readily sequenced, albeit it is not entirely clear what kind of encoded enzymes to look for bioinformatically.[194] In contrast, the dolastatin 10 producer comes from Symploca sp. V642 in a complex bacterial community associated with the sea hare Dolabella auricularia so gene identification is substantially more challenging.[203] Finally, for allylglycine (46, Figure 22B) and 2-amino-4,5-hexadienoic acid (54, Figure 22C) which are mushroom metabolites, those genomes have yet to be sequenced and it is also not clear what the biosynthetic clusters will look like a priori.[175,176,183]
A second level of challenge is to assign function to enzymes encoded by a given gene cluster so that genes necessary and sufficient to make the amino acid building block can be moved to a desired surrogate production host. Related to this issue is deciphering the chemical logic and the number of protein-mediated steps required to construct the novel amino acid. A corollary is whether the pathway generates the free amino acid or one tethered to a domain on an NRPS or PKS thiolation domain. In the latter case, as noted earlier, a thioesterase needs to be found and included with the NRPS/PKS genes to release the free amino acid.
The third level of challenge includes engineering an orthogonal tRNA/aminoacyl-tRNA synthetase pair to recognize and activate the specific nonproteinogenic amino acid building block. This has been successfully accomplished for many α-acids over the past decade but β- and γ-amino acids are likely to be substantially more problematic for such evolution. These latter building blocks will also provide problems for ligation by the peptidyl transferase centers of the ribosome. On the other hand as noted earlier in this review it appears that at least some aliphatic N-methyl but not N-ethyl amino acids can be ligated during ribosomal protein biosynthesis.
Notwithstanding these potential barriers, amino acids are central and versatile building blocks in the chemistry of life, and amino acid metabolism in microbes can go far beyond the familiar 20–22 proteinogenic α-amino acids. Although no single organism shows the full biosynthetic capacity, the inventory of known naturally occurring nonproteinogenic amino acids can be up to 20 fold greater than the proteinogenic set. While many of the amino acids can only be found in secondary or conditional metabolic pathways, others such as ornithine, homoserine, and aminolevulinate are central players in primary metabolism. The enzymatic pathways from proteinogenic to nonproteinogenic amino acids are often short and efficient both in number of steps and in atom economy considerations. The genes encoding for these conversions are almost invariably clustered, and offer straightforward prospects for moving specific modification pathways between microbes.
Freestanding nonproteinogenic amino acids have been long recognized for their antimetabolite properties and tendency to be uncovered to reactive functionalities by the catalytic action of target enzymes. By installing them regiospecifically into biogenic peptides and proteins, it may be possible to usher a new era at the interface between small molecule and large molecule medicinal chemistry. Furthermore, site-selective protein functionalization offers uniquely attractive strategies for posttranslational protein modification.[4] Last but not least, many of the amino acids not selected by nature for protein incorporation offer rich architectural possibilities in the context of ribosomally derived polypeptides.
Hyperlink Scheme 1.
L-pipecolate formation in rapamycin biosynthesis.[1]
Hyperlink Scheme 2.
Hypothetical biosynthesis of miraziridine A. Nonproteinogenic amino acids: red= statine, blue = Azd, green = Aba, purple = vinyl-Arg.
Hyperlink Scheme 3.
Biosynthesis of α-aminobutyric acid (Aba) in E. coli K-12.[2]
Hyperlink Scheme 4.
Biosynthesis of statine moiety of Thailandepsin E.[3]
Hyperlink Scheme 5.
Biosynthesis of vinylogous alanine moiety of Jamaicamide B.[4]
Hyperlink Scheme 6.
Biosynthesis of 2,4-diaminobutyrate (2,4-DAB) from aspartyl phosphate in Pseudomonas aeruginosa PAO1.[5]
Hyperlink Scheme 7.
Anti-HIV marine cyclicdepsipeptides containing a β-methoxytyrosine moiety.[6] Nonproteinogenic amino acids: red = β-methoxy-Tyr, blue= β-hydroxy-Tyr.
Hyperlink Scheme 8.
Biosynthesis of kyneurinine.[7]
Hyperlink Scheme 9.
MecPGly formation in kutzneride biosynthesis.[8]
Hyperlink Scheme 10.
Piperazate formation in kutzneride biosynthesis.[8] Nonproteinogenic amino acids: red = dichloropyrroloindoline carboxylate, blue = L-O-methyl-Ser, green = (2R,3S)-β-hydroxy-Glu, purple = methylcyclopropyl-Gly (MeCPGly), orange = γ-chloropiperazate.
Hyperlink Scheme 11.
Hypothetical SAM mediated O-methylation biosynthesis.
Hyperlink Scheme 12.
Biosynthesis of vinylvaline and 3,4-dehydrolysine on the syringolin (syl) synthase. Nonproteinogenic amino acids: red = L-vinyl-Val, blue = L-Δ-3-Lysene.
Hyperlink Scheme 13.
Hypothetical pathway for eurypamide biosynthesis.[11] Nonproteinogenic amino acids: red = (2S,3R,4S)-3,4-dihydroxy-Arg, blue = Aryl ether (L-Tyr/(R)-β-Tyr).
Hyperlink Scheme 14.
Hypothetical pathway for Aeo biosynthesis.[12]
Hyperlink Scheme 15.
Formation of L-2,3-diaminopropionate (2,3-DAP) from L-serine in zwittermycin biosynthesis.[13]
Hyperlink Scheme 16.
Two possible routes for the biosynthesis of L-3-methyl-4-hydroxy-Pro from L-leucine.[14]
Hyperlink Scheme 17.
Hormaomycin biosynthesis.[15] Nonproteinogenic amino acids: red = Nitrocyclopropyl-Ala, blue = 5-chloro-1-hydroxypyrrole-2-carboxylate, green = (2S,3R)-β-methyl-Phe, purple = (2R,4R)-4-((Z)-propenyl)-Pro.
Hyperlink Scheme 18.
Propenylproline formation in hormaomycin biosynthesis.[16]
Hyperlink Scheme 19.
Proposed 5-chloropyrrole-2-carboxylate formation in hormaomycin biosynthesis.[16]
Hyperlink Scheme 20.
Two possible pathways for nitrocyclopropylalanine formation in hormaomycin biosynthesis.[16]
Hyperlink Scheme 21.
Aryl-aryl and aryl ether crosslinking in vancomycin.[17,18] Nonproteinogenic amino acids: red = (2R,3R)-m-chloro-(or β-hydroxy-m-chloro)Tyr aryl ether, blue = 3,5-dihydroxy-Tyr aryl ether, green = p-hydroxyphenyl-Gly aryl-aryl, purple =3,5-dihydroxyphenyl-Gly arly-aryl or aryl ether, orange = (2S,3R)-β-hydroxy-m-chloro-Tyr aryl ether.
Hyperlink Scheme 22.
Aryl amino acid hydroxylation. A. Aryl ring hydroxylation. B. Cβ-hydroxylation. Nonproteinogenic amino acids: red = m-Tyr, blue = β-hydroxy-m-Tyr.
Hyperlink Scheme 23.
Chlorination of amino acids. A. γ-Halogenases. B. Chlorination of β-hydroxy-Tyr. Nonproteinogenic amino acids: red = (2R)-2-amino-3-(chloromethyl)pentanoic acid, blue = β-hydroxy-m-chloro-Tyr.
Hyperlink Figure 1.
Polytheonamide B.[19] Nonproteinogenic amino acids: red = t-butyl-Gly, blue = β-methyl-Ile, green = β-hydrox-Val, orange = β-hydrox-N-methyl-Asn or β-hydroxy-N-methyl-Gln, purple = (2R)-2-amino-3-methyl-3-(methylsulfinyl)butanoic acid. Also note the numerous N-methyl-Asn residues.
Hyperlink Scheme 24.
Formation of aziridine carboxylate in azicemicin A biosynthesis.
Acknowledgments
We are grateful to the National Institutes of Health for financial support (GM20011 and GM49338 for CTW and GM 087934 for CK).
References
- 1. Wagner I, Musso H. Angew. Chem. Int. Ed. Engl. 1983;22:816–828. (b) for an online database of nonproteinogenic amino acids (norine), visit http://bioinfo.lifl.fr/norine/
- 2.Finking R, Marahiel MA. Annu. Rev. Microbiol. 2004;58:453–488. doi: 10.1146/annurev.micro.58.030603.123615. [DOI] [PubMed] [Google Scholar]
- 3.Walsh CT. Acc. Chem. Res. 2008;41:4–10. doi: 10.1021/ar7000414. [DOI] [PubMed] [Google Scholar]
- 4.Liu CC, Schultz PG. Annu. Rev. Biochem. 2010;79:413–444. doi: 10.1146/annurev.biochem.052308.105824. [DOI] [PubMed] [Google Scholar]
- 5.Johnson JA, Lu YY, Van Deventer JA, Tirrell DA. Curr. Opin. Chem. Biol. 2010;14:774–780. doi: 10.1016/j.cbpa.2010.09.013. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.(a) Hertweck C. Angew. Chem. Int. Ed. Engl. 2009;48:4688–4716. doi: 10.1002/anie.200806121. [DOI] [PubMed] [Google Scholar]; (b) Sattely ES, Fischbach MA, Walsh CT. Nat. Prod. Rep. 2008;25:757–793. doi: 10.1039/b801747f. [DOI] [PubMed] [Google Scholar]
- 7.Schwecke T, Aparicio JF, Molnár I, König A, Khaw LE, Haydock SF, Oliynyk M, Caffrey P, Cortés J, Lester JB, Böhm GA, Staunton J, Leadlay PF. Proc. Natl. Acad. Sci. USA. 1995;92:7839–7843. doi: 10.1073/pnas.92.17.7839. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Barreiro C, Priteo C, Sola-Landa A, Solera E, Martínez-Castro M, Pérez-Redondo R, Garcia-Estrada C, Aparicio JF, Fernández-Martínez LT, Santos-Aberturas J, Salehi-Najafabadi Z, Rodriguez-Garcia A, Tauch A, Martin JF. J. Bacteriol. 2012;194:3756–3757. doi: 10.1128/JB.00692-12. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Qu X, Jiang N, Xu F, Shao L, Tang G, Wilkinson B, Liu W. Mol. Biosyst. 2011;7:852–861. doi: 10.1039/c0mb00234h. [DOI] [PubMed] [Google Scholar]
- 10.Molnár I, Schupp T, Ono M, Zirkle R, Milnamow M, Nowak-Thompson B, Engel N, Toupet C, Stratmann A, Cyr DD, Gorlach J, Mayo JM, Hu A, Goff S, Schmid J, Ligon JM. Chem. Biol. 2000;7:97–109. doi: 10.1016/s1074-5521(00)00075-2. [DOI] [PubMed] [Google Scholar]
- 11.Tang L, Shah S, Chung L, Carney J, Katz L, Khosla C, Julien B. Science. 2000;287:640–642. doi: 10.1126/science.287.5453.640. [DOI] [PubMed] [Google Scholar]
- 12.Burbaum JJ, Schimmel P. J. Biol. Chem. 1991;266:16965–16968. [PubMed] [Google Scholar]
- 13.Mocibob M, Ivic N, Bilokapic S, Maier T, Luic M, Ban N, Weygand-Durasevic I. Proc. Natl. Acad. Sci. USA. 2010;107:14585–14590. doi: 10.1073/pnas.1007470107. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Ibba M, Soll D. Ann. Rev. Biochem. 2000;69:617–650. doi: 10.1146/annurev.biochem.69.1.617. [DOI] [PubMed] [Google Scholar]
- 15.Stachelhaus T, Hüser A, Marahiel MA. Chem. Biol. 1996;3:913–921. doi: 10.1016/s1074-5521(96)90180-5. [DOI] [PubMed] [Google Scholar]
- 16.Lambalot RH, Gehring AM, Flugel RS, Zuber P, LaCelle M, Marahiel MA, Reid R, Khosla C, Walsh CT. Chem. Biol. 1996;3:923–936. doi: 10.1016/s1074-5521(96)90181-7. [DOI] [PubMed] [Google Scholar]
- 17.Fischbach MA, Walsh CT. Chem. Rev. 2006;106:3468–3496. doi: 10.1021/cr0503097. [DOI] [PubMed] [Google Scholar]
- 18.Mootz HD, Marahiel MA. J. Bacteriol. 1997;179:6843–6850. doi: 10.1128/jb.179.21.6843-6850.1997. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Du L, Shen B. Curr. Opin. Drug Discov. Devel. 2001;4:215–228. [PubMed] [Google Scholar]
- 20.Chen H, O’Connor S, Cane DE, Walsh CT. Chem. Biol. 2001;8:899–912. doi: 10.1016/s1074-5521(01)00064-3. [DOI] [PubMed] [Google Scholar]
- 21.Gatto GJ, Jr, McLoughlin SM, Kelleher NL, Walsh CT. Biochemistry. 2005;44:5993–6002. doi: 10.1021/bi050230w. [DOI] [PubMed] [Google Scholar]
- 22.O’Connor SE, Chen H, Walsh CT. Biochemistry. 2002;41:5685–5694. doi: 10.1021/bi020006w. [DOI] [PubMed] [Google Scholar]
- 23.Du L, Sánchez C, Chen M, Edwards DJ, Shen B. Chem. Biol. 2000;7:623–642. doi: 10.1016/s1074-5521(00)00011-9. [DOI] [PubMed] [Google Scholar]
- 24.Pfeifer BA, Wang CCC, Walsh CT, Khosla C. Appl. Environ. Microbiol. 2003;69:6698–6702. doi: 10.1128/AEM.69.11.6698-6702.2003. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.Khosla C, Kapur S, Cane DE. Curr. Opin. Chem. Biol. 2009;13:135–143. doi: 10.1016/j.cbpa.2008.12.018. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26.Strieker M, Tanović A, Marahiel MA. Curr. Opin. Struct. Biol. 2010;20:234–240. doi: 10.1016/j.sbi.2010.01.009. [DOI] [PubMed] [Google Scholar]
- 27.Mast Y, Weber T, Gölz M, Ort-Winklbauer R, Gondran A, Wohlleben W, Schinko E. Microb. Biotechnol. 2011;4:192–206. doi: 10.1111/j.1751-7915.2010.00213.x. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28.He J, Magarvey N, Piraee M, Vining LC. Microbiology. 2001;147:2817–2829. doi: 10.1099/00221287-147-10-2817. [DOI] [PubMed] [Google Scholar]
- 29.Blanc V, Gil P, Bamas-Jacques N, Lorenzon S, Zagorec M, Schleuniger J, Bisch D, Blanche F, Debussche L, Crouzet J, Thibaut D. Mol. Microbiol. 1997;23:191–202. doi: 10.1046/j.1365-2958.1997.2031574.x. [DOI] [PubMed] [Google Scholar]
- 30.Mast YJ, Wohlleben W, Schinko E. J. Biotechnol. 2011;155:63–67. doi: 10.1016/j.jbiotec.2010.12.001. [DOI] [PubMed] [Google Scholar]
- 31.Green JM, Nichols BP. J. Biol. Chem. 1991;266:12971–12975. [PubMed] [Google Scholar]
- 32.Mehl RA, Anderson JC, Santoro SW, Wang L, Martin AB, King DS, Horn DM, Schultz PG. J. Am. Chem. Soc. 2003;125:935–939. doi: 10.1021/ja0284153. [DOI] [PubMed] [Google Scholar]
- 33.Young TS, Schultz PG. J. Biol. Chem. 2010;285:11039–11044. doi: 10.1074/jbc.R109.091306. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34.Vaillancourt FH, Yeh E, Vosburg DA, O’Connor SE, Walsh CT. Nature. 2005;436:1191–1194. doi: 10.1038/nature03797. [DOI] [PubMed] [Google Scholar]
- 35.Shaw JP, Petsko GA, Ringe D. Biochemistry. 1997;36:1329–1342. doi: 10.1021/bi961856c. [DOI] [PubMed] [Google Scholar]
- 36.Mehta PK, Christen P. Adv. Enzymol. Relat. Areas Mol. Biol. 2000;74:129–184. doi: 10.1002/9780470123201.ch4. [DOI] [PubMed] [Google Scholar]
- 37.Krug D, Müller R. Chembiochem. 2009;10:741–750. doi: 10.1002/cbic.200800748. [DOI] [PubMed] [Google Scholar]
- 38.Christenson SD, Wu W, Spies MA, Shen B, Toney MD. Biochemistry. 2003;42:12708–12718. doi: 10.1021/bi035223r. [DOI] [PubMed] [Google Scholar]
- 39.Süssmuth RD, Wohlleben W. Appl. Microbiol. Biotechnol. 2004;63:344–350. doi: 10.1007/s00253-003-1443-z. [DOI] [PubMed] [Google Scholar]
- 40.Puk O, Bischoff D, Kittel C, Pelzer S, Weist S, Stegmann E, Süssmuth RD, Wohlleben W. J. Bacteriol. 2004;186:6093–6100. doi: 10.1128/JB.186.18.6093-6100.2004. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 41.Vaillancourt FH, Yeh E, Vosburg DA, Garneau-Tsodikova S, Walsh CT. Chem. Rev. 2006;106:3364–3378. doi: 10.1021/cr050313i. [DOI] [PubMed] [Google Scholar]
- 42.Hubbard BK, Thomas MG, Walsh CT. Chem. Biol. 2000;7:931–942. doi: 10.1016/s1074-5521(00)00043-0. [DOI] [PubMed] [Google Scholar]
- 43.Ningsih F, Kitani S, Fukushima E, Nihira T. Microbiology. 2011;157:3213–3220. doi: 10.1099/mic.0.050203-0. [DOI] [PubMed] [Google Scholar]
- 44.For the formation of 2,3-diaminopropionate in zwittermycin biosynthesis, see: Kevany BM, Rasko DA, Thomas MG. Appl. Environ. Microbiol. 2009;75:1144–1155. doi: 10.1128/AEM.02518-08.
- 45.Müller C, Nolden S, Gebhardt P, Heinzelmann E, Lange C, Puk O, Welzel K, Wohlleben W, Schwartz D. Antimicrob. Agents Chemother. 2007;51:1028–1037. doi: 10.1128/AAC.00942-06. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 46.Vandenende CS, Vlasschaert M, Seah SYK. J. Bacteriol. 2004;186:5596–5602. doi: 10.1128/JB.186.17.5596-5602.2004. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 47.(a) Kinscherf TG, Coleman RH, Barta TM, Willis DK. J. Bacteriol. 1991;173:4124. doi: 10.1128/jb.173.13.4124-4132.1991. [DOI] [PMC free article] [PubMed] [Google Scholar]; (b) Durbin RD, Uchytil TF, Steele JA, Ribeiro LD. Phytochemistry. 1978;17:147–148. [Google Scholar]
- 48.Yin X, Zabriskie TM. Microbiology. 2006;152:2969–2983. doi: 10.1099/mic.0.29043-0. [DOI] [PubMed] [Google Scholar]
- 49.Barkei JJ, Kevany BM, Felnagle EA, Thomas MG. ChemBioChem. 2010;10:366–376. doi: 10.1002/cbic.200800646. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 50.Steenbergen JN, Alder J, Thorne GM, Tally FP. J. Antimicrob. Chemother. 2005;55:283–288. doi: 10.1093/jac/dkh546. [DOI] [PubMed] [Google Scholar]
- 51.Höfer I, Crüsemann M, Radzom M, Geers B, Flachschaar D, Cai X, Zeeck A, Piel J. Chem. Biol. 2011;18:381–391. doi: 10.1016/j.chembiol.2010.12.018. [DOI] [PubMed] [Google Scholar]
- 52.Ahn J-H, Walton JD. Mol. Plant-Microbe Interact. 1997;10:207–214. doi: 10.1094/MPMI.1997.10.2.207. [DOI] [PubMed] [Google Scholar]
- 53.For biosynthesis of statine containing Thaliandepsin E and Thailandepsin B (also known as Burholdac B), see: Wang C, Flemming CJ, Cheng Y-Q. Med Chem Comm. 2012;3:976–981. doi: 10.1039/C2MD20024D. Wang C, Henkes LM, Doughty LB, He M, Wang D, Meyer-Almes F-J, Cheng Y-Q. J. Nat. Prod. 2011;74:2031–2038. doi: 10.1021/np200324x. Biggins JB, Gleber CD, Brady SF. Org. Lett. 2011;13:1536–1539. doi: 10.1021/ol200225v.
- 54.Miraziridine is the only natural product known to contain a vinylogous arginine moiety. Jamaicamides contain a methyl pyrrolinone moeity derived from vinylogous alanine, see: Edwards DJ, Marquez BL, Nogle LM, McPhall K, Goeger DE, Roberts MA, Gerwick WH. Chem. Biol. 2004;11:817–833. doi: 10.1016/j.chembiol.2004.03.030. Cyclotheonamides contain vinylogous tyrosine, see: Fusetani N, Matsunaga S, Matsumoto H, Takebayashi Y. J. Am. Chem. Soc. 1990;112:7053–7054. Hagihara M, Anthony NJ, Stout TJ, Clardy J, Schreiber SL. J. Am. Chem. Soc. 1992;114:6568–6570. The vinylogous alanine moiety of Jamaicamides are biosynthesized by a hybrid NRPS-PKS assemblyline; vinylogous arginine and vinylogous tyrosine are likely produced by an analogous pathway.
- 55.Nakao Y, Fujita M, Warabi K, Matsunaga S, Fusetani N. J. Am. Chem. Soc. 2000;122:10462–11043. [Google Scholar]
- 56.For studies on the biosynthesis of Aba, see: Sebek OK, Barker HA. J. Bacteriol. 1968;96:2094–2098. doi: 10.1128/jb.96.6.2094-2098.1968. Tokarski Z, Klei HE, Berg CM. Biotechnol. Lett. 1988;10:7–10. Fotheringham IG, Grinter N, Pantaleone DP, Senkpeil RF, Taylor PP. Bioorg. Med. Chem. 1999;7:2209–2213. doi: 10.1016/s0968-0896(99)00153-4.
- 57.(a) Azd was first isolated from Streptomyces sp. MD398-A1, see: Naganawa H, Usui N, Takita T, Hamada M, Umezawa H. J. Antibiot. 1975;28:828–829. doi: 10.7164/antibiotics.28.828. (b) Miraziridine is known to inhibit serine proteases, and this activity has been attributed to the aziridine moiety, see: Shaschke N. Biorg. Med. Chem. Lett. 2004;14:855–857. doi: 10.1016/j.bmcl.2003.12.030.
- 58.For other aziridine containing natural products including azicemicin (an aziridine containing polyketide isolated from Kibdelosporangium sp. MJ126-NF4, which displays antimicrobial activity against Gram positive bacteria) as well as acinomycin, mitomycin, and ficellomycin, see: Tsuchida T, Iinuma H, Kinoshita N, Ikeda T, sawa R, Takahashi Y, Nagawana H, Sawa T, Hamada M, Takeuchi T. J. Antibiot. 1993;46:1772–1773. doi: 10.7164/antibiotics.46.1772. and Ogasawara Y, Liu H-w. J. Am. Chem. Soc. 2009;131:18066–18068. doi: 10.1021/ja907307h. and references therein.
- 59.Synthetic deriviatives of Azd are utilized as protease inhibitors, see: Ponte-Sucre A, Vicik R, Schultheis M, Schirmeister T, Moll H. Antimicrob. Agents Chemother. 2006;50:2439–2447. doi: 10.1128/AAC.01430-05. Schirmeister T. J. Med. Chem. 1999;42:560–572. doi: 10.1021/jm981061z. Schirmeister T, Peric M. Biorg. Med. Chem. 2000;8:1281–1291. doi: 10.1016/s0968-0896(00)00058-4. Vicik R, Hoerr V, Glaser M, Schultheis M, Hnasell E, McKerrow JH, Holzgrabe U, Caffrey CR, Ponte-Sucre A, Moll H, Stich A, Schirmeister T. Biorg. Med. Chem. Lett. 2006;16:275–2757. doi: 10.1016/j.bmcl.2006.02.026.
- 60.Schlippe YVG, Hartman MCT, Josephson K, Szostak JW. J. Am. Chem. Soc. 2012;134:10469–10477. doi: 10.1021/ja301017y. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 61.Shimamura H, Gouda H, Nagai K, Hirose T, Ichioka M, Furuya Y, Kobayashi Y, Hirono S, Sunazuka T, Omura S. Angew. Chem. Int. Ed. 2009;48:914–917. doi: 10.1002/anie.200804138. [DOI] [PubMed] [Google Scholar]
- 62.(a) Kopp F, Marahiel MA. Nat. Prod. Rep. 2007;24:735–749. doi: 10.1039/b613652b. [DOI] [PubMed] [Google Scholar]; (b) Trauger JW, Kohli RM, Mootz HD, Marahiel MA, Walsh CT. Nature. 2000;407:215–218. doi: 10.1038/35025116. [DOI] [PubMed] [Google Scholar]; (c) Kohli RM, Walsh CT, Burkart MD. Nature. 2002;418:658–661. doi: 10.1038/nature00907. [DOI] [PubMed] [Google Scholar]
- 63.(a) Storm DR, Rosenthal KS, Swanson PE. Ann. Rev. Biochem. 1977;46:723–763. doi: 10.1146/annurev.bi.46.070177.003451. [DOI] [PubMed] [Google Scholar]; (b) Shaheen M, Li J, Ross AC, Vederas JC, Jensen SE. Chem. Biol. 2011;18:1640–1648. doi: 10.1016/j.chembiol.2011.09.017. [DOI] [PubMed] [Google Scholar]
- 64.Krishnamoorthy R, Vazquez-Serrano LD, Turk JA, Kowalski JA, Benson AG, Breaux NT, Lipton MA. J. Am. Chem. Soc. 2006;128:15392–15393. doi: 10.1021/ja0666250. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 65.For a review on marine cyclic depsipeptides, see: Andavan GSB, Lemmens-Gruber R. Marine Drugs. 2010;8:810–834. doi: 10.3390/md8030810. there are other natural products that contain β-methoxy-Tyr, e.g. Mirabamide C, see: Plaza A, Gustchina E, Baker HL, Kelly M, Bewley CA. J. Nat. Prod. 2007;70:1753–1760. doi: 10.1021/np070306k. and also Neamphamide A, see Oku N, Gustafson KR, Cartner LK, Wilson JA, Shigematsu N, Hess S, Pannell LK, Boyd MR, McMahon JB. J. Nat. Prod. 2004;67:1047–1411. doi: 10.1021/np040003f.
- 66.Miao V, Cöeffet-LeGal M-F, Brian P, Brost R, Penn J, Whiting A, Martin S, Ford R, Parr I, Bouchard M, Silva CJ, Wrigley SK, Baltz RH. Microbiology. 2005;151:1507–1523. doi: 10.1099/mic.0.27757-0. [DOI] [PubMed] [Google Scholar]
- 67.(a) O’Brien IG, Cox GB, Gibson F. Biochim. Biophys. Acta. 1970;201:453–460. doi: 10.1016/0304-4165(70)90165-0. [DOI] [PubMed] [Google Scholar]; (b) Raymond KN, Dertz EA, Kim SS. Proc. Nat. Acad. Sci. USA. 2003;100:3584–3588. doi: 10.1073/pnas.0630018100. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 68.Fujimori DG, Hrvatin S, Neumann CS, Strieker M, Marahiel MA, Walsh CT. Proc. Natl. Acad. Sci. USA. 2007;104:16498–16503. doi: 10.1073/pnas.0708242104. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 69.(a) Yeh MS, Wei YH, Chang JS. Biotechnol. Prog. 2005;4:1329–1334. doi: 10.1021/bp050040c. [DOI] [PubMed] [Google Scholar]; (b) Peypoux F, Bonmatin JM, Wallach J. Appl. Microbiol. Biotechnol. 1999;51:553–563. doi: 10.1007/s002530051432. [DOI] [PubMed] [Google Scholar]
- 70.Wäsp U, Blanc D, Winkler T, Rüedi P, Dudler R. Mol. Plant-Microbe Interact. 1998;11:727–733. [Google Scholar]
- 71.Amrein H, Makart S, Granado J, Shakya R, Schneider-Pokorny J, Dudler R. Mol. Plant Microbe Interact. 2004;17:90–97. doi: 10.1094/MPMI.2004.17.1.90. [DOI] [PubMed] [Google Scholar]
- 72.(a) Groll M, Schellenberg B, Bachmann AS, Archer CR, Huber R, Powell TK, Lindow S, Kaiser M, Dudler R. Nature. 2008;452:755–758. doi: 10.1038/nature06782. [DOI] [PubMed] [Google Scholar]; (b) Schellenberg B, Bigler L, Dudler R. Environ. Microbiol. 2007;9:1640–1650. doi: 10.1111/j.1462-2920.2007.01278.x. [DOI] [PubMed] [Google Scholar]
- 73.Reddy MVR, Harper MK, Faulkner DJ. Tetrahedron. 1998;54:10649–10656. [Google Scholar]
- 74.Smith PA, Roberts TC, Romesberg FE. Chem. Biol. 2010;17:1223–1231. doi: 10.1016/j.chembiol.2010.09.009. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 75.Roberts TC, Smith PA, Romesberg FE. J. Nat. Prod. 2011;74:956–961. doi: 10.1021/np200163g. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 76.For a related oxidative phenolic coupling in teicoplanin and vancomycin biosynthesis, see: Widboom PF, Bruner SD. Chem Bio Chem. 2009;10:1757–1764. doi: 10.1002/cbic.200900117.
- 77.Ito M, Yamanaka M, Kutsumura N, Nishiyama S. Tetrahedron Lett. 2003;44:7949–7952. [Google Scholar]
- 78.Kijima M, Yoshida M, Sugita K, Horinouchi S, Beppu T. J. Biol. Chem. 1993;268:22429–22435. [PubMed] [Google Scholar]
- 79.Walton J. Phytochemistry. 2006;67:1406–1413. doi: 10.1016/j.phytochem.2006.05.033. [DOI] [PubMed] [Google Scholar]
- 80.Kijima M, Yoshida M, Sugita K, Horinouchi S, Beppu T. J. Biol. Chem. 1993;268:22429–22435. [PubMed] [Google Scholar]
- 81.Yamada T, Mizugichi Y, Nierhaus KH, Wittmann HG. Nature. 1978;275:460–461. doi: 10.1038/275460a0. [DOI] [PubMed] [Google Scholar]
- 82.Strieker M, Nolan EM, Walsh CT. J. Am. Chem. Soc. 2009;131:13523–13530. doi: 10.1021/ja9054417. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 83.Keller-Juslén C, Kuhn M, Loosli HR, Petcher TJ, Weber HP, von Wartburg A. Tetrahedron Lett. 1976;17:4147–4150. [Google Scholar]
- 84.For the biosynthesis of 3-hydroxy-4-methylproline, see: Adefarati AA, Giacobbe RA, Hensens OD, Tkacz JS. J. Am. Chem. Soc. 1991;113:3542–3545. Adefarati AA, Hensens OD, Jones ETT, Tkacz JS. J. Antibiotics. 1992;45:1953–1957. doi: 10.7164/antibiotics.45.1953. Schonewolf M, Rohr J. Angew. Chem. Int. Ed. Engl. 1991;30:183–185.
- 85.Rössner E, Zeeck A, König WA. Angew. Chem. Int. Ed. Engl. 1990;29:64–65. [Google Scholar]
- 86.(a) Duitman EH, Hamoen LW, Rembold M, Venema G, Seitz H, Saenger W, Bernhard F, Reinhardt R, Schmidt M, Ullrich C, Stein T, Leenders F, Vater J. Proc. Natl. Acad. Sci. USA. 1999;96:13294–13299. doi: 10.1073/pnas.96.23.13294. [DOI] [PMC free article] [PubMed] [Google Scholar]; (b) Hansen DB, Bumpus SB, Aron ZD, Kelleher NL, Walsh CT. J. Am. Chem. Soc. 2007;129:6366–6367. doi: 10.1021/ja070890j. [DOI] [PubMed] [Google Scholar]
- 87.Dixon RA, Chopra I. J. Antimicrob. Chemother. 1986;18:557–563. doi: 10.1093/jac/18.5.557. [DOI] [PubMed] [Google Scholar]
- 88.Kehr JC, Gatte Picchi D, Dittmann E. Beilstein J. Org. Chem. 2011;7:1622–1635. doi: 10.3762/bjoc.7.191. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 89.Fujiki H, Suganuma M. Anti. Canc. Agents Med. Chem. 2011;11:4–18. doi: 10.2174/187152011794941163. [DOI] [PubMed] [Google Scholar]
- 90.Rinehart KL, Jr, Gloer JB, Cook JC., Jr J. Am. Chem. Soc. 1981;103:1857–1859. [Google Scholar]
- 91.(a) Montgomery DW, Zukoski CF. Transplantation. 1985;40:49–56. [PubMed] [Google Scholar]; (b) SirDeshpande BV, Toogood PL. Biochemistry. 1995;34:9177–9184. doi: 10.1021/bi00028a030. [DOI] [PubMed] [Google Scholar]
- 92.Bischoff D, Bister B, Bertazzo M, Pfeifer V, Stegmann E, Nicholson GJ, Keller S, Pelzer S, Wohlleben W, Süssmuth RD. Chem bio chem. 2005;6:267–272. doi: 10.1002/cbic.200400328. [DOI] [PubMed] [Google Scholar]
- 93.Li DB, Woithe K, Geib N, Abou-Hadeed K, Zerbe K, Robinson JA. Methods in Enzymology. 2009;458:487–509. doi: 10.1016/S0076-6879(09)04819-8. [DOI] [PubMed] [Google Scholar]
- 94.Hadatsch B, Butz D, Schmiederer T, Steudle J, Wohlleben W, Süssmuth R, Stegmann E. Chem. Biol. 2007;14:1078–1089. doi: 10.1016/j.chembiol.2007.08.014. [DOI] [PubMed] [Google Scholar]
- 95.Stegmann E, Pelzer S, Bischoff D, Puk O, Stockert S, Butz D, Zerbe K, Robinson J, Süssmuth RD, Wohelleben W. J. Biotechnol. 2006;124:640–653. doi: 10.1016/j.jbiotec.2006.04.009. [DOI] [PubMed] [Google Scholar]
- 96.Eisenstein BI, Oleson FB, Jr, Baltz RH. Clin. Infect. Dis. 2010;50:S10–S15. doi: 10.1086/647938. [DOI] [PubMed] [Google Scholar]
- 97.Robbel L, Marahiel MA. J. Biol. Chem. 2010;285:27501–27508. doi: 10.1074/jbc.R110.128181. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 98.Schneider T, Gries K, Josten M, Wiedemann I, Pelzer S, Labischinski H, Sahl HG. Antimicrob. Agents Chemother. 2009;53:1610–1618. doi: 10.1128/AAC.01040-08. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 99.(a) Weber G, Leitner E. Curr. Genet. 1994;26:461–467. doi: 10.1007/BF00309935. [DOI] [PubMed] [Google Scholar]; (b) Laupacis A, Keown PA, Ulan RA, McKenzie N, Stiller CR. Can. Med. Assoc. J. 1982;126:1041–1046. [PMC free article] [PubMed] [Google Scholar]
- 100.Offenzeller M, Santer G, Totschnig K, Su Z, Moser H, Traber R, Schneider-Scherzer E. Biochemistry. 1996;35:8401–8412. doi: 10.1021/bi960224n. [DOI] [PubMed] [Google Scholar]
- 101.Farver DF, Hedge DD, Lee SC. Ann. Pharmacother. 2005;39:863–868. doi: 10.1345/aph.1E397. [DOI] [PubMed] [Google Scholar]
- 102.(a) Cudic P, Kranz JK, Behenna DC, Kruger RG, Tadesse H, Wand AJ, Veklich YI, Weisel JW, McCafferty DG. Proc. Natl. Acad. Sci. USA. 2002;99:7384–7379. doi: 10.1073/pnas.102192099. [DOI] [PMC free article] [PubMed] [Google Scholar]; (b) Fang X, Tiyanont K, Zhang Y, Vanner J, Boger D, Walker S. Mol. BioSyst. 2006;2:69–76. doi: 10.1039/b515328j. [DOI] [PubMed] [Google Scholar]
- 103.Khosla C, Tang Y, Chen AY, Schnarr NA, Cane DE. Annu. Rev. Biochem. 2007;76:195–221. doi: 10.1146/annurev.biochem.76.053105.093515. [DOI] [PubMed] [Google Scholar]
- 104.Zhang L-H, Liu JO. J. Immunol. 2001;166:5611–5618. doi: 10.4049/jimmunol.166.9.5611. [DOI] [PubMed] [Google Scholar]
- 105.Bender CL, Alarcón-Chaidez F, Gross DC. Microbiol. Mol. Biol. Rev. 1999;63:266–292. doi: 10.1128/mmbr.63.2.266-292.1999. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 106.Strieter ER, Koglin A, Aron ZD, Walsh CT. J. Am. Chem. Soc. 2009;131:2113–2115. doi: 10.1021/ja8077945. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 107.Fredenhagen A, Tamura SY, Kenny PTM, Komura H, Naya Y, Nakanishi K, Nishiyama K, Sugiura M, Kita H. J. Am. Chem. Soc. 1987;109:4409–4411. [Google Scholar]
- 108.Chen Y, Yin M, Horsman GP, Shen B. J. Nat. Prod. 2011;74:420–424. doi: 10.1021/np100825y. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 109.Tanner ME. Acc. Chem. Res. 2002;35:237–246. doi: 10.1021/ar000056y. [DOI] [PubMed] [Google Scholar]
- 110.Kino K, Sato M, Yoneyama M, Kirimura K. Appl. Microbiol. Biotechnol. 2007;73:1299–1305. doi: 10.1007/s00253-006-0600-6. [DOI] [PubMed] [Google Scholar]
- 111.Lim YH, Yokoigawa K, Esaki N, Soda K. J. Bacteriol. 1993;175:4213–4217. doi: 10.1128/jb.175.13.4213-4217.1993. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 112.Stachelhaus T, Walsh CT. Biochemistry. 2000;39:5775–5787. doi: 10.1021/bi9929002. [DOI] [PubMed] [Google Scholar]
- 113.Clugston SL, Sieber SA, Marahiel MA, Walsh CT. Biochemistry. 2003;42:12095–12104. doi: 10.1021/bi035090+. [DOI] [PubMed] [Google Scholar]
- 114.Balibar CJ, Vaillancourt FH, Walsh CT. Chem. Biol. 2005;12:1189–1200. doi: 10.1016/j.chembiol.2005.08.010. [DOI] [PubMed] [Google Scholar]
- 115.Stein DB, Linne U, Marahiel MA. FEBS J. 2005;272:4506–4520. doi: 10.1111/j.1742-4658.2005.04871.x. [DOI] [PubMed] [Google Scholar]
- 116.Wu B, Szymanski W, Wybenga GG, Heberling MM, Bartsch S, de Wildeman S, Poelarends GJ, Feringa BL, Dijkstra BW, Janssen DB. Angew. Chem. Int. Ed. Engl. 2012;51:482–486. doi: 10.1002/anie.201106372. [DOI] [PubMed] [Google Scholar]
- 117.Wu B, Szymanski W, Heberling MM, Feringa BL, Janssen DB. Trends Biotechnol. 2011;29:352–362. doi: 10.1016/j.tibtech.2011.02.005. [DOI] [PubMed] [Google Scholar]
- 118.Walsh CT. Posttranslational modification of proteins. Colorado: Roberts and Company Publishers; 2006. [Google Scholar]
- 119.Mahlert C, Kopp F, Thirlway J, Micklefield J, Marahiel MA. J. Am. Chem. Soc. 2007;129:12011–12018. doi: 10.1021/ja074427i. [DOI] [PubMed] [Google Scholar]
- 120.Hamada T, Matsunaga S, Yano G, Fusetani N. J. Am. Chem. Soc. 2005;127:110–118. doi: 10.1021/ja045749e. [DOI] [PubMed] [Google Scholar]
- 121.(a) Crone WJK, Leeper FJ, Truman AW. Chem. Sci. 2012;3:3516–3521. [Google Scholar]; (b) Gomez-Escribano JP, Song L, Bibb MJ, Challis GL. Chem. Sci. 2012;3:3522–3525. [Google Scholar]
- 122.Kawakami T, Murakami H, Suga H. Chem. Biol. 2008;15:32–42. doi: 10.1016/j.chembiol.2007.12.008. [DOI] [PubMed] [Google Scholar]
- 123.Subtelny AO, Harman MCT, Szostak JW. J. Am. Chem. Soc. 2008;130:6131–6136. doi: 10.1021/ja710016v. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 124.(a) Entsch B, Ballou DP, Massey V. J. Biol. Chem. 1976;251:2550–2563. [PubMed] [Google Scholar]; (b) Entsch B, Ballou DP, Husain M, Massey V. J. Biol. Chem. 1976;251:7367–7369. [PubMed] [Google Scholar]
- 125.Wencewicz T, Walsh CT. Nat. Prod. Rep. 2012 doi: 10.1039/c2np20069d. In press. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 126.Ortiz de Montellano PR, editor. Cytochrome P450: Structure, Mechanism, and Biochemistry. 3rd Ed. New York: Kluwer Academic/Plenum Publishers; 2005. [Google Scholar]
- 127.Abu-Omar MM, Loaiza A, Hontzeas N. Chem. Rev. 2005;105:2227–2252. doi: 10.1021/cr040653o. [DOI] [PubMed] [Google Scholar]
- 128.Zhang W, Ames BD, Walsh CT. Biochemistry. 2011;50:5401–5403. doi: 10.1021/bi200733c. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 129.Cryle MJ, Meinhart A, Schlichting I. J. Biol. Chem. 2010;285:24562–24574. doi: 10.1074/jbc.M110.131904. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 130.Tang Y, Walsh C. Unpublished observations [Google Scholar]
- 131.Van Lanen SG, Lin S, Horsman GP, Shen B. FEMS Microbiol. Lett. 2009;300:237–241. doi: 10.1111/j.1574-6968.2009.01802.x. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 132.Neumann CS, Fujimori DG, Walsh CT. Chem. Biol. 2008;15:99–109. doi: 10.1016/j.chembiol.2008.01.006. [DOI] [PubMed] [Google Scholar]
- 133.Yeh E, Garneau S, Walsh CT. Proc. Natl. Acad. Sci. USA. 2005;102:3960–3965. doi: 10.1073/pnas.0500755102. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 134.Puk O, Huber P, Bischoff D, Recktenwald J, Jung G, Sussmuth RD, van Pée KH, Wohlleben W, Pelzer S. Chem. Biol. 2002;9:225–235. doi: 10.1016/s1074-5521(02)00101-1. [DOI] [PubMed] [Google Scholar]
- 135.Podzelinska K, Latimer R, Bhattacharya A, Vining LC, Zechel DL, Jia Z. J. Mol. Biol. 2010;397:316–331. doi: 10.1016/j.jmb.2010.01.020. [DOI] [PubMed] [Google Scholar]
- 136.Heemstra JR, Jr, Walsh CT. J. Am. Chem. Soc. 2008;130:14024–14025. doi: 10.1021/ja806467a. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 137.Neumann CS, Walsh CT. Proc. Natl. Acad. Sci. USA. 2010;107:5798–5803. doi: 10.1073/pnas.1001681107. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 138.Cheung J, Beasley FC, Liu S, Lajoie GA, Heinrichs DE. Mol. Microbiol. 2009;74:594–608. doi: 10.1111/j.1365-2958.2009.06880.x. [DOI] [PubMed] [Google Scholar]
- 139.Kelly WL, Boyne MT, II, Yeh E, Vosburg DA, Galonić DP, Kelleher NL, Walsh CT. Biochemistry. 2007;46:359–368. doi: 10.1021/bi061930j. [DOI] [PubMed] [Google Scholar]
- 140.Gatto GJ, Jr, Boyne MT, II, Kelleher NL, Walsh CT. J. Am. Chem. Soc. 2006;128:3838–3847. doi: 10.1021/ja0587603. [DOI] [PubMed] [Google Scholar]
- 141.Neumann CS, Jiang W, Heemstra JR, Jr, Gontang EA, Kolter R, Walsh CT. Chembiochem. 2012;13:972–976. doi: 10.1002/cbic.201200054. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 142.Thomas MG, Chan YA, Ozanick SG. Antimicrob. Agents Chemother. 2003;47:2823–2830. doi: 10.1128/AAC.47.9.2823-2830.2003. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 143.Helmetag V, Samel SA, Thomas MG, Marahiel MA, Essen LO. FEBS J. 2009;276:3669–3682. doi: 10.1111/j.1742-4658.2009.07085.x. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 144.(a) Iwatsuki M, Uchida R, Yoshijima H, Ui H, Shiomi K, Matsumoto A, Takhashi Y, Abe A, Tomoda H, Omura S. J. Antibiot. 2008;61:222–229. doi: 10.1038/ja.2008.32. [DOI] [PubMed] [Google Scholar]; (b) Holmes TC, May AE, Zaleta-Rivera K, Ruby JG, Skewes-Cox P, Fischbach MA, DeRisi JL, Iwatsuki M, Omura S, Khosla C. J. Am. Chem. Soc. 2012;134:17797–17806. doi: 10.1021/ja308622d. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 145.Magarvey NA, Haltli B, He M, Greenstein M, Hucul JA. Antimicron. Agents Chemother. 2006;50:2167–2177. doi: 10.1128/AAC.01545-05. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 146.Okada M, Sato I, Cho SJ, Iwata H, Nishio T, Dubnau D, Sakagami Y. Nat. Chem. Biol. 2005;1:23–24. doi: 10.1038/nchembio709. [DOI] [PubMed] [Google Scholar]
- 147.Broberg A, Menkis A, Rimvydas V. J. Nat. Prod. 2006;69:97–102. doi: 10.1021/np050378g. [DOI] [PubMed] [Google Scholar]
- 148.Ames BD, Liu X, Walsh CT. Biochemistry. 2010;49:8564–8576. doi: 10.1021/bi1012029. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 149.Walsh CT, Haynes SW, Ames BD. Nat. Prod. Rep. 2012;29:37–59. doi: 10.1039/c1np00072a. [DOI] [PubMed] [Google Scholar]
- 150.Colabroy KL, Hackett WT, Markham AJ, Rosenberg J, Cohen DE, Jacobson A. Arch. Biochem. Biophys. 2008;479:131–138. doi: 10.1016/j.abb.2008.08.022. [DOI] [PubMed] [Google Scholar]
- 151.Chen H, Tseng CC, Hubbard BK, Walsh CT. Proc. Natl. Acad. Sci. USA. 2001;98:14901–14906. doi: 10.1073/pnas.221582098. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 152.Tseng CC, Vaillancourt FH, Druner SD, Walsh CT. Chem. Biol. 2004;11:1195–1203. doi: 10.1016/j.chembiol.2004.06.012. [DOI] [PubMed] [Google Scholar]
- 153.Pettit GR, Kamano Y, Dufresne C, Cerny RL, Herald CL, Schmidt JM. J. Org. Chem. 1989;54:6005–6006. [Google Scholar]
- 154.Rich DH. J. Med. Chem. 1985;28:263–273. doi: 10.1021/jm00381a001. [DOI] [PubMed] [Google Scholar]
- 155.Xu Y, Kersten RD, Nam S-J, Lu L, Al-Suwailem AM, Zheng H, Fenical W, Dorrestein PC, Moore BS, Qian P-Y. J. Am. Chem. Soc. 2012;134:8625–8632. doi: 10.1021/ja301735a. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 156.Jin M, Fischbach MA, Clardy J. J. Am. Chem. Soc. 2006;128:10660–10661. doi: 10.1021/ja063194c. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 157.(a) Fusetani N, Matsunaga S, Matsumoto H, Takebayashi Y. J. Am. Chem. Soc. 1990;112:7053–7054. [Google Scholar]; (b) Hagihara M, Anthony NJ, Stout TJ, Clardy J, Schreiber SL. J. Am. Chem. Soc. 1992;114:6568–6570. [Google Scholar]
- 158.(a) Kaur G, Hollingshead M, Holbeck S, Schaeuer-Vukašinović V, Camalier RF, Dömling A, Agarwal S. Biochem. J. 2006;396:235–242. doi: 10.1042/BJ20051735. [DOI] [PMC free article] [PubMed] [Google Scholar]; (b) Steinmetz H, Glaser N, Herdtweck E, Sasse F, Reichenbach H, Höfle G. Angew. Chem. Int. Ed. 2004;43:4888–4892. doi: 10.1002/anie.200460147. [DOI] [PubMed] [Google Scholar]
- 159.Choi H, Pereira AR, Cao Z, Shuman CF, Engene N, Byrum T, Matainaho T, Murray TF, Mangoni A, Gerwick WH. J. Nat. Prod. 2010;73:1411–1421. doi: 10.1021/np100468n. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 160.Liu L, Bearden DW, Rein KS. J. Nat. Prod. 2011;74:1535–1538. doi: 10.1021/np200362q. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 161.Christiansen G, Fastner J, Erhard M, Börner T, Dittman E. J. Bacteriol. 2003;185:564–572. doi: 10.1128/JB.185.2.564-572.2003. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 162.Scott-Craig JS, Panaccione DG, Pocard JA, Walton JD. J. Biol. Chem. 1992;267:26044–26049. [PubMed] [Google Scholar]
- 163.Offenzeller M, Su Z, Santer G, Moser H, Traber R, Memmert K, Schneider-Scherzer E. J. Biol. Chem. 1993;268:26127–26134. [PubMed] [Google Scholar]
- 164.Darkin-Rattray SJ, Gurnett AM, Myer RW, Dulski PM, Crumley TM, Allocco JJ, Cannova C, Meinke PT, Colletti SL, Bednarek MA, Singh SB, Goetz MA, Dombrowski AW, Polishook JD, Schmatz DM. Proc. Natl. Acad. Sci. USA. 1996;93:13143–13147. doi: 10.1073/pnas.93.23.13143. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 165.Jiang Z, Barret MO, Boyd KG, Adams DR, Boyd ASF, Burgess JG. Phytochemistry. 2002;60:33–38. doi: 10.1016/s0031-9422(02)00061-4. [DOI] [PubMed] [Google Scholar]
- 166.Okumura HS, Philmus B, Portmann C, Hemscheidt TK. J. Nat. Prod. 2009;72:172–176. doi: 10.1021/np800557m. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 167.McMurry J, Begley TP. Organic Chemistry of Biological Pathways. Colorado: Roberts & Co; 2005. [Google Scholar]
- 168.Thibodeaux CJ, Chang WC, Liu HW. Chem. Rev. 2011;112:1681–1709. doi: 10.1021/cr200073d. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 169.Willey JM, van der Donk WA. Annu. Rev. Microbiol. 2007;61:477–501. doi: 10.1146/annurev.micro.61.080706.093501. [DOI] [PubMed] [Google Scholar]
- 170.Berkheij M. Doctoral dissertation. University of Amsterdam; 2008. Synthesis of 1,3-Diene-containing α-amino acids and peptides via N-acyliminium ions. [Google Scholar]
- 171.Dardenne G, Casimir J, Marlier M, Larsen PO. Phytochemistry. 1974;13:1897–1900. [Google Scholar]
- 172.Mackay MF, Donkelaar AV, Culvenor CCJ. J. Chem. Soc. Chem. Commun. 1986:1219–1221. [Google Scholar]
- 173.Sahm U, Knobloch G, Wagner F. J. Antibiot. 1973;26:389–390. doi: 10.7164/antibiotics.26.389. [DOI] [PubMed] [Google Scholar]
- 174.Yasuta T, Okazaki S, Mitsui H, Yuhashi K, Ezura H, Minamisawa K. Appl. Environ. Microbiol. 2001;67:4999–5009. doi: 10.1128/AEM.67.11.4999-5009.2001. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 175.Hatanaka S-I, Niimura Y, Takishima K. Trans. Mycol. Soc. Japan. 1985;26:61–68. [Google Scholar]
- 176.Yamaura Y, Fukuhara M, Takabatake E, Ito N, Hashimoto T. Toxicology. 1986;38:161–173. doi: 10.1016/0300-483x(86)90117-4. [DOI] [PubMed] [Google Scholar]
- 177.Rando RR. Nature. 1974;250:587–587. doi: 10.1038/250586a0. [DOI] [PubMed] [Google Scholar]
- 178.Gray DO, Fowden L. Biochem. J. 1962;82:385–389. doi: 10.1042/bj0820385. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 179.Tanaka K, Miller EM, Isselbacher KJ. Proc. Natl. Acad. Sci. USA. 1970;68:20–24. doi: 10.1073/pnas.68.1.20. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 180.(a) Andersen L, Nielsen B, Jaroszewski JW. Chirality. 2000;12:665–669. doi: 10.1002/1520-636X(2000)12:9<665::AID-CHIR4>3.0.CO;2-U. [DOI] [PubMed] [Google Scholar]; (b) Dennis RL, Plant WJ, Skinner CG, Sutherland GL, Shive W. J. Am. Chem. Soc. 1955;77:2362–2364. [Google Scholar]
- 181.Scannell JP, Pruess DL, Demny TC, Weiss F, Williams T. J. Antibiot. 1971;24:239–244. doi: 10.7164/antibiotics.24.239. [DOI] [PubMed] [Google Scholar]
- 182.Sun M-Y, Fowden L, Millington DS, Sheppard RC. Phytochemistry. 1969;8:1227–1233. [Google Scholar]
- 183.Chilton WS, Tsou G, Kirk L. Tetrahedron Lett. 1968;9:6283–6284. [Google Scholar]
- 184.(a) Nass G, Poralla K, Zahner H. Naturwissenschaften. 1971;58:603–608. doi: 10.1007/BF01185604. [DOI] [PubMed] [Google Scholar]; (b) Crawford JM, Mahlstedt SA, Malcolmson SJ, Clardy J, Walsh CT. Chem. Biol. 2011;18:1102–1112. doi: 10.1016/j.chembiol.2011.07.009. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 185.Rando RR, Relyea N, Cheng L. J. Biol. Chem. 1976;251:3306–3312. [PubMed] [Google Scholar]
- 186.Rando RR, Bangerter FW. J. Am. Chem. Soc. 1976;98:6762–6764. doi: 10.1021/ja00437a090. [DOI] [PubMed] [Google Scholar]
- 187.Hollenhorst MA, Bumpus SB, Matthews ML, Bollinger JM, Jr, Kelleher NL, Walsh CT. J. Am. Chem. Soc. 2010;132:15773–15781. doi: 10.1021/ja1072367. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 188.Kang HS, Krunic A, Orjala J. J. Nat. Prod. 2012;75:807–811. doi: 10.1021/np300150h. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 189.Linington RG, Edwards DJ, Shuman CF, McPhail KL, Matainaho T, Gerwick WH. J. Nat. Prod. 2008;71:22–27. doi: 10.1021/np070280x. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 190.Tooming-Klunderud A, Rohrlack T, Shalchian-Tabrizi K, Kristensen T, Jakobsen KS. Microbiology. 2007;153:1382–1393. doi: 10.1099/mic.0.2006/001123-0. [DOI] [PubMed] [Google Scholar]
- 191.Stocking EM, Sanz-Cervera JF, Williams RM. J. Am. Chem. Soc. 1996;118:7008–7009. [Google Scholar]
- 192.Young TS, Walsh CT. Proc. Natl. Acad. Sci. USA. 2011;108:13053–13058. doi: 10.1073/pnas.1110435108. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 193.Marcus JP, Dekker EE. Biochem. Biophys. Res. Commun. 1993;190:1066–1072. doi: 10.1006/bbrc.1993.1157. [DOI] [PubMed] [Google Scholar]
- 194.Nawrat CC, Moody CJ. Nat. Prod. Rep. 2011;28:1426–1444. doi: 10.1039/c1np00031d. [DOI] [PubMed] [Google Scholar]
- 195.(a) Ehrlich J, Knudsen MP, Anderson LE, Koepsell HJ, Coffrey GL, Kohberger DL, Hillegas AB, Oyaas JE. Nature. 1954;173:72. doi: 10.1038/173072a0. [DOI] [PubMed] [Google Scholar]; (b) Bartz QR, Haskell TH, Elder CC, Johannessen DW, Frohardt RP, Ryder A, Fusari SA. Nature. 1954;173:72–73. doi: 10.1038/173072b0. [DOI] [PubMed] [Google Scholar]
- 196.Allen L, Meck R, Yunis A. Res. Commun. Chem. Pathol. Pharmacol. 1980;27:175–182. [PubMed] [Google Scholar]
- 197.Krahn D, Ottmann C, Kaiser M. Nat. Prod. Rep. 2011;28:1854–1867. doi: 10.1039/c1np00048a. [DOI] [PubMed] [Google Scholar]
- 198.Dawlaty J, Zhang X, Fischbach MA, Clardy J. J. Nat. Prod. 2010;73:441–446. doi: 10.1021/np900685z. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 199.Hollenhorst MA, Ntai I, Badet B, Kelleher NL, Walsh CT. Biochemistry. 2011;50:3859–3861. doi: 10.1021/bi2004735. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 200.Mahlstedt SA, Walsh CT. Biochemistry. 2010;49:912–923. doi: 10.1021/bi9021186. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 201.(a) Parker JB, Walsh CT. Biochemistry. 2012;51:3241–3251. doi: 10.1021/bi300254u. [DOI] [PMC free article] [PubMed] [Google Scholar]; (b) Parker JB, Walsh CT. Biochemistry. 2012;51:5622–5632. doi: 10.1021/bi3006362. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 202.Chmara H, Zahner H, Borowski E. J. Antibiot. 1984;37:1038–1043. doi: 10.7164/antibiotics.37.1038. [DOI] [PubMed] [Google Scholar]
- 203.Luesch H, Moore RE, Paul VJ, Mooberry SL, Corbett TH. J. Nat. Prod. 2001;64:907–910. doi: 10.1021/np010049y. [DOI] [PubMed] [Google Scholar]
References
- 1. Schwecke T, Aparicio JF, Molnár I, König A, Khaw LE, Haydock SF, Oliynyk M, Caffrey P, Cortés J, Lester JB, Böhm GA, Staunton J, Leadlay PF. Proc. Natl. Acad. Sci. USA. 1995;92:7839–7843. doi: 10.1073/pnas.92.17.7839. (Reference 7)
- 2.For studies on the biosynthesis of Aba, see: Sebek OK, Barker HA. J. Bacteriol. 1968;96:2094–2098. doi: 10.1128/jb.96.6.2094-2098.1968. Tokarski Z, Klei HE, Berg CM. Biotechnol. Lett. 1988;10:7–10. Fotheringham IG, Grinter N, Pantaleone DP, Senkpeil RF, Taylor PP. Bioorg. Med. Chem. 1999;7:2209–2213. doi: 10.1016/s0968-0896(99)00153-4.
- 3.For biosynthesis of statine containing Thaliandepsin E and Thailandepsin B (also known as Burholdac B), see: Wang C, Flemming CJ, Cheng Y-Q. MedChemComm. 2012;3:976–981. doi: 10.1039/C2MD20024D. Wang C, Henkes LM, Doughty LB, He M, Wang D, Meyer-Almes F-J, Cheng Y-Q. J. Nat. Prod. 2011;74:2031–2038. doi: 10.1021/np200324x. Biggins JB, Gleber CD, Brady SF. Org. Lett. 2011;13:1536–1539. doi: 10.1021/ol200225v.
- 4.Miraziridine is the only natural product known to contain a vinylogous arginine moiety. Jamaicamides contain a methyl pyrrolinone moeity derived from vinylogous alanine, see: Edwards DJ, Marquez BL, Nogle LM, McPhall K, Goeger DE, Roberts MA, Gerwick WH. Chem. Biol. 2004;11:817–833. doi: 10.1016/j.chembiol.2004.03.030. Cyclotheonamides contain vinylogous tyrosine, see: Fusetani N, Matsunaga S, Matsumoto H, Takebayashi Y. J. Am. Chem. Soc. 1990;112:7053–7054. Hagihara M, Anthony NJ, Stout TJ, Clardy J, Schreiber SL. J. Am. Chem. Soc. 1992;114:6568–6570. The vinylogous alanine moiety of Jamaicamides are biosynthesized by a hybrid NRPS-PKS assemblyline; vinylogous arginine and vinylogous tyrosine are likely produced by an analogous pathway.
- 5.Vandenende CS, Vlasschaert M, Seah SYK. J. Bacteriol. 2004;186:5596–5602. doi: 10.1128/JB.186.17.5596-5602.2004. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6. Plaza A, Gustchina E, Baker HL, Kelly M, Bewley CA. J. Nat. Prod. 2007;70:1753–1760. doi: 10.1021/np070306k. Oku N, Gustafson KR, Cartner LK, Wilson JA, Shigematsu N, Hess S, Pannell LK, Boyd MR, McMahon JB. J. Nat. Prod. 2004;67:1047–1411. doi: 10.1021/np040003f. For a review on marine cyclicdepsipeptides, see: Andavan GSB, Lemmens-Gruber R. Marine Drugs. 2010;8:810–834. doi: 10.3390/md8030810.
- 7.Steenbergen JN, Alder J, Thorne GM, Tally FP. J. Antimicrob. Chemother. 2005;55:283–288. doi: 10.1093/jac/dkh546. [DOI] [PubMed] [Google Scholar]
- 8.Fujimori DG, Hrvatin S, Neumann CS, Strieker M, Marahiel MA, Walsh CT. Proc. Natl. Acad. Sci. USA. 2007;104:16498–16503. doi: 10.1073/pnas.0708242104. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Fujimori DG, Hrvatin S, Neumann CS, Strieker M, Marahiel MA, Walsh CT. Proc. Natl. Acad. Sci. USA. 2007;104:16498–16503. doi: 10.1073/pnas.0708242104. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Amrein H, Makart S, Granado J, Shakya R, Schneider-Pokorny J, Dudler R. Mol. Plant Microbe Interact. 2004;17:90–97. doi: 10.1094/MPMI.2004.17.1.90. [DOI] [PubMed] [Google Scholar]
- 11.For a related oxidative phenolic coupling in teicoplanin and vancomycin biosynthesis, see: Widboom PF, Bruner SD. ChemBioChem. 2009;10:1757–1764. doi: 10.1002/cbic.200900117.
- 12.Ahn J-H, Walton JD. Mol. Plant-Microbe Interact. 1997;10:207–214. doi: 10.1094/MPMI.1997.10.2.207. [DOI] [PubMed] [Google Scholar]
- 13.Kevany BM, Rasko DA, Thomas MG. Appl. Environ. Microbiol. 2009;75:1144–1155. doi: 10.1128/AEM.02518-08. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.(a) Adefarati AA, Giacobbe RA, Hensens OD, Tkacz JS. J. Am. Chem. Soc. 1991;113:3542–3545. [Google Scholar]; Adefarati AA, Hensens OD, Jones ETT, Tkacz JS. J. Antibiotics. 1992;45:1953–1957. doi: 10.7164/antibiotics.45.1953. [DOI] [PubMed] [Google Scholar]; (c) Schonewolf M, Rohr J. Angew. Chem. Int. Ed. Engl. 1991;30:183–185. [Google Scholar]
- 15.Rössner E, Zeeck A, König WA. Angew. Chem. Int. Ed. Engl. 1990;29:64–65. [Google Scholar]
- 16.Höfer I, Crüsemann M, Radzom M, Geers B, Flachshaar D, Cai X, Zeeck A, Piel J. Chem. and Biol. 2011;18:381–391. doi: 10.1016/j.chembiol.2010.12.018. [DOI] [PubMed] [Google Scholar]
- 17.Bischoff D, Bister B, Bertazzo M, Pfeifer V, Stegmann E, Nicholson GJ, Keller S, Pelzer S, Wohlleben W, Süssmuth RD. Chembiochem. 2005;6:267–272. doi: 10.1002/cbic.200400328. [DOI] [PubMed] [Google Scholar]
- 18.Li DB, Woithe K, Geib N, Abou-Hadeed K, Zerbe K, Robinson JA. Methods in Enzymology. 2009;458:487–509. doi: 10.1016/S0076-6879(09)04819-8. [DOI] [PubMed] [Google Scholar]
- 19.Hamada T, Matsunaga S, Yano G, Fusetani N. J. Am. Chem. Soc. 2005;127:110–118. doi: 10.1021/ja045749e. [DOI] [PubMed] [Google Scholar]