Skip to main content
Journal of Microbiology and Biotechnology logoLink to Journal of Microbiology and Biotechnology
. 2025 Jul 14;35:e2502030. doi: 10.4014/jmb.2503.02030

Comparative Analysis of Signature Sequences from Adenylation Domains Situated within Bacterial-Origin Nonribosomal Peptide Synthetase Modules

Weina Gao 1,, Zhishen Zhang 1,, Huiying Yu 2,*, Xin Li 2, Chunshan Quan 3, Yun Xue 1, Pengchao Zhao 1,*
PMCID: PMC12283262  PMID: 40659553

Abstract

Nonribosomal peptides are assembled by large enzymes that contain multiple active sites, which function in a modular manner. The adenylation (A) domains present within typical nonribosomal peptide synthetase (NRPS) modules contain specificity-conferring codes or signature sequences (SNSs). In this study, we obtained 2051 A domain sequences from 67 bacterial species. Their alignment and clustering identified 508 SNSs. Over 80% of the SNSs displayed distinct specificity for 36 proteinogenic and nonproteinogenic α-amino acid moieties (α-AAMs). Furthermore, modifications such as N-methylation, monooxygenase activity, and oxidation contributed to the elongation of the A domains, while conferring pronounced affinities for certain α-AAMs. Notably, β-hydroxylation demonstrated particular preferences. Specifically, ornithine, threonine, tyrosine, and phenylalanine moieties frequently underwent atypical covalent modifications, and 41 modules were used iteratively. These insights significantly facilitate the identification of uncharacterized NRPS systems—expediting traditional identification processes—although novel modifications, unusual domain organizations, and dormant domains pose challenges for their accurate prediction.

Keywords: α-Amino acid moiety, adenylation domain, distribution algorithm, sequence alignment, signature sequences

Introduction

Nonribosomal peptides (NRPs) constitute a prominent family of structurally diverse bacterial secondary metabolites, characterized by their brevity in that they consist of 30 or fewer residues [1]. These peptides have a remarkable potential in biotechnological and biopharmaceutical applications based on their multifaceted properties, which include antiallergic, antitumor, and immunosuppressive effects, as well as siderophore and antibiotic functions [2-5]. Their antibiotic spectrum includes activities against the bacterium Mycobacterium tuberculosis which causes tuberculosis, fungi, bacteria, viruses, protozoa, and even insect pests [6-11]. NRPs are synthesized by nonribosomal peptide synthetases (NRPSs), large multifunctional enzymes with a modular architecture [12-14]. Typically, the protein subunits comprise modules that process a-amino acid moieties (α-AAMs), arranging them in a linear fashion. The structural and functional parallels between NRPs and polyketides (PKs) allow NRPSs to engage with polyketide synthases (PKSs) to form hybrid assembly lines that yield novel compounds [15]. To date, an extensive array of bacterial genera—including Bacillaceae, Burkholderia, Chondromyces, Cystobacter, Lysobacter, Myxococcus, Photorhabdus, Pseudomonas, Sorangium, Vibrio, and Xenorhabdus—has emerged as prolific producers of NRPs and hybrid PK–NRPs [16, 17].

A quintessential NRPS module comprises an adenylation (A) domain adjacent to a peptidyl carrier protein (PCP) domain and a condensation domain, and these components collaboratively catalyze each step of the condensation process and chain elongation [18-20]. The A domain is responsible for recognizing and activating specific α-AAMs before incorporating them into the final product. Previous investigations have demonstrated the structural conservation within these domains—highlighting 10 core motifs, designated from A1–A10—and have clarified the substrate specificity inherent in the A domains [21, 22]. Notably, 10 residues located at positions 235, 236, 239, 278, 299, 301, 322, 330, 331, and 517 act as specificity-conferring codes or signature sequences (SNSs). Based on these characteristics—particularly noting that all α-AAM-activating domains possess conserved Asp235 and Lys517 residues—the putative substrates potentially recruited by A domains can be effectively predicted. Furthermore, their positions are delineated within the sequence of the NRPS subunit GrsA phenylalanine (Phe)-activating domain (consisting of 530 residues). Several NRPS services, such as NP.Searcher, SBSPKSv2, AntiSMASH, and SANDPUMA, leverage these SNSs to reflect the confidence associated with each α-AAM predicted [23-26]. Partial A domains are also endowed with under-exploited sequences situated between the consensus sequence motifs of A8 and A9. This unique positioning facilitates the modification of core NRPs with N-methylation (N-Me), monooxygenase (MOX) activity, and oxidation (Ox) [27-29].

In recent decades, many catalytic substrates of NRPSs and hybrid PKS–NRPSs have been reported. These include 20 proteinogenic and 16 nonproteinogenic α-AAMs, and among the latter are compounds such as α-aminobutyrate (Abu) [29], α-aminoadipic acid (AMA) [30], α-amino-δ-nitropentanoic acid (ANPA) [31], azetidine α-carboxylic acid (AZC) [32], β-cyano-alanine (Cya-3) [33], α-γ-diaminobutyric acid (Dab) [34], a,β-diaminopropionic acid (Dap) [35], dehydro-α-aminopropanoic acid/dehydroalanine (Dha) [36], dehydrobutyrine (Dhb) [37], dehydrovaline (Dhv) [38], enduracididine (End) [39], homoarginine (Har) [40], γ-hydroxy-phenyl glycine (Hpg) [41], homoserine/homoserine lactone (Hse/Hsl) [42], ornithine (Orn) [43], and pipecolic acid (cyclic lysine) (Pip) [8]. However, many existing SNSs cannot precisely identify these compounds. Notably, many silent biosynthetic gene clusters (BGCs) related to NRPSs and PKS–NRPSs have been identified with genomic sequencing and bioinformatic methodologies—indicating a variability in SNS matches [7, 44-47]. In this study, we examined the SNSs associated with published A domains together with their respective substrates, which were categorized based on their similarities, and especially examined indistinguishable SNS profiles for atypical modifications, unconventional domain organizations, and inactive domains. Our finding extend our understanding of NRPS systems and suggest innovative frameworks with which to decode their biosynthetic intricacies.

Materials and Methods

Data Sources

In total, 321 NRPS and PKS–NRPS sequences were sourced from recognized databases, including the National Center for Biotechnology Information (NCBI, http://www.ncbi.nlm.nih.gov/), the Pseudomonas Genome Database (https://www.pseudomonas.com/), and AntiSMASH (https://antismash.secondarymetabolites.org/). These data included a diverse range of substrates, bacterial sources, and biological evaluations (Table S1). Of these sequences, we screened representatives from 67 bacterial species-approximately two-thirds of whose genomes have now been sequenced.

Determination of A Domain Sequences and SNSs

We used ClustalX version 1.81 and NCBI BLAST to construct intricate multiple alignments of the downloaded NRPS and PKS–NRPS sequences against the GrsA Phe-activating domain, culminating in the identification of 2051 A domain sequences. Ultimately, multiple analytical tools-including PKS/NRPS Analysis website (http://nrps.igs.umaryland.edu/), AntiSMASH, NCBI BLAST, ClustalX, and SnapGene (USA) - were used for the acquisition of the SNS data.

Multiple Alignment of SNSs

In an exhaustive analysis involving 36 substrates, a total of 508 SNSs clustered together. Appropriate α-AAMs displaying highly homologous SNSs were aggregated according to their occurrence frequencies. The sequences within each group were aligned with Excel 2010 and ClustalX, and their subsequent examination was based on observed similarities.

Results

Leucine-, Ioleucine-, Valine-/Dehydrovaline-, and γ-Hydroxyphenyl-Glycine-Activating Domains

Table S2 shows that among the 36 substrates examined, leucine (Leu), isoleucine (Ile), and valine (Val)—including its modified derivative, Dhv—were represented most abundantly, cumulatively with 525 occurrences (n).

A comprehensive analysis of 224 Leu-activating domains, 104 Ile-activating domains, and 197 Val-activating domains detected a total of 117 SNSs. The majority of these motifs were significantly conserved, conferring structural similarities (Tables S3−S5). However, they were still distinguishable. The most commonly encountered codes included DAWFLGNVVK, DAMFLGCTYK, and DALWIGGTFK (Table 1). This demonstrates that Leu-activating domains had the most-conserved SNS profiles. Furthermore, a group of 17 SNSs showed marked prevalence. It is noteworthy that approximately two-thirds of these SNS occurrences were represented across various species—illustrating the inherent sequence diversity within three distinct A domains (Tables S3−S5).

Table 1.

SNSs located in Leu-, Ile- and Val-activating domains and their frequency.

graphic file with name jmb-35-e2502030-t1.jpg

The phenomenon of substrate promiscuity associated with identical SNS patterns is further highlighted in Table 2. It is evident that Ile shares greater homology within its SNS structures with Val than with Leu, implying a closer structural relationship between Ile and Val, despite their classification as isomers. Furthermore, six notable sequences are expected to display considerable flexibility towards N-Me and β-hydroxylated (β-OH) moieties (Table 3). Simultaneously, DAWCIGAVCK, DLYNLSGVWK, and DAMHLGCTFK/DILHLGCTFK also manifest a capacity to accommodate Phe, Ala, and Hpg moieties, respectively. The sequence DAWWIGGTFK, derived from the Dhv-activating domain, is not identical to the sequences found in Val-activating domains, but impressively shares 90% homology with sequences such as DALWIGGTFK, DAWFIGGTFK, and DAWWLGGTFK.

Table 2.

Identical SNSs located in Leu-/Ile-, Leu-/Val-, Ile-/Val-, and Leu-/Ile-/Val-activating domains.

graphic file with name jmb-35-e2502030-t2.jpg

Table 3.

SNSs located in N-Me-Leu-/Val-, N-Me-Ile, β-OH-Leu-, Phe-/Leu-, Ala-/Leu-/Val-, Hpg-, and Dhvactivating domains.

graphic file with name jmb-35-e2502030-t3.jpg

Alanine-/α-Aminobutyrate-, Methionine-, Glycine-, and Cysteine-Activating Domains

In this study, we identified 155 alanine (Ala)-activating domains distributed across 73 distinct assembly lines (Table S2). Among the 46 SNSs examined, five carried two specific residues at positions 239 and 330 (n = 101; Table 4). The overall analysis identified the predominant variability, but six sequence clusters shared ≥ 80%identity. Furthermore, three sequence pairs accommodated serine (Ser), Abu, and glycine (Gly) moieties. Notably, DVFYLGGVFK, DVFYLGGVCK, and DVWYLGGICK incorporated methionine (Met) moieties, and two of them shared ≥ 70% homology with DVFWLGGTFK within the Ala-activating domain.

Table 4.

SNSs located in Val- and Met-activating domains and their similarities.

graphic file with name jmb-35-e2502030-t4.jpg

Concurrently, within the 72 megasynthases identified, we detected 93 Gly-activating domains (Table S2). Of these, 12 SNSs demonstrated remarkable (70%) homology, distinguished by three distinctive residues at positions 299, 322, and 330 (n = 83; Table 5). Furthermore, two sequence pairs shared 90% identity. MOX is a specialized domain that cleaves the carbon backbone inherent in Gly moieties. It is particularly noteworthy that four Gly-activating domains showed MOX characteristics and all shared the identical sequence DILQLGMIWK.

Table 5.

SNSs located in Gly-activating domains and their similarities.

graphic file with name jmb-35-e2502030-t5.jpg

Moreover, 42 megasynthases included 74 cysteine (Cys)-activating domains (Table S2). Remarkably, a predominant cluster comprising 17 homologous SNSs seems to have emerged through recombination events between Ala- and Gly-activating domains (Table S8). Residues Asp235, Leu236, Tyr/Trp239, and Asn278 trace back to the origins of the Ala-activating domain, whereas Leu/Met299, Leu/Met322, Ile/Val330, Trp331, and Lys517 derive from Gly-activating domains. Obviously, five of these have garnered considerable attention (n = 55). In addition, DLYNLSLIWK, DLYNMSLIWK, DLWNLSLIWK, DLYNLALVWK and DLYNWSLIWK catalyzed Ox modified moieties.

Ornithine-/α-Amino-δ-Nitropentanoic Acid-, Lysine-/Cyclic Lysine-, Proline-/Azetidine α-Carboxylic Acid-, and Arginine-/Enduracididine-/Homoarginine-Activating Domains

It is noteworthy that 147 Orn-/lysine (Lys, including Pip)-activating domains were distributed across 71 assembly lines (Table S2). Remarkably, ten SNSs derived from Orn-activating domains showed substantial divergence, with only 70% identity (Table 6). Fifteen SNSs had the capacity to accommodate modifications on the Orn moiety—specifically Nδ-hydroxylation (Nδ-OH), Nδ-acetylation (Nδ-acetyl), Nδ-formylation (Nδ-formyl), Nδ-butyrylation (Nδ-butyryl), and Nδ-nitrosylation (Nδ-nitroso). With exceptions such as DGEACGGVTK, DGECTGGITK, DGEGSGGVTK, and DVWNIGLIHK, these sequences can be readily distinguished from one another. Furthermore, six specific sequences were extended through an innovative modification involving Nδ-OH-Orn, subsequently undergoing cyclization into stable six-membered rings via the closure of the thioesterase domain. Consequently, these variants are predominantly situated at terminal positions. DMEDVGSVDK and DVETLGGISK also demonstrated an ability to assemble Lys and ANPA moieties, respectively.

Table 6.

SNSs located in Orn- and ANPA--activating domains and their similarities.

graphic file with name jmb-35-e2502030-t6.jpg

Equally remarkably, among the 58 Lys-activating domains examined, 13 SNSs were detected with notable frequency (n = 39; Table 7). Four sequences may facilitate Lys cyclization (i.e., Pip) between the N-terminal and ε-amino groups. Furthermore, two sequence pairs are representations of Pro and Arg moieties, respectively. Subsequent interaction analyses revealed that the sequence DGEDHGTVVK shares 90% identity with its counterpart DGEDHGTVTK, derived from Orn-activating domains.

Table 7.

SNSs located Lys (including Pip)-activating domains and their similarities.

graphic file with name jmb-35-e2502030-t7.jpg

We also established that 71 proline (Pro)-activating domains were distributed across 60 assembly lines (Table S2). Notably, 25 SNSs containing identical residues—specifically Val236 and Gln239—were frequently observed (n = 59; Table S11). Almost 90% of these instances have been grouped, with each showing 80% identity (Table 8). Three other sequences showing 70% identity were also identified. It is important to note that both sequences DVQCLSEVTK and DMQLVSQQVK have evolved separately to accommodate AZC.

Table 8.

SNSs located Pro- and AZC-activating domains and their similarities.

graphic file with name jmb-35-e2502030-t8.jpg

Thirty-one megasynthases collectively featured 37 arginine (Arg)-activating domains (Table S2). Although 29 SNSs were utilized infrequently, the majority showed a significant clustering tendency, including Asp235, Ala/Val236, Glu239, Asp278, Ile/Val/Leu299, Gly301, Ala322, Val/Ile330, Thr/Asp331, and Lys517 (Table S12). Furthermore, the sequences DIGDLGIIDK and DAEDVAAMIK were distinctly linked to α,β-dehydro-Arg and End, respectively, and feeding experiments confirmed their origin from Arg. Both showed degrees of clustering exceeding 70%, highlighting the intrinsic challenges associated with modifying the Arg moiety [48]. Notably, DVESIGGVTK showed particular versatility by also accommodating the Har moiety, suggesting structural similarity between them.

Glutamate-/Glutamine-, Aspartic Acid-/Asparagine-, β-Cyano-Alanine-, and α-Aminoadipic Acid-Activating Domains

We established 227 glutamate (Glu)-/glutamine (Gln)-activating domains and aspartic acid (Asp)-/asparagine (Asn)-activating domains positioned across 108 assembly lines (Table S2). Among the delineated Glu-/Gln-activating domains, 21 SNSs featured prominently (n = 95; Table 9). It is evident that the sequence DAQDLGVVDK recognizes both Glu and Gln moieties.

Table 9.

SNSs located Glu- and Gln-activating domains and their similarities.

graphic file with name jmb-35-e2502030-t9.jpg

Similarly, an analysis of Asp-/Asn-activating domains revealed the presence of 24 SNSs (Table S2). Remarkably, both domains showed an identical sequence DLTKVGHVGK (Table 10). However, most sequences favored the Asp moiety (Table S15). Moreover, 16 SNSs were characterized with n = 100. Notably, two-thirds of Asn-activating domains displayed 80% identity (Table 10). Further investigations indicated that various sequences can accommodate β-OH-Asp/-Asn moieties, implying strong amenability for modification within the Asp moiety. We also discovered that two sequence clusters from Asp- and Glu-activating domains shared ≥ 80% homology, and both Asn- and Gln-activating domains shared the identical sequence DAVQMGCVDK. Collectively, these findings suggest a cross-linkage among four distinct A domains—a phenomenon that may present slight challenges to their precise differentiation. DLTKIGEVGK is also compatible with the Cya-3 moiety, whereas DPRHLALLAK, specific for the AMA moiety, showed 80% identity with its counterpart DPRHVSLLAK, derived from Asn-activating domains.

Table 10.

SNSs located Asp-/AMA-and Asn/Cya-3-activating domains and their similarities.

graphic file with name jmb-35-e2502030-t10.jpg

Threonine-/Dhb-Activating Domains

A comprehensive analysis revealed that 213 threonine (Thr)-/Dhb-activating domains occurred within 127 assembly lines (Table S2). One-third of the SNSs that recognize both the Thr and Dhb moieties were characterized by two distinct residues located at positions 278 and 299 (n = 205; Table S16). Three other sequences—DMFCAGLIWK, DMFSAGLIWK, and DMFVAGLIWK—showed remarkable flexibility towards the Thr moiety, and shared 90% similarity, whereas DFWNIGMVHK, DFWSVGMVHK, DMFNFGVLWK, and DMFCNGIIWK also showed significant selectivity for N-Me-/N,O-diMe-, γ-Cl-, β-OH-Thr, and serine (Ser) moieties, respectively.

Serine-/Dehydroalanine- and Homoserine/Homoserine Lactone-Activating Domains

We identified 201 Ser-/Dha-activating domains-characterized by identical sequences DVWHLSLVDK and DVWHMSLVDK-arranged across 132 assembly lines (Tables S2 and S17). Ten SNSs contained two distinct residues at positions 299 and 330 (n = 191), four of which were also specific for Ox and N-Me modifications. DLKNVGSDVK and DLKNLGTDVK were also identified within the Hse/Hsl-activating domains, frequently accompanied by occurrences of Ser/Dha (Tables S1 and S17).

Tyrosine-, Phenylalanine-, and Tryptophan-Activating Domains

The 169 domains responsible for the activation of tyrosine (Tyr)-, Phe-, and tryptophan (Trp) were distributed across 114 assembly lines (Tables S2). Among these, seven SNSs derived from Tyr-activating domains displayed 70% similarity (Table 11). Homology of ≥ 80% was also observed among two distinct sequence clusters. Particularly noteworthy was the affinity shown by various sequences toward modifications such as β-OH, N-Me, ε-OH, β-Cl, β-Cl-β-OH, and β-OH-δ-Me-O-Me.

Table 11.

SNSs located Tyr-activating domains and their similarities.

graphic file with name jmb-35-e2502030-t11.jpg

Substantial variability was detected in the pool of 54 Phe-activating domains. However, the frequency of SNSs remained relatively low (n £ 5; Tables S2 and S19), eight of which had been grouped (Table 12). High homologies, up to 90%, were observed among two distinct sequence pairs. Four sequences recognized N-Me and β-OH-p-NO2 modifications, and 11 SNSs from Trp-activating domains were classified based on their ≥ 80% homology (n = 19; Table S20).

Table 12.

SNSs located Phe-activating domains and their similarities.

graphic file with name jmb-35-e2502030-t12.jpg

Modified Tyr and Phe moieties were activated with greater facility, paving the way for unconventional substrates to emerge. However, the corresponding sequences displayed only ≤ 60% identity, suggesting that novel moieties may be introduced through intricate modifications, concurrently altering SNSs. An identical sequence, DVSAIGCVTK, was present within these three distinct A domains (Table 11). In pairwise alignments between them, the Phe- and Trp-activating domains showed four identical sequences, which significantly surpassed the number shared between the Tyr- and Phe-activating domains, or between the Tyr- and Trp-activating domains (Table 12). This observation confirms the strong association between Phe and Trp moieties during the chain elongation processes.

α,γ-Diaminobutyric Acid, α,β-Diaminopropionic Acid-, and Histine-Activating Domains

The 86 Dab-activating domains were integrated into 43 assembly lines, seven of which were classified (n = 82; Tables S2 and 13). Notably, DVWQMIGDDK specifically recognizes the β-OH modification (Table 13). Intriguingly, two sequences to accommodate Dap moieties shared 80% identity with those derived from Dab-activating domains. This observation underscores the notable similarities between the Dab and Dap domains, although they are probably differentiated by subtle but significant variations.

Table 13.

SNSs located Dab- and Dap-activating domains and their similarities.

graphic file with name jmb-35-e2502030-t13.jpg

Finally, four SNSs were incorporated within 14 His-activating domains. Remarkably, the first two shared 90%homology and appeared most frequently (n = 11; Table S21). It is noteworthy that DSALIAEVWK can accept the β-OH modification. A multiple-sequence alignment revealed their weak homology with other α-AAMs, allowing the precise identification of His moieties.

Based on the genetic findings outlined above, over the past 2 years, NRPS services have shown that the genomes of various bacteria have BGCs encoding known NRPSs, including Bacillus amyloliquefaciens MR14M3 [49], Bacillus velezensis NDB [50], Bacillus halotolerans AQ11M9 [51], Bacillus paralicheniformis PBl 36 [52], and Bacillus subtilis BS21 and PBs12, together with 13 isolates of Xenorhabdus and Photorhabdus (Table S22) [53, 54]. They have also demonstrated genetically and biochemically that BGC5 (Paenibacillus brasilensis KACC 13842), corA (Corallococcus exiguus SDU70), crz123456 (Corallococcus coralloides B035), mgpABCDEF (two Burkholderiales strains), selAB (Burkholderia sp. FERM BP-3421) and solFGH (Dickeya solani MK10) orchestrate novel NRPs and PK–NRPs, including bracidin [55], coralinone [56], corallorazines [57], megapolipeptins [58], selethramide [59], and solanimycins [60]. The predicted α-AAMs align remarkably well with those characterized with traditional methods, such as liquid chromatography–mass spectrometry (LC–MS), LC–MS/MS, and nuclear magnetic resonance (NMR).

Discussion

This study demonstrates that bacterial genera contain a strong diversity of NRPS and PKS–NRPS chains. An exhaustive investigation into 2051 A domain sequences revealed the existence of 36 proteinogenic and non-proteinogenic α-AAMs, each characterized by its specific SNSs (roughly two thirds are shown in Table 14). It is clear that these motifs are frequently used. However, 89 SNSs have a demonstrated capacity to accommodate two or more α-AAMs, highlighting the complexity involved in their recognition. Distinguishing noncanonical moieties, such as Hpg, Abu, ANPA, Pip, AZC, Har, Cya-3, Dhb, and Dha, from their parent or structurally analogous proteinogenic α-AAM counterparts is a formidable challenge given their indistinguishable SNS profiles. Among these atypical compounds, Hpg—a rare phenylglycine—has been shown to play significant roles in structural stabilization [41, 61]. ANPA represents an unusual nitro α-AAM, which is exceedingly scarce in nature. Its biosynthesis involves the NRPS subunit ornithine MOX (MbaC), which facilitates N-OH conversion, whereas unprotected hydroxylamine undergoes spontaneous Ox to yield nitroso and nitro derivatives [31]. Pip features a hydrazine N–N bond formed through the action of piperazate synthase together with Pro. Together they impose conformational constraints on peptides-essential for pharmaceutical activities [62]. Although AZC has been detected in certain plant species, it remains relatively uncommon among bacteria [63]. Its biosynthetic pathway shows that azetidine originates from the intramolecular cyclization of Met-derived S-adenosylmethionine-a process akin to the synthesis observed in plant nicotianamine. The existence of two identical SNSs between AZC and Pro moieties opens new avenues for investigating the structural diversity and pharmaceutical functions within the NRPs. Studies of the structure–activity relationships of peptidomimetic inhibitors have indicated that elongating the side chain of Lys through the incorporation of a Har moiety yields more active and more-stable analogues [64]. The metabolic routes to Har formation in bacteria include amidinotransferase AmtA (in Pseudomonas) and inosamine-phosphate amidinotransferase McyK (in Fischerella) [65]. Notably, Cya-3 was exclusively detected within the PK–NRP albicidin [33]. Its activating domain encompasses an extraordinary insertion of 342 amino acids, probably affiliated with the adenosine nucleotide a-hydrolase superfamily, suggesting that the NRPS subunit AlbIV facilitates the transformation of Asn into Cya-3 in situ. Dhb and Dha, generated from Ser and Thr residues, respectively, have profound effects on the pharmaceutical properties [66]. Both often appear together with their biosynthetic precursors, so these NRPS modules must act iteratively.

Table 14.

Specific SNSs located 25 α-AAM-activating domains.

graphic file with name jmb-35-e2502030-t14.jpg

To enhance their structural complexity, the PCP-domain-bound α-AAMs may undergo further modification by cis-acting moieties, such as N-methylase, oxidase, or MOX. In this study, we have shown that the modification modalities of N-Me, Ox, and MOX typically induce the elongation of A domains. This elongation confers a pronounced affinity for specific moieties. However, some retain congruence with their parental structures (Table 15). Among these structural modification, N-Me is a prevalent alteration that contributes significantly to pharmaceutical properties by enhancing proteolytic stability, oral bioavailability, membrane permeability, and selective binding to target sites [67, 68]. The Ox modification transforms protein-bound Ser-S-PCP, Gln-S-PCP, and Cys-S-PCP to oxazole, 5-amino-3H-pyridine-2,6- dione, and thiazole, respectively, on the growing intermediates. These crucial structural motifs often play key roles in the related NRP activity [69]. The MOX modification engages with assembly lines to instate β-hydroxylation, followed by terminal amide formation through the cleavage of the carbon backbone—often contributing significantly to the pharmaceutical efficacy of the final product [70]. In this study, all MOX modules were integrated within Gly-activating domains. However, among the newly identified PK–NRP megapolipeptins biosynthesized, MgpD lacked a MOX module. Conversely, its adjacent MgpG could catalyze the hydroxylation and dealkylation of the Val moiety [58]. β-OH α-AAMs are ubiquitous across NRPS assembly lines [71]. In this study, β-OH showed distinct preferences while maintaining the invariant lengths of its A domains—indicating that this modification requires separately encoded hydroxylases, acting in trans. To date, four families of hydroxylases have been discovered within the NRPS biosynthetic pathways. Free hydroxyl groups often act as recognition sites for specific pharmaceutical targets. The Orn moiety is frequently encountered in various modified forms—most notably as unique variants, such as Nδ-OH-, Nδ-OH-Nδ-formyl, Nδ-OH-Nδ-acetyl, Nδ-OH-Nδ-butyryl, Nδ-OH-Nδ-nitroso-, and cyclo-Nδ-OH-Orn. Similarly, Thr, Tyr, and Phe are susceptible to atypical covalent modifications. However, most of these derivatives are indistinguishable from the parent α-AAMs. The inherent complexity associated with α-AAMs is further exacerbated by NRPS–PKS chains. Therefore, the heterologous reconstitution of PKS and NRPS domains in a combinatorial fashion can potentially generate an extensive library of “unnatural” natural products characterized by novel structures.

Table 15.

Usual modification modalities and their preferred moieties.

graphic file with name jmb-35-e2502030-t15.jpg

Importantly, during the complex process of peptide elongation, a total of 41 modules were used in an iterative manner (Table S23)—although some, such as XcnA, may have remained inactive [72]. In stark contrast to this highly coordinated process, their biosynthesis is characterized by iteration and nonlinear NRPSs. However, the fundamental mechanisms underlying these phenomena remain unclear. Nonetheless, notable advances have been made. The amphi-enterobactin “waiting room” model of biosynthesis has been proposed, wherein the thioesterase (TE) domain functions as a “waiting room” for the Ser moiety throughout each iterative cycle [73]. Cereulide is synthesized through TE-mediated trimerization and the macrocyclization of the anticipated tetradepsipeptide substrate based on the canonical activities of NRPS subunits CesA and CesB [74]. DidA operates bimodularly in an iterative fashion to incorporate three to four Gln residues into didemnins, which may involve an export-hydrolysis model [75]. The long-range transactivation whereby the tandem PCP domains of KorC are charged by the A domain of KorD could represent a rate-limiting step [76]. Consequently, unconventional domain organizations present substantial challenges for prediction, and inactive domains further complicate these intricacies.

In summary, over 80% of identified SNSs show exceptional selectivity, so aligning these sequence motifs can allow the analysis of unknown NRPSs while streamlining traditional identification methodologies, although substantial challenges persist due to novel modifications, unconventional domain organizations, and dormant domains, which complicate predictive efforts. However, incorporating these unpredictable elements diversifies and complicates the final product structure. Simultaneously, it inspires researchers to identify additional BGCs with suitable pharmacological activity. An array of bioinformatics tools—including Global Natural Products Social Molecular Networking (GNPS), metabolomics, and various machine learning algorithms-has been introduced into this field, so we believe that this will become an important topic in the foreseeable future.

Supplemental Materials

Supplementary data for this paper are available on-line only at http://jmb.or.kr.

Footnotes

Funding

This research was funded by the National Natural Science Foundation of China (NSFC, Grant No. 32072899), Science and Technology Innovation Teams of Shanxi Province (Grant No. 202204051001035), Fundamental Research Program of Shanxi Province (Grant No. 202203021211114), and Open Fund of Key Laboratory of Biotechnology and Bioresources Utilization (Dalian Minzu University), Ministry of Education (Grant No. KF2021007), China.

Conflict of Interest

The authors have no financial conflicts of interest to declare.

References

  • 1.Sharma K, Ghiffary MR, Kim HU, Lee SY. Engineering heterologous hosts for the enhanced production of non-ribosomal peptides. Biotechnol. Bioprocess Eng. 2020;25:795–809. doi: 10.1007/s12257-020-0080-z. [DOI] [Google Scholar]
  • 2.Kim GJ, Li X, Kim SH, Yang I, Hahn D, Chin J, et al. Seongsanamides A-D: antiallergic bicyclic peptides from Bacillus safensis KCTC 12796BP. Org. Lett. 2018;20:7539–7543. doi: 10.1021/acs.orglett.8b03293. [DOI] [PubMed] [Google Scholar]
  • 3.Romanowski SB, Lee S, Kunakom S, Paulo BS, Recchia MJJ, Liu DY, et al. Identification of the lipodepsipeptide selethramide encoded in a giant nonribosomal peptide synthetase from a Burkholderia bacterium. Proc. Natl. Acad. Sci. USA. 2023;120:e2304668120. doi: 10.1073/pnas.2304668120. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4.Chen H, Zhong L, Zhou H, Sun T, Zhong G, Tu Q, et al. Biosynthesis of glidomides and elucidation of different mechanisms for formation of β-OH amino acid building blocks. Angew Chem. Int. Ed. Engl. 2022;61:e202203591. doi: 10.1002/anie.202283562. [DOI] [PubMed] [Google Scholar]
  • 5.Patel KD, MacDonald MR, Ahmed SF, Singh J, Gulick AM. Structural advances toward understanding the catalytic activity and conformational dynamics of modular nonribosomal peptide synthetases. Nat. Prod. Rep. 2023;40:1550–1582. doi: 10.1039/D3NP00003F. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6.Zhao P, Hou P, Zhang Z, Li X, Quan C, Xue Y, et al. Microbial-derived peptides with anti-mycobacterial potential. Eur. J. Med. Chem. 2024;276:116687. doi: 10.1016/j.ejmech.2024.116687. [DOI] [PubMed] [Google Scholar]
  • 7.Borgio JF, Alhujaily R, Alfaraj AS, Alabdullah MJ, Alaqeel RK, Kaabi A, et al. Genome-guided identification of surfactinproducing Bacillus halotolerans AQ11M9 with anti-Candida auris potential. Int. J. Mol. Sci. 2024;25:10408. doi: 10.3390/ijms251910408. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8.Hashizume H, Sawa R, Kubota Y, Adachi S, Harada S, Igarashi M. Precursor-directed biosynthesis and biological activity of tripropeptin Cpip, a new tripropeptin C analog containing pipecolic acid. J. Antibiot. (Tokyo) 2024;77:238–244. doi: 10.1038/s41429-024-00703-9. [DOI] [PubMed] [Google Scholar]
  • 9.Baindara P, Chowdhury T, Roy D, Mandal M, Mandal SM. Surfactin-like lipopeptides from Bacillus clausii efficiently bind to spike glycoprotein of SARS-CoV-2. J. Biomol. Struct. Dyn. 2023;41:14152–14163. doi: 10.1080/07391102.2023.2196694. [DOI] [PubMed] [Google Scholar]
  • 10.Kegler C, Bode HB. Artificial splitting of a non-ribosomal peptide synthetase by inserting natural docking domains. Angew. Chem. Int. Ed. Engl. 2020;59:13463–13467. doi: 10.1002/anie.201915989. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11.Wirtz DA, Ludwig KC, Arts M, Marx CE, Krannich S, Barac P, et al. Biosynthesis and mechanism of action of the cell wall targeting antibiotic hypeptin. Angew Chem. Int. Ed. Engl. 2021;60:13579–13586. doi: 10.1002/anie.202102224. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12.Behsaz B, Bode E, Gurevich A, Shi YN, Grundmann F, Acharya D, et al. Integrating genomics and metabolomics for scalable non-ribosomal peptide discovery. Nat. Commun. 2021;12:3225. doi: 10.1038/s41467-021-24441-w. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13.Vinnik V, Zhang F, Park H, Cook TB, Throckmorton K, Pfleger BF, et al. Structural and biosynthetic analysis of the fabrubactins, unusual siderophores from Agrobacterium fabrum strain C58. ACS Chem. Biol. 2021;16:125–135. doi: 10.1021/acschembio.0c00809. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14.Bozhüyük KAJ, Präve L, Kegler C, Schenk L, Kaiser S, Schelhas C, et al. Evolution-inspired engineering of nonribosomal peptide synthetases. Science. 2024;383:eadg4320. doi: 10.1126/science.adg4320. [DOI] [PubMed] [Google Scholar]
  • 15.Wu Y, Wang M, Liu L. Advances on structure, bioactivity, and biosynthesis of amino acid-containing trans-AT polyketides. Eur. J. Med. Chem. 2023;262:115890. doi: 10.1016/j.ejmech.2023.115890. [DOI] [PubMed] [Google Scholar]
  • 16.Zhao P, Xue Y, Gao W, Li J, Zu X, Fu D, et al. Bacillaceae-derived peptide antibiotics since 2000. Peptides. 2018;101:10–16. doi: 10.1016/j.peptides.2017.12.018. [DOI] [PubMed] [Google Scholar]
  • 17.Xue Y, Wang M, Zhao P, Quan C, Li X, Wang L, et al. Gram-negative bacilli-derived peptide antibiotics developed since 2000. Biotechnol. Lett. 2018;40:1271–1287. doi: 10.1007/s10529-018-2589-1. [DOI] [PubMed] [Google Scholar]
  • 18.Folger IB, Frota NF, Pistofidis A, Niquille DL, Hansen DA, Schmeing TM, et al. High-throughput reprogramming of an NRPS condensation domain. Nat. Chem. Biol. 2024;20:761–769. doi: 10.1038/s41589-023-01532-x. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 19.Heard SC, Winter JM. Structural, biochemical and bioinformatic analyses of nonribosomal peptide synthetase adenylation domains. Nat. Prod. Rep. 2024;41:1180–1205. doi: 10.1039/D3NP00064H. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 20.Zhong L, Diao X, Zhang N, Li F, Zhou H, Chen H, et al. Engineering and elucidation of the lipoinitiation process in nonribosomal peptide biosynthesis. Nat. Commun. 2021;12:296. doi: 10.1038/s41467-020-20548-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 21.Kudo F, Miyanaga A, Eguchi T. Structural basis of the nonribosomal codes for nonproteinogenic amino acid selective adenylation enzymes in the biosynthesis of natural products. J. Ind. Microbiol. Biotechnol. 2019;46:515–536. doi: 10.1007/s10295-018-2084-7. [DOI] [PubMed] [Google Scholar]
  • 22.Okamoto T, Yamanaka K, Hamano Y, Nagano S, Hino T. Crystal structure of the adenylation domain from an ε-poly-l-lysine synthetase provides molecular mechanism for substrate specificity. Biochem. Biophys. Res. Commun. 2022;596:43–48. doi: 10.1016/j.bbrc.2022.01.053. [DOI] [PubMed] [Google Scholar]
  • 23.Arulprakasam KR, Dharumadurai D. Genome mining of biosynthetic gene clusters intended for secondary metabolites conservation in actinobacteria. Microb. Pathog. 2021;161:105252. doi: 10.1016/j.micpath.2021.105252. [DOI] [PubMed] [Google Scholar]
  • 24.Khater S, Gupta M, Agrawal P, Sain N, Prava J, Gupta P, et al. SBSPKSv2: structure-based sequence analysis of polyketide synthases and non-ribosomal peptide synthetases. Nucleic Acids Res. 2017;45:W72–W79. doi: 10.1093/nar/gkx344. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 25.Blin K, Shaw S, Augustijn HE, Reitz ZL, Biermann F, Alanjary M, et al. antiSMASH 7.0: new and improved predictions for detection, regulation, chemical structures and visualisation. Nucleic Acids. Res. 2023;51:W46–W50. doi: 10.1093/nar/gkad344. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 26.Chevrette MG, Aicheler F, Kohlbacher O, Currie CR, Medema MH. SANDPUMA: ensemble predictions of nonribosomal peptide chemistry reveal biosynthetic diversity across Actinobacteria. Bioinformatics. 2017;33:3202–3210. doi: 10.1093/bioinformatics/btx400. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 27.Wu C, Shang Z, Lemetre C, Ternei MA, Brady SF. Cadasides, calcium-dependent acidic lipopeptides from the soil metagenome that are active against multidrug-resistant bacteria. J. Am. Chem. Soc. 2019;141:3910–3919. doi: 10.1021/jacs.8b12087. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 28.Panter F, Krug D, Müller R. Novel methoxymethacrylate natural products uncovered by statistics-based mining of the Myxococcus fulvus secondary metabolome. ACS Chem. Biol. 2019;14:88–98. doi: 10.1021/acschembio.8b00948. [DOI] [PubMed] [Google Scholar]
  • 29.Pogorevc D, Tang Y, Hoffmann M, Zipf G, Bernauer HS, Popoff A, et al. Biosynthesis and heterologous production of argyrins. ACS Synth. Biol. 2019;8:1121–1133. doi: 10.1021/acssynbio.9b00023. [DOI] [PubMed] [Google Scholar]
  • 30.Masschelein J, Mattheus W, Gao LJ, Moons P, Van Houdt R, Uytterhoeven B, et al. A PKS/NRPS/FAS hybrid gene cluster from Serratia plymuthica RVH1 encoding the biosynthesis of three broad spectrum, zeamine-related antibiotics. PLoS One. 2013;8:e54143. doi: 10.1371/journal.pone.0054143. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 31.Franke J, Ishida K, Ishida-Ito M, Hertweck C. Nitro versus hydroxamate in siderophores of pathogenic bacteria: effect of missing hydroxylamine protection in malleobactin biosynthesis. Angew. Chem. Int. Ed. Engl. 2013;52:8271–8275. doi: 10.1002/anie.201303196. [DOI] [PubMed] [Google Scholar]
  • 32.Patteson JB, Lescallette AR, Li B. Discovery and biosynthesis of azabicyclene, a conserved nonribosomal peptide in Pseudomonas aeruginosa. Org. Lett. 2019;21:4955–4959. doi: 10.1021/acs.orglett.9b01383. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 33.Hashimi SM. Albicidin, a potent DNA gyrase inhibitor with clinical potential. J. Antibiot. (Tokyo) 2019;72:785–792. doi: 10.1038/s41429-019-0228-2. [DOI] [PubMed] [Google Scholar]
  • 34.Zhao H, Liu YP, Zhang LQ. In silico and genetic analyses of cyclic lipopeptide synthetic gene clusters in Pseudomonas sp. 11K1. Front. Microbiol. 2019;10:544. doi: 10.3389/fmicb.2019.00544. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 35.Patel KD, Oliver RA, Lichstrahl MS, Li R, Townsend CA, Gulick AM. The structure of the monobactam-producing thioesterase domain of SulM forms a unique complex with the upstream carrier protein domain. J. Biol. Chem. 2024;300:107489. doi: 10.1016/j.jbc.2024.107489. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 36.Murphy AC, Corney M, Monson RE, Matilla MA, Salmond GPC, Leeper FJ. Biosynthesis of antifungal solanimycin may involve an iterative nonribosomal peptide synthetase module. ACS Chem. Biol. 2023;18:1148–1157. doi: 10.1021/acschembio.2c00947. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 37.Wang S, Brittain WDG, Zhang Q, Lu Z, Tong MH, Wu K, et al. Aminoacyl chain translocation catalysed by a type II thioesterase domain in an unusual non-ribosomal peptide synthetase. Nat. Commun. 2022;13:62. doi: 10.1038/s41467-021-27512-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 38.Wang X, Zhou H, Chen H, Jing X, Zheng W, Li R, et al. Discovery of recombinases enables genome mining of cryptic biosynthetic gene clusters in Burkholderiales species. Proc. Natl. Acad. Sci. USA. 2018;115:E4255–E4263. doi: 10.1073/pnas.1720941115. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 39.Shukla R, Lavore F, Maity S, Derks MGN, Jones CR, Vermeulen BJA, et al. Teixobactin kills bacteria by a two-pronged attack on the cell envelope. Nature. 2022;608:390–396. doi: 10.1038/s41586-022-05019-y. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 40.Zhao L, Awori RM, Kaiser M, Groß J, Opatz T, Bode HB. Structure, biosynthesis, and bioactivity of photoditritide from Photorhabdus temperata Meg1. J. Nat. Prod. 2019;82:3499–3503. doi: 10.1021/acs.jnatprod.9b00932. [DOI] [PubMed] [Google Scholar]
  • 41.Yoshimura A, Covington BC, Gallant É, Zhang C, Li A, Seyedsayamdost MR. Unlocking cryptic metabolites with mass spectrometry-guided transposon mutant selection. ACS Chem. Biol. 2020;15:2766–2774. doi: 10.1021/acschembio.0c00558. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 42.Dose B, Ross C, Niehs SP, Scherlach K, Bauer JP, Hertweck C. Food-poisoning bacteria employ a citrate synthase and a type II NRPS to synthesize bolaamphiphilic lipopeptide antibiotics. Angew. Chem. Int. Ed. Engl. 2020;59:21535–21540. doi: 10.1002/anie.202009107. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 43.Hubrich F, Bösch NM, Chepkirui C, Morinaka BI, Rust M, Gugger M, et al. Ribosomally derived lipopeptides containing distinct fatty acyl moieties. Proc. Natl. Acad. Sci. USA. 2022;119:e2113120119. doi: 10.1073/pnas.2113120119. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 44.Ma Z. Genome mining and chemical characterization of a new cyclic lipopeptide associated with MDN-0066 from Pseudomonas moraviensis HN2 cultured in a valine-rich medium. J. Antibiot. (Tokyo) 2023;76:244–248. doi: 10.1038/s41429-023-00597-z. [DOI] [PubMed] [Google Scholar]
  • 45.Kirchner N, Cano-Prieto C, Schulz-Fincke AC, Gütschow M, Ortlieb N, Moschny J, et al. Discovery of thanafactin A, a linear, proline-containing octalipopeptide from Pseudomonas sp. SH-C52, motivated by genome mining. J. Nat. Prod. 2021;84:101–109. doi: 10.1021/acs.jnatprod.0c01174. [DOI] [PubMed] [Google Scholar]
  • 46.Paulo BS, Recchia MJJ, Lee S, Fergusson CH, Romanowski SB, Hernandez A, et al. Discovery of megapolipeptins by genome mining of a Burkholderiales bacteria collection. Chem. Sci. 2024;15:16567–16581. doi: 10.1039/D4SC03594A. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 47.Meesil W, Muangpat P, Sitthisak S, Rattanarojpong T, Chantratita N, Machado RAR, et al. Genome mining reveals novel biosynthetic gene clusters in entomopathogenic bacteria. Sci. Rep. 2023;13:20764. doi: 10.1038/s41598-023-47121-9. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 48.Guo C, Mandalapu D, Ji X, Gao J, Zhang Q. Chemistry and biology of teixobactin. Chemistry. 2018;24:5406–5422. doi: 10.1002/chem.201704167. [DOI] [PubMed] [Google Scholar]
  • 49.Borgio JF, Alhujaily R, Alquwaie R, Alabdullah MJ, AlHasani E, Alothman W, et al. Mining the nanotube-forming Bacillus amyloliquefaciens MR14M3 genome for determining anti-Candida auris and anti-Candida albicans potential by pathogenicity and comparative genomics analysis. Comput. Struct. Biotechnol. J. 2023;21:4261–4276. doi: 10.1016/j.csbj.2023.08.031. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 50.Wang Z, Zhang W, Wang Z, Zhang Z, Liu Y, Liu S, et al. Analysis of antimicrobial biological activity of a marine Bacillus velezensis NDB. Arch. Microbiol. 2024;206:131. doi: 10.1007/s00203-024-03861-4. [DOI] [PubMed] [Google Scholar]
  • 51.Borgio JF, Alhujaily R, Alfaraj AS, Alabdullah MJ, Alaqeel RK, Kaabi A, et al. Genome-guided identification of surfactinproducing Bacillus halotolerans AQ11M9 with anti-Candida auris potential. Int. J. Mol. Sci. 2024;25:10408. doi: 10.3390/ijms251910408. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 52.Ashajyothi M, Mahadevakumar S, Venkatesh YN, Sarma PVSRN, Danteswari C, Balamurugan A, et al. Comprehensive genomic analysis of Bacillus subtilis and Bacillus paralicheniformis associated with the pearl millet panicle reveals their antimicrobial potential against important plant pathogens. BMC Plant. Biol. 2024;24:197. doi: 10.1186/s12870-024-04881-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 53.Wu D, Fu L, Cao Y, Dong N, Li D. Genomic insights into antimicrobial potential and optimization of fermentation conditions of pig-derived Bacillus subtilis BS21. Front. Microbiol. 2023;14:1239837. doi: 10.3389/fmicb.2023.1239837. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 54.Meesil W, Muangpat P, Sitthisak S, Rattanarojpong T, Chantratita N, Machado RAR, et al. Genome mining reveals novel biosynthetic gene clusters in entomopathogenic bacteria. Sci. Rep. 2023;13:20764. doi: 10.1038/s41598-023-47121-9. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 55.Kim MS, Jeong DE, Jang JP, Jang JH, Choi SK. Mining biosynthetic gene clusters in Paenibacillus genomes to discover novel antibiotics. BMC Microbiol. 2024;24:226. doi: 10.1186/s12866-024-03375-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 56.Zhu LL, Yang Q, Wang DG, Niu L, Pan Z, Li S, et al. Deciphering the biosynthesis and physiological function of 5-methylated pyrazinones produced by myxobacteria. ACS Cent. Sci. 2024;10:555–568. doi: 10.1021/acscentsci.3c01363. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 57.Dreckmann TM, Fritz L, Kaiser CF, Bouhired SM, Wirtz DA, Rausch M, et al. Biosynthesis of the corallorazines, a widespread class of antibiotic cyclic lipodipeptides. RSC Chem. Biol. 2024;5:970–80. doi: 10.1039/D4CB00157E. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 58.Paulo BS, Recchia MJJ, Lee S, Fergusson CH, Romanowski SB, Hernandez A, et al. Discovery of megapolipeptins by genome mining of a Burkholderiales bacteria collection. Chem. Sci. 2024;15:16567–16581. doi: 10.1039/D4SC03594A. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 59.Romanowski SB, Lee S, Kunakom S, Paulo BS, Recchia MJJ, Liu DY, et al. Identification of the lipodepsipeptide selethramide encoded in a giant nonribosomal peptide synthetase from a Burkholderia bacterium. Proc. Natl. Acad. Sci. USA. 2023;120:e2304668120. doi: 10.1073/pnas.2304668120. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 60.Murphy AC, Corney M, Monson RE, Matilla MA, Salmond GPC, Leeper FJ. Biosynthesis of antifungal solanimycin may involve an iterative nonribosomal peptide synthetase module. ACS Chem. Biol. 2023;18:1148–1157. doi: 10.1021/acschembio.2c00947. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 61.Voitsekhovskaia I, Ho YTC, Klatt C, Müller A, Machell DL, Tan YJ, et al. Altering glycopeptide antibiotic biosynthesis through mutasynthesis allows incorporation of fluorinated phenylglycine residues. RSC Chem. Biol. 2024;5:1017–1034. doi: 10.1039/D4CB00140K. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 62.García-Gutiérrez C, Pérez-Victoria I, Montero I, Fernández-De la Hoz J, Malmierca MG, Martín J, et al. Unearthing a cryptic biosynthetic gene cluster for the piperazic acid-bearing depsipeptide diperamycin in the ant-dweller Streptomyces sp. CS113. Int. J. Mol. Sci. 2024;25:2347. doi: 10.3390/ijms25042347. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 63.Hong Z, Bolard A, Giraud C, Prévost S, Genta-Jouve G, Deregnaucourt C, et al. Azetidine-containing alkaloids produced by a quorum-sensing regulated nonribosomal peptide synthetase pathway in Pseudomonas aeruginosa. Angew Chem. Int. Ed. Engl. 2019;58:3178–3182. doi: 10.1002/anie.201809981. [DOI] [PubMed] [Google Scholar]
  • 64.Tymecka D, Redkiewicz P, Lipiński PFJ, Misicka A. Peptidomimetic inhibitors of the VEGF-A165/NRP-1 complex obtained by modification of the C-terminal arginine. Amino Acids. 2024;56:49. doi: 10.1007/s00726-024-03411-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 65.Ouyang X, Wahlsten M, Pollari M, Delbaje E, Jokela J, Fewer DP. Identification of a homoarginine biosynthetic gene from a microcystin biosynthetic pathway in Fischerella sp. PCC 9339. Toxicon. 2024;243:107733. doi: 10.1016/j.toxicon.2024.107733. [DOI] [PubMed] [Google Scholar]
  • 66.Wang S, Fang Q, Lu Z, Gao Y, Trembleau L, Ebel R, et al. Discovery and biosynthetic investigation of a new antibacterial dehydrated non-ribosomal tripeptide. Angew. Chem. Int. Ed. Engl. 2021;60:3229–3237. doi: 10.1002/anie.202012902. [DOI] [PubMed] [Google Scholar]
  • 67.Li Z, Jiao Y, Ling J, Zhao J, Yang Y, Mao Z, et al. Characterization of a methyltransferase for iterative N-methylation at the leucinostatin termini in Purpureocillium lilacinum. Commun. Biol. 2021;7:757. doi: 10.1038/s42003-024-06467-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 68.Matabaro E, Witte L, Gherlone F, Vogt E, Kaspar H, Künzler M. Promiscuity of omphalotin A biosynthetic enzymes allows de novo production of non-natural multiply backbone N-methylated peptide macrocycles in yeast. Chembiochem. 2024;25:e202300626. doi: 10.1002/cbic.202300626. [DOI] [PubMed] [Google Scholar]
  • 69.Pang B, Chen Y, Gan F, Yan C, Jin L, Gin JW, et al. Investigation of indigoidine synthetase reveals a conserved active-site base residue of nonribosomal peptide synthetase oxidases. J. Am. Chem. Soc. 2020;142:10931–10935. doi: 10.1021/jacs.0c04328. [DOI] [PubMed] [Google Scholar]
  • 70.Wirtz DA, Schneberger N, Klöppel S, Richarz R, Geyer M, König GM, et al. Adenylation domain-guided recruitment of transacting nonheme monooxygenases in nonribosomal peptide biosynthesis. ACS Chem. Biol. 2023;18:1748–1759. doi: 10.1021/acschembio.3c00106. [DOI] [PubMed] [Google Scholar]
  • 71.Chen H, Zhong L, Zhou H, Sun T, Zhong G, Tu Q, et al. Biosynthesis of glidomides and elucidation of different mechanisms for formation of β-OH amino Acid building blocks. Angew. Chem. Int. Ed. Engl. 2022;61:e202203591. doi: 10.1002/anie.202283562. [DOI] [PubMed] [Google Scholar]
  • 72.Park D, Ciezki K, van der Hoeven R, Singh S, Reimer D, Bode HB, Forst S. Genetic analysis of xenocoumacin antibiotic production in the mutualistic bacterium Xenorhabdus nematophila. Mol. Microbiol. 2009;73:938–949. doi: 10.1111/j.1365-2958.2009.06817.x. [DOI] [PubMed] [Google Scholar]
  • 73.Zane HK, Naka H, Rosconi F, Sandy M, Haygood MG, Butler A. Biosynthesis of amphi-enterobactin siderophores by Vibrio harveyi BAA-1116: identification of a bifunctional nonribosomal peptide synthetase condensation domain. J. Am. Chem. Soc. 2014;136:5615–5618. doi: 10.1021/ja5019942. [DOI] [PubMed] [Google Scholar]
  • 74.Heberlig GW, Boddy CN. Thioesterase from cereulide biosynthesis is responsible for oligomerization and macrocyclization of a linear tetradepsipeptide. J. Nat. Prod. 2020;83:1990–1997. doi: 10.1021/acs.jnatprod.0c00333. [DOI] [PubMed] [Google Scholar]
  • 75.Xu Y, Kersten RD, Nam SJ, Lu L, Al-Suwailem AM, Zheng H, et al. Bacterial biosynthesis and maturation of the didemnin anticancer agents. J. Am. Chem. Soc. 2012;134:8625–8632. doi: 10.1021/ja301735a. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 76.Evans BS, Ntai I, Chen Y, Robinson SJ, Kelleher NL. Proteomics-based discovery of koranimine, a cyclic imine natural product. J. Am. Chem. Soc. 2011;133:7316–7319. doi: 10.1021/ja2015795. [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials


Articles from Journal of Microbiology and Biotechnology are provided here courtesy of Korean Society for Microbiology and Biotechnology

RESOURCES