Abstract
Microcin J25 (MccJ25), a lasso peptide antibiotic with a unique structure that resembles the lariat knot, has been a topic of intense interest since its discovery in 1992. The precursor (McjA) contains a leader and a core segment. McjB is a protease activated upon binding to the leader, and McjC converts the core segment into the mature MccJ25. Previous studies suggested that these biosynthetic steps likely proceed in a (nearly) concerted fashion; however, there is only limited information regarding the structural and molecular intricacies of MccJ25 biosynthesis. To close this knowledge gap, we used AlphaFold2 to predict the structure of the precursor (McjA) in complex with its biosynthetic enzymes (McjB and McjC) and queried the critical predicted features by protein engineering. Based on the predicted structure, we designed protein variants to show that McjB can still be functional and form a proficient biosynthetic complex with McjC when its recognition and protease domains were circularly permutated or split into separate proteins. Specific residues important for McjA recognition were also identified, which permitted us to pinpoint a compensatory mutation (McjBM108T) to restore McjA/McjB interaction that rescued an otherwise nearly nonproductive precursor variant (McjAT–2M). Studies of McjA, McjB, and McjC have long been mired by them being extremely difficult to handle experimentally, and our results suggest that the AF2 predicted ternary complex structure may serve as a reasonable starting point for understanding MccJ25 biosynthesis. The prediction-validation workflow presented herein combined artificial intelligence and laboratory experiments constructively to gain new insights.
Introduction
Microcin J25 (MccJ25) is a potent antibiotic with a lasso structure (Figure 1a).1−3 It is a peptide natural product that consists of 21 amino acids, whose C-terminal tail passes through a macrocycle that results from the formation of an isopeptide bond between the Gly1 N-terminal amine and the Glu8 side-chain carboxylate. This threaded lasso configuration is stabilized by two bulky residues immediately above (Phe19) and below (Tyr20) the macrolactam ring and remains intact against chemical and thermal denaturation.4,5 MccJ25 was the first antibiotic known to inhibit transcription by blocking the secondary channel of the bacterial RNA polymerase complex.6 As a natural product with a unique structure and mechanism of action, MccJ25 has fascinated scientists since its discovery in 1992.7,8 However, despite extensive interest over the past three decades, the structural and molecular details of MccJ25 biosynthesis have remained elusive. Herein, we combined AlphaFold2 (AF2) structure prediction9 and protein engineering, including domain swapping, cyclic permutation, site-directed mutagenesis, and synthetic rescue, to gain new insights into the interaction between the MccJ25 precursor peptide (McjA) and its biosynthetic enzymes (McjB and McjC).
MccJ25 is a ribosomally synthesized and post-translationally modified peptide (RiPP). A lasso peptide biosynthetic gene cluster (BGC) in general encodes three genes (A–C).10,11 They are named mcjA, mcjB, and mcjC in the MccJ25 BGC,12 which contains an additional membrane transporter (McjD) that confers self-resistance by exporting the mature MccJ25.13 McjA is the precursor peptide (Figure 1b), which includes a leader segment (McjA(−37)to(−1)) and a core segment (McjA1–21). Residues of the core peptide were numbered 1 to 21 and those of the leader peptide were assigned negative numbers. MccJ25 maturation is catalyzed by the biosynthetic enzymes McjB and McjC (Figure 1c).14,15 Specifically, McjB cleaves the amide bond that connects the leader and the core, and McjC catalyzes the formation of the macrolactam that entraps its own tail into a threaded configuration. Furthermore, in vitro assays showed that McjB and McjC are only active in the presence of each other,15 suggesting that they likely act in a (nearly) concerted fashion to catalyze the maturation of MccJ25. While this phenomenon is interesting from a molecular mechanism viewpoint, it makes elucidating the structural and molecular details of MccJ25 biosynthesis significantly more challenging.
The conversion of a RiPP precursor into the final natural product generally entails the following steps.16 The leader peptide first interacts with a conserved domain in one of the biosynthetic enzymes, called the RiPP precursor recognition element (RRE).17 This interaction is the gatekeeping event that triggers further post-translational modification(s). In lasso peptide biosynthesis, a protease is activated upon leader recognition by the RRE to cleave the precursor into the leader and the core peptides.18 Interestingly, the RRE and protease domains may exist either as a single didomain protein, such as McjB in the MccJ25 BGC, or discretely as separate proteins, such as PadeB1 and PadeB2 in the paeninodin BGC.19 The open reading frame (ORF) that encodes a fused enzyme is termed the “B” gene, whereas separate RRE and protease are usually termed the B1 and B2 proteins, respectively, a naming convention we adhered to throughout this manuscript. Note that the RRE and protease in lasso peptide BGC are occasionally termed the B and E proteins.
In vitro assays for leader recognition and/or cleavage, either by separate B1/B2 proteins or a fused B enzyme, have been developed to study the biosynthesis of fusilassin (also known as fuscanodin), lariatin, paeninodin, burhizin, and therbactin.20−26 These assays revealed residues key to leader recognition by the RRE to activate the protease. Furthermore, X-ray crystal structures of B1 proteins, including RRE/leader peptide cocrystal structures for fusilassin and therbactin biosynthesis, were reported recently and provided details at atomic resolution (Table S1).18,27 Unfortunately, as the lasso peptide that initiated the field of supramolecular natural product research, not nearly as much is known about MccJ25 biosynthesis. Herein, AF2 was used to predict the structure of the McjA/McjB/McjC ternary complex, and its critical features were confirmed by MccJ25 production using engineered biosynthetic enzymes designed based on the predicted structure. The AF2 prediction is therefore a reasonable model and can serve as the starting point toward understanding the structural basis of MccJ25 biosynthesis.
The study of MccJ25 biosynthesis has been mired by McjA, McjB, and McjC being very difficult to handle experimentally. McjA alone is unstructured and highly susceptible to proteases; most His-tagged McjB and McjC end up in inclusion bodies, supplying as little as tens of micrograms of pure proteins per liter of culture.14 These enzymes were obtained at low yields even when fused to a maltose binding protein (MBP),15 and MBP-McjC still forms high-molecular-weight aggregates consisting of 10 or more monomers (Figure S1). More challenging still, the enzymatic actions of McjB (leader activation and cleavage) and McjC (macrolactam formation) cannot be studied in separate assays as they depend upon each other for activation,14,15 hinting at (nearly) concerted biosynthetic steps. As such, it may require a high-resolution structure of the McjA/McjB/McjC ternary complex, as opposed to individual proteins, to fully understand the molecular details of MccJ25 maturation, making an already formidable task even more demanding.
Results
AF2 Predicts a McjA/McjB/McjC Ternary Complex
In light of these struggles, we used AF2 to foray into the structural and molecular details of MccJ25 biosynthesis.9 We are aware of a number of caveats regarding such an endeavor.28 An AF2 predicted structure is usually accurate when the subject is homologous to proteins whose structures have already been determined. Structure predictions of proteins with no experimentally characterized homologues, as well as protein–protein interaction, are generally taken with a grain of salt. In our case, AF2 was not only tasked with predicting the structure of the McjA/McjB/McjC ternary complex, but there were reasons to believe that each of the three components poses challenges. As a 58-residue linear peptide, McjA is intrinsically disordered and without a defined structure.29 Even though McjB is known to be a cysteine protease, it is an atypical member of this enzyme family as it requires ATP to operate and is inactive without the presence of McjC.15 As for McjC, it is only loosely homologous to asparagine synthetase (∼20% identity and ∼38% similarity),30,31 and no lasso peptide synthetase has ever been structurally characterized.
McjA, McjB, and McjC were submitted as separate polypeptide sequences to AF2. Gratifyingly, the three proteins were predicted to form a ternary complex that shows structural features consistent with what is known about MccJ25 biosynthesis and RiPP maturation in general (Figure 2a). Specifically, the McjB predicted structure shows discrete N-terminal (B1) and C-terminal (B2) domains. The former displays a typical winged helix-turn-helix fold that harbors the RRE (Figure 2b); the latter shows a Cys-His-Asp catalytic triad typical of cysteine proteases, wherein the side-chain carboxylate, imidazole, and thiol moieties are aligned (Figure 2c). The thiol points directly at the scissile amide bond (McjAK–1/G1) that connects the leader and the core segments of the precursor (Figure 2d). The predicted structure also offered some new insights. For example, the active sites of McjB and McjC face each other and sandwich the leader peptide in between. The contact surface area between McjB and McjC is approximately 1000 Å2 and appears to be driven by a network of at least 10 hydrogen bonds (Figure S2).32 It is hypothesized that McjC stabilizes the core peptide in a folded conformation that restricts the motion of its own C-terminal tail when macrolactam formation takes place.29,33 While AF2 did predict the core peptide to extend into a deep cavity in McjC, the hypothesized prefolding mechanism of the core peptide was not evident in the predicted structure. The side-chain carboxylate of Glu8 (McjAE8) seems poised to be adenylated to form an active ester as it sits next to the ATP binding site in McjC (Figure S3).
Unless guided by structural information, modification of any of the three proteins will very likely disrupt the intricate coordination of this McjA/McjB/McjC biosynthetic complex and abolish MccJ25 production. On the other hand, protein engineering guided by (predicted) structural information shall have a higher rate of success. We designed a series of McjA, McjB, and McjC variants meant for testing critical structural features predicted by AF2 at various scales, from the orientation of interaction between proteins, to domain boundary within a protein, to specific contacts between individual amino acid residues. If a variant disrupted the McjA/McjB/McjC ternary complex, little or no MccJ25 would be produced. Conversely, robust MccJ25 production would suggest that AF2 presented a reasonable model for the structural feature probed by that particular protein variant. Protein variants were constructed by directly modifying the plasmid commonly used for MccJ25 production (pTUC202).34 Residues of interest were replaced and tested one at a time while leaving all other proteins unchanged (Table S2); MccJ25 production was determined by inspecting culture extracts in accordance with published procedures (Figure S4).35,36
McjB/McjC Interface and Their Termini
The MBP (396 aa) is a commonly used tag to solubilize and stabilize recombinant proteins; it is almost twice as large as McjB (208 aa) and about three-fourths the size of McjC (513 aa). McjB and McjC are known to still be functional when the MBP is fused to their N-termini.15 For all of our protein variants (engineered McjA, McjB, or McjC in the pTUC202 vector expressed in Escherichia coli), culture extracts were first analyzed by MALDI-TOF MS, and those that showed the expected MccJ25 m/z signals were then subjected to an inhibition zone assay and LC quantitation. For the former, a 2-fold culture extract dilution series was prepared and spotted onto a bacterial lawn to compare the sizes of the growth inhibitory zones they generate. The latter entails integrating the MccJ25 peak area in an LC trace (Table 1), wherein caffeine was coinjected as the internal standard for quantitation.
Table 1. MccJ25 Production Yields for Various Protein Variants.
construct | peak areaa,b | yieldc |
---|---|---|
WT | 14 ± 2.3 × 103 | 100% |
MBP fusions | ||
MBP-McjB | ref (15)d | |
MBP-McjC | ref (15)d | |
McjB-MBP | 8.5 ± 1.2 × 103 | 61% |
McjC-MBP | 12 ± 0.6 × 103 | 86% |
McjB/McjC interface | ||
McjBF67R | 82 ± 73 | 0.6% |
circular permutations | ||
McjBCP60/61 | 51 ± 13 | 0.4% |
McjBCP70/71 | n.d. | |
McjBCP80/81 | 610 ± 137 | 4.4% |
split McjB | ||
McjBS60/61 | n.d. | |
McjBS70/71 | n.d. | |
McjBS80/81 | 5.8 ± 0.3 × 103 | 41% |
McjA/McjC recognition | ||
McjCK337E | 1.1 ± 0.2 × 103 | 8.1% |
McjCS440Y | 324 ± 86 | 2.4% |
synthetic rescue | ||
McjAT–2F | n.d. | |
McjAT–2F/McjBF23T | n.d. | |
McjAT–2M | 145 ± 2 | 1.0% |
McjAT–2M/McjBM108T | 755 ± 56 | 5.4% |
Area under the MccJ25 peak in an HPLC trace was integrated. All samples contained the same concentration of caffeine (100 μg/mL) as a quantitation standard.
“n.d.” denotes not detected.
Average yield relative to that of the WT; all assays were done in triplicate (n = 3).
Rebuffat and co-workers reported the production of MccJ25 using these constructs in an in vitro reconstitution experiment.15
We constructed C-terminal MBP fusions of McjB and McjC and tested them, one at a time, in a background of otherwise native BGC. The MccJ25 production of these constructs were compared to the WT MccJ25 BGC via both qualitative and quantitative methods described above (Figure 3a–3c). These constructs showed only a slight decrease in MccJ25 production yield compared with the native enzymes (Table 1), suggesting that having a sizable protein tag on the C-terminus of either McjB and McjC does not interfere with the formation of a catalytically proficient enzyme complex. The AF2 predicted structure of the McjA/McjB/McjC ternary complex is in line with these observations, which shows the N- and C-termini of both McjB and McjC all pointing away from their interaction interface.
We also wanted to probe the McjB/McjC interface. As a residue on the protein surface and far away from the catalytic triad, variants of McjBF67 are expected to still be proficient proteases. However, AF2 predicted that this residue (McjBF67) engages in a cation-π interaction with McjCR438 and also lies at the heart of a network of hydrogen bonds at the McjB/McjC interface (Figure S2). The McjBF67R variant, wherein the Phe aromatic ring is replaced by the positively charged guanidinium in Arg, should abolish both types of noncovalent attraction described above. The resulting McjB/McjC interaction would be much weaker and result in decreased MccJ25 production. MccJ25 production for the McjBF67R construct did turn out to be much lower compared to that for the WT (0.6%, Table 1 and Figure S5a).
Circular Permutation of McjB
Upon closer examination of the AF2 predicted structure, we noticed that the N- and C-termini of McjB are spatially very close to each other (Figure 4a). Their Cα atoms are merely 12.3 Å apart. This inspired us to explore the possibility of circular permutation (CP), which can be viewed as if a string was circularized and then cut open at a different site.37,38 In a CP protein, the N- and C-termini of the native protein are connected (usually via a short linker) and new termini are created elsewhere in the native sequence.39 The resulting CP protein variant has two stretches of amino acid sequences that are each identical to part of the native protein, yet their connectivity is rearranged. To generate McjB CP variants, we used a short flexible spacer (GGSSGG) to link its original termini and chose the positions of the new termini based on sequence analysis and the AF2 predicted structure.
While McjB is a single polypeptide, the B enzymes of Gram-positive bacterial lasso peptide BGCs often exist as separate B1 and B2 proteins.26 The B1/B2 domain boundary in McjB would be the ideal new terminus if a CP variant was to be generated. A collection of 20 phylogenetically diverse lasso peptide BGCs that harbor discrete B1 and B2 ORFs was compiled (Table S3).40 The B1 proteins and the B2 proteins were then aligned separately to McjB. All B1 and B2 proteins were aligned to the N- and C-termini of McjB, respectively. In the 61-to-80 segment (numbering based on the McjB sequence), sequence alignments showed multiple sharp edges (Figure 4b,4c) and logo plots identified multiple highly conserved residues (Figure 4d,4e). These results suggest that residues 61 to 80 in McjB are likely where its B1/B2 domain boundary is located. The AF2 predicted structure corroborated this notion, showing two discretely folded domains connected by a stretch of amino acid residues (59 to 83) without a well-defined secondary structure.
Based on the above analysis, we generated three variants, McjBCP60/61, McjBCP70/71, and McjBCP80/81, wherein McjBCPX/X+1 denotes a CP variant with new C- and N-termini at residue X and X+1, respectively (Figure 3d–3f). Out of the three constructs, two showed MccJ25 production, wherein the yield of the McjBCP80/81 construct was about an order of magnitude higher than that of McjBCP60/61 (Table 1). These results are the first reports of a circularly permutated lasso peptide biosynthetic enzyme.
Splitting McjB into Two B1 and B2 Proteins
Marahiel and co-workers showed that the B enzyme of the rubrivinodin BGC (RugeB) can be split into B1 and B2 proteins and still produce the same lasso peptide, albeit at a much lower yield.19 In the case of MccJ25 production, if McjB were to be split into two proteins, the split site must be chosen carefully to ensure that the intricate interaction is preserved, and the resulting B1 and B2 proteins can still form a proficient enzyme complex with McjC to catalyze McjA maturation. We again consulted sequence alignment results and the AF2 predicted structure. To split McjB, we inserted in mcjB a 32-nucleotide spacer that includes a stop codon (TAA), a short spacer, a ribosome binding site (AAGGAG), and a start codon (ATG). Three constructs were generated (McjBS60/61, McjBS70/71, and McjBS80/81), wherein McjBSX/X+1 denotes splitting the native McjB to create a B1 protein that ends at residue X and a B2 protein that starts at residue X + 1 (Figure 3g–3i). Out of the three constructs, one variant showed robust production (McjBS80/81), yielding slightly less than half as much of MccJ25 (41%) compared to the WT (Table 1). These data again suggest that the AF2 predicted structure of the McjA/McjB/McjC complex is accurate enough to help guide the design of protein variants.
Disrupting McjA Recognition by McjC
The predicted McjC structure showed a central cavity that accommodates the core peptide (McjA1–21) and is consistent with the hypothetical prefolding mechanism for MccJ25 biosynthesis. Presumably, McjC stabilizes the core peptide in a conformation, wherein McjAG1-E8 wraps around the tail such that the threaded structure characteristic of MccJ25 is generated upon formation of the Gly1-Glu8 isopeptide bond to create the macrolactam ring (Figure 1). AF2 provided insights into how McjC controls the position and orientation of the McjAE8 side-chain carboxylate (Figure 5a,5b), both of which are of paramount importance in such a biosynthetic mechanism. There is an apparent salt bridge between the McjCK337 side-chain amine and the McjAE8 side-chain carboxylate; the two functional groups are only 2.9 Å apart in the predicted ternary complex structure. We constructed the McjCK337E variant to probe this predicted feature, wherein the Lys-to-Glu substitution was expected to eliminate the salt bridge interaction and disrupt McjA recognition by McjC. The McjCK337E variant produced MccJ25 at a much lower yield compared to the WT (8.1%, Table 1 and Figure 5c). In addition, the McjCS440Y variant was generated to probe the effect of steric hindrance. This residue is on the rim of the McjC cavity and is spatially close to McjAE8. Replacing Ser with Tyr, a much larger residue that can still engage in hydrogen bonding, is expected to negatively impact McjA/McjC interaction. Indeed, this construct produced MccJ25 at a much lower yield compared to that of the WT as well (2.4%, Table 1 and Figure 5d).
Synthetic Rescue of McjA Recognition by McjB
Perhaps the ultimate test of the AF2 predicted structure of the McjA/McjB/McjC ternary complex is to use it to pinpoint the molecular basis of the McjA/McjB interaction. Link and co-workers showed that truncating up to 28 N-terminal residues of the leader peptide still led to detectable MccJ25 production,41 suggesting that leader peptide recognition by McjB for the most part can be attributed to the rest of the eight C-terminal residues. The importance of Thr(−2), which refers to the second to the last residue of the leader peptide (McjAT–2), was also well established.42 Mutations at this position often negatively impact MccJ25 production. For example, the T-2 M variant yielded only ∼1% of the amount of MccJ25 compared with the WT, and the T-2F variant failed to produce any detectable amount of MccJ25. In addition, a large-scale bioinformatic analysis of lasso peptide BGCs revealed that residues at this penultimate “minus-two” position of the leader peptide is highly conserved, of which 94% are Thr.26 However, the binding pocket in McjB that is involved in McjA (leader peptide) recognition has not been identified.
In the AF2 predicted structure, the side chain of Thr(−2) fits snugly in a binding pocket formed collectively by four McjB residues, including one from the B1 domain (F23) and three from the B2 domain (M108, L151, and S181) (Figure 6a). The increase in side-chain size in both of the aforementioned McjA variants (T-2F and T-2M) must have turned what used to be a snug fit into steric clashes, compromised McjA precursor recognition by McjB, and resulted in diminished MccJ25 production (or a total loss thereof). It may be possible to restore MccJ25 production for these McjA variants by introducing compensatory mutations into McjB to expand the binding pocket, a procedure termed “synthetic rescue”. We hypothesized that when an X-to-Y exchange in the substrate makes it too big to fit in the binding pocket, a concomitant Y-to-X substitution in the binding pocket should alleviate the steric clash to restore substrate/enzyme interaction. Based on this simple design rationale, we generated two pairs of constructs – McjAT–2F/McjBF23T (TF) and McjAT–2M/McjBM108T (TM) (Figure 6b–6e). Even though the TF construct failed to boost MccJ25 production, the TM construct increased MccJ25 production yield by more than 5-fold (Figure 6f and Table 1). These results suggest that the AF2 predicted structure of the ternary complex is a reasonable model to identify individual residues involved in leader peptide recognition.
Discussion
MccJ25 is one of the first lasso peptides discovered.7 While many more examples have been found and studied since, MccJ25 is unique even among this family of natural products. Specifically, the two enzymes that catalyze MccJ25 maturation (McjB and McjC) depend on the presence of each other to function.15 This observation, together with the AF2 predicted structure reported herein, hints at an intriguing concerted biosynthetic process; however, this notion has thus far remained a speculation that awaits further experimental support (Figure 2d). Despite immense interest in MccJ25 over the past three decades,8 structural information on the MccJ25 biosynthetic machinery is still unavailable and molecular details of how the “threaded lasso” configuration is constructed remain elusive. AF2 was used to predict the structure of the McjA/McjB/McjC ternary complex, and our experimental results support all of the predicted structural features we tested, including protein orientation, domain boundaries, and contact between select amino acids.
Our CP McjB variants represent the first attempt at engineering a lasso peptide biosynthetic enzyme in this manner. Interestingly, we identified four lasso peptide B enzymes out of a database of 1,619 (0.2%) that appear to have an N-terminal protease domain (B2) and a C-terminal RRE (B1) (Table S4).26 Sequence alignment and structure prediction by AF2 both supported this domain assignment; i.e., these B enzymes naturally have an “inverse” domain arrangement (Figures S6 and S7). If B1 and B2 started out as separate proteins, based on our CP results and the few examples that exist in nature, there are viable ways to join them, in either order, to create a fused B protein that is functional. However, we are unaware of any evolutionary or genetic mechanisms that would bias the direction of a hypothetical B1 and B2 protein fusion event to such an extreme ratio (99.8% (=1 – 0.2%)). Seeing only very rare cases of “inverse” domain arrangement in nature (0.2%) would seem to suggest that the B1 and B2 proteins of today resulted from an ancient B protein splitting into two during the course of evolution. This notion is an interesting speculation that is worth testing in the future.
The interaction between McjA and McjB warrants further discussion. Large-scale sequence analysis and protein structural studies point to a general mode of leader recognition in RiPP biosynthesis.26 The leader peptide first binds to the RRE, which has a winged helix-turn-helix fold, with high affinity and specificity.17 Central to this interaction is a β-sheet in the RRE with three antiparallel strands (β1–3), and all RREs (15) structurally characterized to date show this feature (Table S1). Based on the AF2 predicted structure, the three strands correspond to residues 5–9 (β1), 12–16 (β2), and 21–25 (β3) of McjB, and the leader peptide aligns itself along the edge of β3 of McjB to form a fourth strand (β4, McjA(−12)to(−10)) to extend the β-sheet (Figure 2b). Approximately 6 to 7 residues in the therbactin and the fusilassin precursors interact with their respective RRE (TbiB1 and TfuB1),18,27 whereas only three McjA residues (−12, −11, and −10) appear to be involved. The latter interaction is likely much weaker in comparison, which, coupled with McjA(−37)to(−13) showing no obvious interaction with McjB in the predicted structure, explains why much of the McjA N-terminal sequence is dispensable (McjA(−37)to(−9)).41
This observation also implies that the few residues immediately upstream of the core peptide (McjA(−8)to(−1)) must play an outsized role in interacting with McjB. We therefore directed our attention to the highly conserved Thr(−2) residue (McjAT–2). Previous studies showed that mutations at this residue negatively impact MccJ25 production; e.g., the McjAT–2M variant yielded less than 1% of MccJ25 compared to the WT. This mutation presumably made the residue at the key “minus-two” position too bulky to fit in the McjB binding pocket. Using the AF2 predicted structure as a guide, our synthetic rescue construct (the introduction of a compensatory McjBM108T mutation) partially restored the McjA/McjB interaction, evidenced by an increase in yield by more than 5-fold. Interestingly, AF2 still predicted the formation of a McjA/McjB/McjC ternary complex for both the T-2F and T-2 M constructs, wherein the Phe and Met residues that replaced Thr showed no obvious clashes with the McjB binding pocket. This result suggests that while AF2 can often predict individual protein structures fairly accurately, it sometimes fails to reflect the consequences of subtle changes, especially when it comes to protein–protein interactions.
The unstructured linker segment (McjB61–80) is part of the McjB/McjC interface (Figure 1a). To construct our CP and split McjB variants, an extra pair of charges is generated in this region: a net positive and a net negative charge associated with the new N- and C-termini, respectively. Since the McjB61–70 segment is in direct contact with McjC and residues 80 and 81 are off to the side, it is not surprising that the McjBCP80/81 and McjBS80/81 variants showed the highest MccJ25 yields among our CP and split McjB constructs, respectively (Table 1). These observations suggest that a proficient complex that catalyzes MccJ25 maturation can still form when McjB is split at residue 80/81. In fact, when McjA/McjB1–80/McjB81–208/McjC were submitted as four separate polypeptides, AF2 predicted a quaternary complex nearly identical to the native McjA/McjB/McjC ternary complex (Figure S8). Future engineering of B enzymes should therefore take this into consideration and avoid amino acid changes that are part of the McjB/McjC interface.
Conclusions
Altogether, the data presented herein suggest that the structure of the McjA/McjB/McjC ternary complex predicted by AF2 is a reasonable model and can serve as a starting point toward understanding the structural basis of MccJ25 biosynthesis. In addition to presenting a structural model of leader peptide interaction with McjB, AF2 also predicted the core peptide to extend into a deep cavity in McjC. As no lasso peptide synthetase has ever been structurally characterized, it is not surprising that AF2 fell short of informing us the exact molecular details of the presumed McjC-stabilized prefolded conformation of the core peptide. Enzymes that catalyze the formation of such a supramolecular structure are currently beyond the reach of computation and artificial intelligence. This final knowledge gap in lasso peptide biosynthesis will have to be filled by experiments and human intelligence.
Acknowledgments
The authors thank the Consortia of Key Technologies at National Taiwan University (NTU) for technical support (MALDI-TOF MS). This work was supported by grants from the National Science and Technology Council, Taiwan (NSTC 112-2113-M-002-005 and NSTC 112-2923-M-002-005-MY3).
Supporting Information Available
The Supporting Information is available free of charge at https://pubs.acs.org/doi/10.1021/acschembio.4c00251.
The authors declare no competing financial interest.
Supplementary Material
References
- Bayro M. J.; Mukhopadhyay J.; Swapna G. V.; Huang J. Y.; Ma L. C.; Sineva E.; Dawson P. E.; Montelione G. T.; Ebright R. H. Structure of antibacterial peptide microcin J25: a 21-residue lariat protoknot. J. Am. Chem. Soc. 2003, 125, 12382–12383. 10.1021/ja036677e. [DOI] [PubMed] [Google Scholar]
- Rosengren K. J.; Clark R. J.; Daly N. L.; Goransson U.; Jones A.; Craik D. J. Microcin J25 has a threaded sidechain-to-backbone ring structure and not a head-to-tail cyclized backbone. J. Am. Chem. Soc. 2003, 125, 12464–12474. 10.1021/ja0367703. [DOI] [PubMed] [Google Scholar]
- Wilson K. A.; Kalkum M.; Ottesen J.; Yuzenkova J.; Chait B. T.; Landick R.; Muir T.; Severinov K.; Darst S. A. Structure of microcin J25, a peptide inhibitor of bacterial RNA polymerase, is a lassoed tail. J. Am. Chem. Soc. 2003, 125, 12475–12483. 10.1021/ja036756q. [DOI] [PubMed] [Google Scholar]
- Blond A.; Cheminant M.; Destoumieux-Garzon D.; Segalas-Milazzo I.; Peduzzi J.; Goulard C.; Rebuffat S. Thermolysin-linearized microcin J25 retains the structured core of the native macrocyclic peptide and displays antimicrobial activity. Eur. J. Biochem. 2002, 269, 6212–6222. 10.1046/j.1432-1033.2002.03340.x. [DOI] [PubMed] [Google Scholar]
- Rosengren K. J.; Blond A.; Afonso C.; Tabet J. C.; Rebuffat S.; Craik D. J. Structure of thermolysin cleaved microcin J25: extreme stability of a two-chain antimicrobial peptide devoid of covalent links. Biochemistry 2004, 43, 4696–4702. 10.1021/bi0361261. [DOI] [PubMed] [Google Scholar]
- Mukhopadhyay J.; Sineva E.; Knight J.; Levy R. M.; Ebright R. H. Antibacterial peptide microcin J25 inhibits transcription by binding within and obstructing the RNA polymerase secondary channel. Mol. Cell 2004, 14, 739–751. 10.1016/j.molcel.2004.06.010. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Salomón R. A.; Farias R. N. Microcin 25, a novel antimicrobial peptide produced by Escherichia coli. J. Bacteriol. 1992, 174, 7428–7435. 10.1128/jb.174.22.7428-7435.1992. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Baquero F.; Beis K.; Craik D. J.; Li Y.; Link A. J.; Rebuffat S.; Salomon R.; Severinov K.; Zirah S.; Hegemann J. D. The pearl jubilee of microcin J25: Thirty years of research on an exceptional lasso peptide. Nat. Prod. Rep. 2024, 41, 469–511. 10.1039/D3NP00046J. [DOI] [PubMed] [Google Scholar]
- Jumper J.; Evans R.; Pritzel A.; Green T.; Figurnov M.; Ronneberger O.; Tunyasuvunakool K.; Bates R.; Zidek A.; Potapenko A.; Bridgland A.; Meyer C.; Kohl S. A. A.; Ballard A. J.; Cowie A.; Romera-Paredes B.; Nikolov S.; Jain R.; Adler J.; Back T.; Petersen S.; Reiman D.; Clancy E.; Zielinski M.; Steinegger M.; Pacholska M.; Berghammer T.; Bodenstein S.; Silver D.; Vinyals O.; Senior A. W.; Kavukcuoglu K.; Kohli P.; Hassabis D. Highly accurate protein structure prediction with AlphaFold. Nature 2021, 596, 583–589. 10.1038/s41586-021-03819-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Maksimov M. O.; Pan S. J.; Link A. J. Lasso peptides: structure, function, biosynthesis, and engineering. Nat. Prod. Rep. 2012, 29, 996–1006. 10.1039/c2np20070h. [DOI] [PubMed] [Google Scholar]
- Hegemann J. D.; Zimmermann M.; Xie X.; Marahiel M. A. Lasso peptides: an intriguing class of bacterial natural products. Acc. Chem. Res. 2015, 48, 1909–1919. 10.1021/acs.accounts.5b00156. [DOI] [PubMed] [Google Scholar]
- Solbiati J. O.; Ciaccio M.; Farias R. N.; Gonzalez-Pastor J. E.; Moreno F.; Salomon R. A. Sequence analysis of the four plasmid genes required to produce the circular peptide antibiotic microcin J25. J. Bacteriol. 1999, 181, 2659–2662. 10.1128/JB.181.8.2659-2662.1999. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Bountra K.; Hagelueken G.; Choudhury H. G.; Corradi V.; El Omari K.; Wagner A.; Mathavan I.; Zirah S.; Yuan Wahlgren W.; Tieleman D. P.; Schiemann O.; Rebuffat S.; Beis K. Structural basis for antibacterial peptide self-immunity by the bacterial ABC transporter McjD. EMBO J. 2017, 36, 3062–3079. 10.15252/embj.201797278. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Duquesne S.; Destoumieux-Garzon D.; Zirah S.; Goulard C.; Peduzzi J.; Rebuffat S. Two enzymes catalyze the maturation of a lasso peptide in Escherichia coli. Chem. Biol. 2007, 14, 793–803. 10.1016/j.chembiol.2007.06.004. [DOI] [PubMed] [Google Scholar]
- Yan K. P.; Li Y.; Zirah S.; Goulard C.; Knappe T. A.; Marahiel M. A.; Rebuffat S. Dissecting the maturation steps of the lasso peptide microcin J25 in vitro. ChemBioChem 2012, 13, 1046–1052. 10.1002/cbic.201200016. [DOI] [PubMed] [Google Scholar]
- Yang X.; van der Donk W. A. Ribosomally synthesized and post-translationally modified peptide natural products: new insights into the role of leader and core peptides during biosynthesis. Chem. - Eur. J. 2013, 19, 7662–7677. 10.1002/chem.201300401. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Burkhart B. J.; Hudson G. A.; Dunbar K. L.; Mitchell D. A. A prevalent peptide-binding domain guides ribosomal natural product biosynthesis. Nat. Chem. Biol. 2015, 11, 564–570. 10.1038/nchembio.1856. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Sumida T.; Dubiley S.; Wilcox B.; Severinov K.; Tagami S. Structural basis of leader peptide recognition in lasso peptide biosynthesis pathway. ACS Chem. Biol. 2019, 14, 1619–1627. 10.1021/acschembio.9b00348. [DOI] [PubMed] [Google Scholar]
- Zhu S.; Fage C. D.; Hegemann J. D.; Mielcarek A.; Yan D.; Linne U.; Marahiel M. A. The B1 protein guides the biosynthesis of a lasso peptide. Sci. Rep. 2016, 6, 35604 10.1038/srep35604. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Cheung W. L.; Chen M. Y.; Maksimov M. O.; Link A. J. Lasso peptide biosynthetic protein LarB1 binds both leader and core peptide regions of the precursor protein LarA. ACS Cent. Sci. 2016, 2, 702–709. 10.1021/acscentsci.6b00184. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Hegemann J. D.; Schwalen C. J.; Mitchell D. A.; van der Donk W. A. Elucidation of the roles of conserved residues in the biosynthesis of the lasso peptide paeninodin. Chem. Commun. 2018, 54, 9007–9010. 10.1039/C8CC04411B. [DOI] [PMC free article] [PubMed] [Google Scholar]
- DiCaprio A. J.; Firouzbakht A.; Hudson G. A.; Mitchell D. A. Enzymatic reconstitution and biosynthetic investigation of the lasso peptide fusilassin. J. Am. Chem. Soc. 2019, 141, 290–297. 10.1021/jacs.8b09928. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Koos J. D.; Link A. J. Heterologous and in vitro reconstitution of fuscanodin, a lasso peptide from Thermobifida fusca. J. Am. Chem. Soc. 2019, 141, 928–935. 10.1021/jacs.8b10724. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Si Y.; Kretsch A. M.; Daigh L. M.; Burk M. J.; Mitchell D. A. Cell-free biosynthesis to evaluate lasso peptide formation and enzyme-substrate tolerance. J. Am. Chem. Soc. 2021, 143, 5917–5927. 10.1021/jacs.1c01452. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Alfi A.; Popov A.; Kumar A.; Zhang K. Y. J.; Dubiley S.; Severinov K.; Tagami S. Cell-free mutant analysis combined with structure prediction of a lasso peptide biosynthetic protease B2. ACS Synth. Biol. 2022, 11, 2022–2028. 10.1021/acssynbio.2c00176. [DOI] [PubMed] [Google Scholar]
- Kretsch A. M.; Gadgil M. G.; DiCaprio A. J.; Barrett S. E.; Kille B. L.; Si Y.; Zhu L.; Mitchell D. A. Peptidase activation by a leader peptide-bound ripp recognition element. Biochemistry 2023, 62, 956–967. 10.1021/acs.biochem.2c00700. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Chekan J. R.; Ongpipattanakul C.; Nair S. K. Steric complementarity directs sequence promiscuous leader binding in RiPP biosynthesis. Proc. Natl. Acad. Sci. U.S.A. 2019, 116, 24049–24055. 10.1073/pnas.1908364116. [DOI] [PMC free article] [PubMed] [Google Scholar]
- AlphaFold Protein Structure Database | Frequently asked questions 2024. https://www.alphafold.ebi.ac.uk/faq#faq-8. (accessed March 6).
- Assrir N.; Pavelkova A.; Dazzoni R.; Ducasse R.; Morellet N.; Guittet E.; Rebuffat S.; Zirah S.; Li Y.; Lescop E. Initial molecular recognition steps of McjA precursor during microcin J25 lasso peptide maturation. ChemBioChem 2016, 17, 1851–1858. 10.1002/cbic.201600306. [DOI] [PubMed] [Google Scholar]
- Scofield M. A.; Lewis W. S.; Schuster S. M. Nucleotide sequence of Escherichia coli AsnB and deduced amino acid sequence of asparagine synthetase B. J. Biol. Chem. 1990, 265, 12895–12902. 10.1016/S0021-9258(19)38244-4. [DOI] [PubMed] [Google Scholar]
- Larsen T. M.; Boehlein S. K.; Schuster S. M.; Richards N. G.; Thoden J. B.; Holden H. M.; Rayment I. Three-dimensional structure of Escherichia coli asparagine synthetase B: a short journey from substrate to product. Biochemistry 1999, 38, 16146–16157. 10.1021/bi9915768. [DOI] [PubMed] [Google Scholar]
- Krissinel E.; Henrick K. Inference of macromolecular assemblies from crystalline state. J. Mol. Biol. 2007, 372, 774–797. 10.1016/j.jmb.2007.05.022. [DOI] [PubMed] [Google Scholar]
- Ferguson A. L.; Zhang S.; Dikiy I.; Panagiotopoulos A. Z.; Debenedetti P. G.; James Link A. An experimental and computational investigation of spontaneous lasso formation in microcin J25. Biophys. J. 2010, 99, 3056–3065. 10.1016/j.bpj.2010.08.073. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Solbiati J. O.; Ciaccio M.; Farias R. N.; Salomon R. A. Genetic analysis of plasmid determinants for microcin J25 production and immunity. J. Bacteriol. 1996, 178, 3661–3663. 10.1128/jb.178.12.3661-3663.1996. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Hegemann J. D.; De Simone M.; Zimmermann M.; Knappe T. A.; Xie X.; Di Leva F. S.; Marinelli L.; Novellino E.; Zahler S.; Kessler H.; Marahiel M. A. Rational improvement of the affinity and selectivity of integrin binding of grafted lasso peptides. J. Med. Chem. 2014, 57, 5829–5834. 10.1021/jm5004478. [DOI] [PubMed] [Google Scholar]
- Chen P. H.; Sung L. K.; Hegemann J. D.; Chu J. Disrupting transcription and folate biosynthesis leads to synergistic suppression of Escherichia coli growth. ChemMedChem 2022, 17, e202200075 10.1002/cmdc.202200075. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Cunningham B. A.; Hemperly J. J.; Hopp T. P.; Edelman G. M. Favin versus concanavalin A: Circularly permuted amino acid sequences. Proc. Natl. Acad. Sci. U.S.A. 1979, 76, 3218–3222. 10.1073/pnas.76.7.3218. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Yu Y.; Lutz S. Circular permutation: a different way to engineer enzyme structure and function. Trends Biotechnol. 2011, 29, 18–25. 10.1016/j.tibtech.2010.10.004. [DOI] [PubMed] [Google Scholar]
- Lo W. C.; Lee C. C.; Lee C. Y.; Lyu P. C. CPDB: a database of circular permutation in proteins. Nucleic Acids Res. 2009, 37, D328–32. 10.1093/nar/gkn679. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Hegemann J. D.; Zimmermann M.; Zhu S.; Klug D.; Marahiel M. A. Lasso peptides from Proteobacteria: Genome mining employing heterologous expression and mass spectrometry. Biopolymers 2013, 100, 527–542. 10.1002/bip.22326. [DOI] [PubMed] [Google Scholar]
- Cheung W. L.; Pan S. J.; Link A. J. Much of the microcin J25 leader peptide is dispensable. J. Am. Chem. Soc. 2010, 132, 2514–2515. 10.1021/ja910191u. [DOI] [PubMed] [Google Scholar]
- Pan S. J.; Rajniak J.; Maksimov M. O.; Link A. J. The role of a conserved threonine residue in the leader peptide of lasso peptide precursors. Chem. Commun. 2012, 48, 1880–1882. 10.1039/c2cc17211a. [DOI] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.