Skip to main content
NIHPA Author Manuscripts logoLink to NIHPA Author Manuscripts
. Author manuscript; available in PMC: 2017 Mar 1.
Published in final edited form as: J Struct Biol. 2015 Dec 23;193(3):196–205. doi: 10.1016/j.jsb.2015.12.011

The LINKS motif zippers trans-acyltransferase polyketide synthase assembly lines into a biosynthetic megacomplex

Darren C Gay 1,*, Drew T Wagner 1,*, Jessica L Meinke 1, Charles E Zogzas 1, Glen R Gay 1, Adrian T Keatinge-Clay 1
PMCID: PMC4738151  NIHMSID: NIHMS753382  PMID: 26724270

Abstract

Polyketides such as the clinically-valuable antibacterial agent mupirocin are constructed by architecturally-sophisticated assembly lines known as trans-acyltransferase polyketide synthases. Organelle-sized megacomplexes composed of several copies of trans-acyltransferase polyketide synthase assembly lines have been observed by others through transmission electron microscopy to be located at the Bacillus subtilis plasma membrane, where the synthesis and export of the antibacterial polyketide bacillaene takes place. In this work we analyze ten crystal structures of trans-acyltransferase polyketide synthases ketosynthase domains, seven of which are reported here for the first time, to characterize a motif capable of zippering assembly lines into a megacomplex. While each of the three-helix LINKS (Laterally-INteracting Ketosynthase Sequence) motifs is observed to similarly dock with a spatially-reversed copy of itself through hydrophobic and ionic interactions, the amino acid sequences of this motif are not conserved. Such a code is appropriate for mediating homotypic contacts between assembly lines to ensure the ordered self-assembly of a noncovalent, yet tightly-knit, enzymatic network. LINKS-mediated lateral interactions would also have the effect of bolstering the vertical association of the polypeptides that comprise a polyketide synthase assembly line.

INTRODUCTION

Complex polyketides are a class of secondary metabolites that provide a competitive advantage for a wide spectrum of bacteria and fungi, often by targeting vital processes within competing organisms1,2. The polyketide synthases (PKSs) responsible for their production are inherently modular, enabling PKS-harboring organisms to explore vast regions of chemical space through the reorganization of enzymatic domains. While the clinical value of the antibiotics erythromycin and mupirocin allude to the potential of polyketides in drug discovery, harnessing the biosynthetic capabilities of PKSs has been hindered by the complexity of their higher-order architecture35.

An enormous molecular assembly-line of the subcellular milieu, a PKS typically contains upwards of 50 independently-folded domains, each responsible for one step in the biosynthesis of a polyketide. The enzymatic modules of PKSs work together in assembly-line fashion to extend and process a growing polyketide chain6, each module containing a set of domains responsible for the incorporation and tailoring of a selected α-carboxylated extender unit. The domains employed by Type I PKSs are homologous to those responsible for fatty acid biosynthesis7, including acyltransferases (ATs) that select extender units, ketosynthases (KSs) that condense extender units with growing polyketides in a decarboxylative thio-Claisen condensation, ketoreductases (KRs) that stereospecifically reduce newly incorporated β-keto groups, dehydratases (DHs) that eliminate water to form α,β-olefins, enoylreductases (ERs) that stereospecifically reduce those olefins, and acyl carrier proteins (ACPs) that shuttle polyketide intermediates between the domains within a module as well as the following module. Thus, modules containing different sets of KR, DH, or ER domains yield different functional groups and stereochemistries. A thioesterase (TE) downstream of the final module commonly off-loads the polyketide product via hydrolysis or cyclization.

Modular PKSs are divided into two architecturally-distinct classes: cis-AT and trans-AT2. While cis-AT PKSs harbor ATs that are integrated into the multi-domain polypeptide, trans-AT systems rely on discretely-encoded AT domains that noncovalently interact with the megasynthase (Figure 1). Each of the independently-folded domains from cis-AT PKSs has been structurally characterized4, and although structural information has recently become available for domains from trans-AT PKSs, if and how the eponymous trans-AT domains dock to the megasynthase to charge ACPs with extender units remains to be determined810. A conserved ~100-residue region C-terminal to KS was hypothesized to facilitate the docking of trans-ATs to megasynthases and was named the AT-docking domain, or ATd11. The KS/AT adapters, observed in the crystal structures of KS+AT didomains from the erythromycin PKS and the mammalian fatty acid synthase (FAS), are homologous to ATd on a sequence level and seem to validate this functional assignment12,13. However, in vitro experiments examining extender unit transfer from trans-ATs to ACPs showed that the inclusion of ATd in constructs decreased the catalytic efficiency of acyl transfer compared to constructs that did not include ATd14.

Figure 1. Trans-AT vs. cis-AT PKS architecture.

Figure 1

The modular assembly for the archetypal trans-AT (PksX) and cis-AT (6-dEBS) PKSs are shown, responsible for the biosynthesis of bacillaene and erythromycin, respectively. For each system, the fourth PKS module has been magnified to reveal the organization of domains. Although much of the higher-order architecture remains unknown, the schematics represent hypothetical models based on available crystal structures (Ery(KS+AT)3 (PDB: 2QO3), EryDH4 (PDB: 3EL6), Spn(KR+ER)2 (PDB: 3SLK), PksKS2 (PDB: 4NA1), and Rhi(KS+B)11 (PDB: 4KC5). The presence of a LINKS contact is shown as lateral red extensions for each of the modules in PksX that harbor this region. N- and C-terminal docking domains for 6-dEBS are shown as vertical blue extensions. ACP, acyl carrier protein; AL, acyl-CoA ligase; AT, acyltransferase; DH, dehydratase; ECH, enoyl-CoA hydratase; ER, enoylreductase; FSD, flanking subdomain; HCS, HMG-CoA synthase; HYD, hydrolase; KS, ketosynthase; KR, ketoreductase; LD, loading didomain; M, module; NRPS, non-ribosomal peptide synthetase; P450, cytochrome P450; TE, thioesterase.

The first-discovered, and prototypical, trans-AT PKS is encoded within the Bacillus subtilis and Bacillus amyloliquefaciens genomes (pksX and bae, respectively)2. This hybrid PKS/non-ribosomal peptide synthetase (NRPS) produces bacillaene, a linear polyene diamide that interferes with prokaryotic protein synthesis (Figure 1)15. Transmission electron micrographs of B. subtilis cells have revealed dense masses associated with the plasma membrane, each comprised of several copies of the PksX/Bae megasynthase16,17. This organelle-sized structure, with an approximate mass of 10–100 MDa, includes the five multidomain polypeptides PksJ, PksL, PksM, PksN, and PksR. At the time of this discovery megacomplex formation was a complete mystery since trans-AT PKS polypeptides lack the N- and C-terminal docking domains that linearly organize cis-AT PKSs and no PKS architectural scaffolding proteins were known; however, ATd was hypothesized to help stabilize the observed structure (Figure 1)18. Membrane localization of PKS enzymes has been observed for the mycolactone and actinorhodin pathways, and the biosynthetic machinery for the siderophore pyoverdine is known to accrete into a dense, membrane-bound mass termed the “siderosome”; the co-localization of large enzymatic complexes at the bacterial plasma membrane is emerging as a theme in the biosynthesis of natural products1923.

The first ATd to be structurally observed, from a didomain composed of a KS and a β-branching enzyme (KS+B) from the rhizoxin PKS, revealed that ATd is structurally related to the KS/AT adapter of cis-AT PKSs8. Where the AT domain branches from the KS/AT adapter in cis-AT PKSs, only a short loop is present. The second structure of an ATd, from the second KS domain of the B. subtilis PksX synthase (PksKS2), showed two additional α-helices where the AT domain branches from the KS/AT adapter in cis-AT PKSs9. Modeling a trans-AT docked to the ATd based on the location of AT within cis-AT PKSs indicated that significant restructuring of these helices would be required for docking. Since neither functional nor structural studies supported the role of ATd as a trans-AT docking site, it was renamed the “flanking subdomain.9 Comparisons of solved trans-AT KS domains have not yet revealed its role.24

Here, we present seven additional crystal structures of trans-AT KS domains that provide evidence that the flanking subdomain plays a role in the assembly of PKS megacomplexes. At first considered to be a serendipitous crystal contact, the clearly defined self-association of flanking subdomains published here and elsewhere appears in ten crystal structures (five different KS domains from three different microbial species harboring trans-AT PKSs; seven structures reported here, two structures reported in reference 9, and one reported in reference 24). We introduce the term Laterally-INteracting Ketosynthase Sequences (LINKS) to distinguish the three helices that form the observed interaction from the remainder of the flanking subdomain. Bioinformatic analysis of LINKS regions reveals that each is unique to the KS domain possessing them and indicates that PKS assembly lines that form megacomplexes are zippered together by LINKS-containing KSs that possess higher affinity for copies of themselves than other LINKS-containing KSs within the assembly line. These findings suggest an unprecedented mechanism for the assembly of several copies of a PKS megasynthase into a megacomplex.

RESULTS

Structure of the LINKS interaction

Three of the trans-AT KS domains deposited into the Protein Data Bank (PDB) prior to this work contain the LINKS region: one from the bacillaene PKS (PksKS2, PDBs 4NA1 and 4NA2) and two from the migrastatin PKS (MgsKS5 and MgsKS7, PDBs 4TL2 and 4TKT, respectively). The LINKS interaction between homodimeric KSs, not apparent from the asymmetric units of wild-type PksKS2 (PDB 4NA1) and MgsKS5 (PDB 4TL2), can be observed by generating the crystallographic symmetry mates. This is opposite in crystals of PksKS2(Cys176Ser) (PDB: 4NA2), in which the asymmetric unit shows two KS monomers associated via the LINKS interaction and the KS dimer is visualized by generating crystallographic symmetry mates. Here we report seven additional crystal structures in which LINKS interactions are observed, including one of PksKS2 in its monomeric form, one of PksKS6, two of PksKS6(Cys169Ser), one of Pks(ACP5+KS6), one of BaeKS1, and one of BaeKS5 (Table 1). To date, only one crystal structure of a LINKS-containing trans-AT KS has been reported that does not show this interaction (MgsKS7, PDB: 4TKT).

Table 1.

Crystallographic Data and Refinement Statistics

PksKS2 (4NA1)
(monomeric)
PksKS6 (5E5N)
(Cys169Ser)
PksKS6
(5E6K)(Cys169Ser)
PksKS6
(5ERF)
Pks(ACP5+KS6)
(5ENY)
BaeKS1
(5ELP)
BaeKS5
(5ERB)
Data Collection

Wavelength (Å) 0.9999 1.0332 0.9765 0.9765 1.0332 1.0332 1.0332
Space group C2 P 21 P 21 P 1 P 1 P 1 P 21
Cell dimensions, a, b, c (Å); α, β, γ (°) 197.1, 75.1, 134.1; 90.0, 132.9, 90.0 89.4, 108.2, 151.2; 90.0, 96.4, 90.0 76.7, 105.5, 89.3; 90.0, 96.8, 90.0 69.0, 77.3, 85.3; 77.7, 70.3, 64.2 63.1, 112.7, 211.4; 105.0, 90.1, 106.3 61.2, 100.0, 100.8; 91.9, 88.2, 96.0 70.8, 319.1, 103.5; 90.0, 110.3, 90.0
Resolution (Å) 47.25-3.02 150.89 -2.00 36.55 - 2.16 39.73 - 3.10 39.77 - 4.00 99.35 - 2.93 97.07 - 4.20
Rmerge 0.048 (0.663) 0.120 (0.570) 0.092 (0.481) 0.058 (0.359) 0.102 (0.373) 0.148 (0.641) 0.114 (0.483)
I/σ(I) 32.2 (1.8) 28.1 (2.5) 12.3 (2.0) 13.3 (1.7) 6.1 (2.7) 15.7 (1.5) 14.2 (2.8)
Completeness (%) 99.7 (98.4) 99.2 (96.0) 99.2 (93.5) 94.1 (75.6) 98.7 (99.0) 94.9 (89.75) 79.9 (59.3)
Redundancy 3.7 (3.8) 2.1 (2.1) 1.9 (1.8) 1.9 (1.9) 3.3 (3.5) 1.1 (1.1) 3.2 (2.8)
Wilson B value (Å2) 119.7 35.3 42.5 77.3 157.2 62.4 196.4

Refinement

Resolution (Å) 47.25-3.02 150.89 -2.00 36.55 - 2.16 39.73 - 3.10 39.77 - 4.00 99.35 - 2.93 97.07 - 4.20
No. of reflections 26825 (1879) 183336 (13049) 71285 (4947) 24128 (1403) 39874 (2246) 46279 (3209) 23712 (1314)
Rwork/Rfree 0.223/0.268 0.201/0.231 0.224/0.258 0.218/0.262 0.322/0.344 0.262/0.288 0.288/0.341
No. of atoms
 Protein 8938 17314 8463 8944 34416 16904 18315
 Water - 1358 88 - - - -
Average B factors (Å2)
 Protein 119.7 44.3 42.7 77.8 157.3 63.1 196.4
 Water - 27 42.4 - - - -
RMS deviations
 Bond lengths (Å) 0.0018 0.0186 0.0083 0.0180 0.0013 0.0127 0.0130
 Bond angles (°) 1.084 1.352 1.342 1.390 0.716 1.937 2.651
Ramachandran Statistics (%)
 Preferred Regions 91.8 97.3 96.5 94.0 95.8 92.6 93.3
 Allowed Regions 8.2 2.7 3.5 5.9 4.2 6.7 6.6
 Outliers 0.0 0.0 0.0 0.1 0.0 0.7 0.1

The LINKS motif is comprised of ~40 residues (α17–α19) that project from the flanking subdomain as a triangular flap (see reference 9 for secondary structure assignments) (Figure 2). While α19 is structurally equivalent to a helix in the KS/AT adapter of cis-AT PKSs and the mammalian FAS, α17 and α18 take the place of the AT domain in these synthases. Helices α17 and α18 do not make interactions with neighboring secondary structural elements and thus are not rigidly oriented with respect to the flanking subdomain. In the LINKS interaction, α18 of one flanking subdomain makes contact with a spatially-reversed α18 of the neighboring flanking subdomain, and α17 of one flanking subdomain makes contact with α19 of the neighboring flanking subdomain (a dynamic exploration of the interaction is provided in Supplementary Movie 1).

Figure 2. The LINKS interaction.

Figure 2

The LINKS interaction that forms between trans-AT KS domains is mediated by three α-helices (α17-α19) that bind to a spatially-reversed copy of the same structure. (a) A model for the LINKS interaction between two KS homodimers is magnified, revealing the orientation of the LINKS helices. The angle above the structures indicates the relative angle between KS monomers forming the interaction (Figure S2). Each individual LINKS sequence is colored blue at the N-terminal end of α17 and red at the C-terminal end of α19, and the N-terminal end of α17 always forms favorable interactions with the C-terminal end of α19 from the neighboring KS (α19′). The coloring scheme used for the cartoon model has been repeated in each of the displayed structures to highlight the conservation of the LINKS interface. The structure shown for PksKS2 (monomeric form) reveals two KS monomers that do not form the traditional homodimer interaction; however, the LINKS interaction is crystallographically maintained. The relatively high thermal factors for the LINKS helices in BaeKS1 did not permit the complete construction of α17–α19 for each of the KS monomers, but the relative orientation of the KS bodies indicates that the LINKS interaction is conserved. (b) The three helices forming the LINKS interaction for PksKS2 are shown in cartoon format (green), and the remainder of this PksKS2 monomer has been hidden for clarity. The neighboring KS forming the complementary LINKS interaction is represented with a transparent surface (cyan). The angle of the image is set from the interior of the flanking subdomain, to reveal the collection of centrally-located hydrophobic residues that would be surface-exposed in the absence of the LINKS interaction. (c) A 90° rotation of the viewing angle shown in panel B reveals the series of ionic interactions formed at the poles of α18. The aspartate and glutamate residues at the N-terminal end of α18 form well-defined ionic interactions with the lysine residues at the C-terminal end of α18 from the neighboring structure. Met559 of the blue monomer can be observed in the center of the image, extending into the hydrophobic cavity of the green monomer (Figure S1a). (d) The three LINKS helices for PksKS2 have been modeled linearly to reduce the complexity of the image. LINKS contacts between residues ranging from 2.0 – 3.0 Å are shown as black dots, and those ranging from 3.1 – 4.0 Å are shown as grey dots. The corresponding sequence for the region is also shown, and residues represented by grey letters have been hidden from the cartoon and do not directly contribute to the LINKS interaction.

The observed LINKS interactions form through the burial of hydrophobic residues and exhibit a high degree of shape complementarity (~700 Å2). In the LINKS motif of PksKS2 (the structure possessing the highest-resolution and lowest B-factors for the LINKS region) these hydrophobic residues include I552, F553, M559, L563, W566, and L578 (Figures 2b–d, and Table S1). The LINKS interactions observed from bacillaene synthases (PksKS2, PksKS6, BaeKS1, and BaeKS5) show contact between clusters of charged residues that cap the poles of α18. The N-terminal end of this helix contains ~3 negatively-charged residues, and the C-terminal end contains ~3 positively-charged residues. Two copies of α18 are oriented in an antiparallel fashion, and the negatively-charged N-terminal end of α18 from one flanking subdomain forms salt bridges with the positively-charged C-terminal end of α18 from the neighboring flanking subdomain (Figure S1). Collectively, the burial of exposed hydrophobic residues in the center of each LINKS triangular flap and the clustering of complementary ionic interactions at both termini of α18 form a lock-and-key fit uniquely adapted to each LINKS interaction (Figure S3).

The LINKS motifs connect neighboring KS dimers in crystals of MgsKS5 (PDB: 4TL2); however, instead of the twofold axes of these KS dimers being parallel as in each of the other observed LINKS-mediated associations of KS dimers, they are nearly orthogonal (Figure 2a). The MgsKS5 LINKS interaction is distorted relative to all the other crystallographically-observed LINKS interactions - α17 and α18 are skewed, and several ionic interactions potentially made in vivo are not formed (e.g., Asp1121/Arg1133 and Glu1122/Arg1129) (Figure S2). The only intact salt bridge, formed by Lys1120 and Asp1138, may provide an example of covariation within the LINKS motif, as both of these residues generally carry the opposite charge in sequence alignments (Figure 3 and Table S2).

Figure 3. Sequence alignment of the LINKS region.

Figure 3

Over 100 trans-AT KS sequences were aligned, and a subset of the alignment is shown. Residues in the alignment have been colored according to the following scheme: red for acidic residues, blue for basic residues, and yellow for hydrophobic residues. Helices α17–α19 (grey) delineate the boundaries of the LINKS region. If a KS in the alignment has been structurally characterized, the associated PDB identifier is shown in parentheses. Although strict conservation of the LINKS region is poor, the conservation of LINKS attributes are generally maintained (i.e., charged poles of α18 flanked by hydrophobic residues). For those KSs that do not contain LINKS (e.g., RhiKS11 and OzmKS9, bottom of alignment), the available structures do not reveal any crystallographic homotypic interactions mediated by this region. The only structure available that contains LINKS but does not exhibit the crystallographic interaction observed in all other trans-AT KS structures is of MgsKS7. The structure of MgsKS5 reveals the LINKS interaction, yet the LINKS attributes differ from what has been observed in other structures. A seemingly anomalous basic residue at the N-terminal end of α18 (Lys1120) forms an ionic interaction with an acidic residue at the N-terminal end of α19 (Asp1138) that is almost invariably basic in the other sequences (both residue positions are marked with asterisks). Accession numbers for the sequences are available in the supporting information (Table S4).

The seven crystal structures reported here show the LINKS interaction in seven different contexts. In one crystal form, PksKS2 monomers that do not make the traditional KS homodimer interaction were observed. Remarkably, even though ~5300 Å2 is not buried through the formation of the traditional KS homodimer, the LINKS interaction is maintained (Figure 2a). The construct Pks(ACP5+KS6), generated to investigate the interaction between KS and ACP, also yielded crystals in which the LINKS interaction was observed. Although the crystals could not be optimized to diffract beyond 4.0-Å resolution and the ACP domain could not be located in the electron density map, the LINKS interaction was observed to be formed by each of the eight KS monomers in the asymmetric unit (Figure S4). In addition to the 2.0 Å-resolution structure of PksKS6(Cys169Ser), a construct generated to visualize bound intermediates9, two additional crystal forms of PksKS6 were observed that contain the LINKS interaction (wild-type PksKS6, 3.10-Å resolution; PksKS6(Cys169Ser), 2.16-Å resolution) (Table S1). Two crystal structures of KSs from the B. amyloliquefaciens bacillaene PKS, BaeKS1 (2.93 Å) and BaeKS5 (4.20 Å), were also determined. While locally-elevated temperature factors prevented the complete modeling of α17–α19 in BaeKS1 (Figure S5), the relative positioning of KSs indicates that each of the flanking subdomains form LINKS interactions.

Bioinformatic analysis of LINKS

The lack of sequence conservation of α17–α19 is a hallmark of the LINKS motif. While the LINKS motifs shown in Figure 3 are each quite hydrophobic and contain charged residues at the poles of α18, individual LINKS residues are highly variable (Figures 3 and S6). To further investigate this variability, the sequence conservation of LINKS-containing KS domains was mapped onto a model of a trans-AT KS structure (Figure 4). An alignment of 152 trans-AT KS sequences with clear LINKS regions was generated, and the degree of conservation of each residue was assessed. A moving average calculation was used to determine the mean conservation for short stretches of sequence, such that the value for the conservation of residue n is equal to the average value of conservation for the 11 residues ranging from n−5 to n+5. A template KS structure was then colored based on this value: red for 0–25%, yellow for 25–50%, green for 50–75%, and blue for 75–100%. While the active site region of KS appears blue and the remainder of the KS body varies between green and yellow, the longest stretch of red (46 residues) is the LINKS region. The next longest stretch of red sequence is only 17 residues and is located on the face of KS that would be adjacent to processing domains such as DH, KR, and MT. Classes of LINKS motifs other than the one shown in the sequence alignment of Figure 3 likely exist; several LINKS motifs that did not align well with those in Figure 3 are displayed in Figure S6. Due to the highly variable nature of the LINKS region, it is currently unclear whether these sequences self-associate in the same manner as the more canonical LINKS sequences seen in Figure 3.

Figure 4. Simple moving average calculation of sequence conservation.

Figure 4

To determine which regions of trans-AT KSs are the most susceptible to sequence variation, a simple moving average (SMA) calculation was performed based on sequence conservation and residues were colored according to the corresponding quartiles. The five regions that received the highest SMA conservation scores all map to the active site of the KS body (blue), while the lowest scoring region corresponds to the LINKS helices (red). A representative KS structure is shown colored accordingly. N- and C-termini are depicted by cyan and red spheres, respectively, and residue numbering is based on the structure of PksKS2.

LINKS interactions are homotypic

With the exception of MgsKS5, the relative orientation of trans-AT KS dimers connected through a LINKS interaction is essentially identical for all of the crystal structures available to date (Figure 2a). We investigated whether heterotypic LINKS interactions were complementary between KSs from different modules. The six LINKS helices (three helices from each KS monomer) were isolated from the observed LINKS interactions (using the structure with the best-resolved LINKS motif for each KS), each of these were superposed, and new coordinate files were created for the heterodimeric LINKS interactions (i.e., PksKS2 with BaeKS1, PksKS6, or MgsKS5). The simulated interactions were investigated to assess the interfacial contacts. In general, heterotypic LINKS interactions were significantly less favorable than homotypic LINKS interactions, displaying much worse shape complementarity with fewer ionic interactions (Figure S7).

DISCUSSION

The docking site of trans-ATs had been hypothesized to be generated by an ~100-residue subdomain, formerly termed ATd and now referred to as the “flanking subdomain”, that is associated with the majority of KSs within trans-AT PKSs. That an AT domain is integrated into the equivalent subdomain of the mammalian FAS and cis-AT PKSs appears to support this role; however, this subdomain is common to each of these synthases and was most likely present within an ancestral PKS before cis- and trans-AT PKSs diverged25. Thus, the docking site to which trans-ATs dock in trans-AT PKSs should not be expected to be equivalent to where ATs are integrated in cis-AT PKSs. A ~40-residue LINKS motif is usually found in trans-AT PKSs where the AT domain is integrated in cis-AT PKSs. In 10 out of 11 crystal structures of constructs containing the LINKS motif it is found mediating a LINKS- interaction, and in 8 of the 10 structures it associates neighboring KS dimers in a parallel orientation (the exceptions being MgsKS5 and the monomeric form of PksKS2). We propose that the LINKS motif makes lateral interactions that help construct megacomplexes like those observed in B. subtilis cells (Figure 5; based on the measured dimensions, one bacillaene megacomplex is comprised of 10–20 homodimeric bacillaene PKS assembly lines).

Figure 5. Vertical and lateral interactions stabilize the PKS megacomplex.

Figure 5

The electron micrographs observed by Straight et al. revealed a dense mass associated with the membrane of B. subtilis cells, identified to consist of numerous copies of the PksX megasynthase17. An image from that work is reprinted here, revealing that the observed superstructure measures approximately 150 × 150 nm. The arrow points to the cell membrane, and the arrowhead points to gold nanoparticles engineered to target PksX proteins. For visualizing how this mass of proteins may self-associate, a diagram shows the assembly line subunits (PksJ, PksL, PksM, PksN, and PksR) interacting vertically through domain-domain interactions, and laterally through the LINKS network. The estimations for megacomplex dimensions that flank the cartoon are calculated based on available crystal structures of individual PKS domains. In the vertical dimension, 175 nm is represented by 15 PKS modules and 2.5 NRPS modules (10 nm each). In the lateral dimension, 160 nm is estimated by the total length of 15 KS homodimers that polymerize through LINKS interactions.

The LINKS hypothesis excludes the site that AT is observed in cis-AT PKSs as the docking site for a trans-AT. While trans-ATs could dock with another region of the flanking subdomain or the KS body, they may only interact with the ACP domains to which they transfer extender units27. Studies have shown that PksE, a trans-AT in the PksX bacillaene pathway, is present in concentrations 10–100 fold greater than the assembly line polypeptides17. Additionally, superstoichiometric trans-AT concentrations have a large effect on the rate of polyketide production by the virginiamycin trans-AT PKS27. If trans-ATs were stably associated with each trans-AT PKS module, increased trans-AT expression would have little effect. Several condensation-incompetent trans-AT KSs harbor flanking subdomains yet have no need to interact with a trans-AT (e.g., BTKS10, ChiKS18, DszKS10, and OnnKS4). Similarly, several trans-AT KSs that catalyze polyketide chain elongation do not contain a flanking subdomain, yet the corresponding ACPs from these modules must interact with a trans-AT (e.g., ChiKS16, LnmKS3, MlnKS8, and DifKS10). Studies have shown that trans-ATs are capable of charging ACP domains in the absence of a KS or flanking subdomain and that a trans-AT from the kirromycin PKS (KirCII) is quite specific for its cognate ACP14,26,2830. This degree of trans-AT specificity for an ACP is surprising if the flanking subdomain or KS body is selective for docking with a particular trans-AT.

The structure of the homodimeric KS+B didomain from the rhizoxin trans-AT PKS shows that the longest dimension of the B dimer is rotated 90° relative to the longest dimension of the KS dimer8. This orientation, also anticipated for the trans-AT KS+DH didomain based on the structural homology of the B and DH domains, would be compatible with the higher-order architecture of a PKS megacomplex formed through LINKS interactions (Figure 5). While several contacts between the KS and B dimers from a trans-AT PKS fix their relative orientation, the KS and DH dimers of the mammalian FAS rotate freely relative to one another13,32. The crystal structure of the mammalian FAS shows no contact between the two dimers (Figure S8), and cryo-electron microscopy studies also report flexibility between the KS and DH dimers.

Even if LINKS motifs do not make high affinity interactions, significant avidity could be generated through the several LINKS motifs interacting along the length of the synthase (Figure S9). Within the megacomplex, the lateral LINKS interactions would have the effect of bolstering vertical interactions, thought to be mediated by the N- and C-terminal regions of trans-AT PKS polypeptides. The flexibility of the LINKS interaction, illustrated by the MgsKS5 crystal structure, could aid megacomplex formation and maintenance. Many trans-AT PKS assembly lines harbor several LINKS motifs (e.g., the bacillaene PKS contains 12, the chivosazol PKS contains 9, the mupirocin PKS contains 7), such that the dissociation of a single LINKS interaction would have little effect on the stability of the entire megacomplex. LINKS interactions may be difficult to biophysically characterize using methods other than x-ray crystallography. If the dissociation constant for an individual LINKS interaction is in the millimolar range, it may not be observable through techniques such as small-angle x-ray scattering (SAXS) or analytical ultracentrifugation32. To accurately determine the dissociation constant of the LINKS interaction, the natural KS dimerization interface may need to be disrupted by site-directed mutagenesis such that only the LINKS-mediated association of KS monomers is measured.

Several structures of trans-AT PKS KS domains have been deposited in the PDB that do not contain a LINKS motif, including the KS+B didomain from the rhizoxin PKS8. However, only one reported crystal structure of a LINKS-containing KS does not display the LINKS interaction (MgsKS7). Favorable crystal contacts made elsewhere may preclude this interaction in the same manner that LINKS-mediated crystal contacts within the monomeric crystal structure of PksKS2 are favored over natural homodimer interactions.

The prevalence of the LINKS motif within many trans-AT systems suggests that megacomplex formation may be a more common occurrence than is currently recognized. The mode through which LINKS interactions could stabilize trans-AT PKS megacomplexes has not been observed in the architectures of other biosynthetic assemblies. The crystal structures presented here reveal a defined binding interface formed by hydrophobic and ionic interactions mediated by three helices on the surface of the flanking subdomain. Bioinformatic analysis suggests that the LINKS interaction is unique for each KS, such that the in-register, homotypic, lateral interactions made by them also confer stability to the vertical interactions between the assembly lines of the megacomplex through avidity. While trans-AT systems represent an excellent template for the rational engineering of synthetic PKSs for the exploration of new medicines, it may be necessary to consider the implications of modifying the LINKS network when genetically relocating modules or domains within a trans-AT PKS megasynthase.

METHODS

All protein structures described in this work were cloned, purified, and the corresponding structures refined using identical methods to those previously described for PksKS29. Therefore, only the primers used for gene amplification of new constructs, crystallization conditions, phasing methods, and any other modifications will be reported here.

Crystals of PksKS2 in which the KS domains did not crystallize in the native dimeric form grew over a period of 5–7 days by sitting drop vapor diffusion at 4 ºC. Drops were formed by mixing 2 μL protein solution (4.5 mg/ml PksKS2, 150 mM NaCl, 10% glycerol, 10 mM HEPES, pH 7.5) with 1 μL crystallization buffer (30% PEG 400 (v/v), 0.2 M MgCl2, 0.1 M HEPES, pH 7.5). Crystals were frozen in liquid nitrogen after a 20 min soak in crystallization buffer modified with 20% glycerol, and the diffraction data collected at ALS Beamline 5.0.3. The structure was solved to 3.0 Å resolution by molecular replacement with PhaserMR in the CCP4 suite, using a single chain of the dimeric structure of PksKS2 (PDB: 4NA1) as the search model.

The DNA encoding PksKS6 was amplified using primers: 5′-GCGGCCTGGTGCCGCGCGGCTCTAGCGACCGCCCGGAGGATGCGATAG-3′ and 5′-GTGGTGGTGGTGGTGGTGATGTTAGCCTTCTTGAGTTGGCAGCCAG-3′ (LIC cloning regions underlined for insertion into the plasmid pGAY2833). The eluted protein was concentrated to 15 mg/mL in the equilibration buffer and stored at −80ºC until needed. Crystals of PksKS6 grew in 1–6 days by sitting drop vapor diffusion at 22 ºC. Drops were formed by mixing 2 μL protein solution (15 mg/mL PksKS6, 150 mM NaCl, 10 mM HEPES, pH 7.5) with 1 μL crystallization buffer (150 mM (NH4)2SO4−2, 15% PEG 4000 (v/v), 100 mM MES, pH 6.0). Crystals were frozen in liquid nitrogen after a 20 min soak in the crystallization buffer modified with 10% (v/v) ethylene glycol. Diffraction data, collected at ALS Beamline 5.0.3, were processed by HKL2000. The structure was solved to 3.00 Å resolution by molecular replacement using the KS2 monomer from the PksX synthase of B. subtilis as the search model (PDB: 4NA1).

The PksKS6(Cys169Ser) expression plasmid was generated by PCR amplification from the PksKS6 construct described above using primers 5′-AGACGGCCTCGAGCTCTTCATTGGTCGGCACTCATTTAGCGCGCCAGGCACTTATAA ACAAAG-3′ and 5′-AGAACTGCTCGAGGCGGTATCAATAGGAATCGCTGGCCCTTTTAAATTTAGAAAATA CGG-3′ (XhoI site underlined). The PCR product was digested with XhoI and ligated. Crystals of PksKS6(Cys169Ser) grew over 2–10 days by sitting drop vapor diffusion at 22 ºC. Drops were formed by mixing 2 μL protein solution (15 mg/mL PksKS6(Cys169Ser), 150 mM NaCl, 10 mM HEPES, pH 7.5) with 1 μL crystallization buffer (1.7 M LiSO4, 100 mM Tris, pH 8.0) (both crystal forms grew in the same crystallization condition). Crystals were frozen in liquid nitrogen after a 20 min soak in crystallization buffer modified with 10% (v/v) ethylene glycol and the diffraction data were collected at APS Beamline 23-ID-D. The structure was solved by molecular replacement using a single chain of the dimeric structure of PksKS2 (PDB: 4NA1) as the search model.

Bacillus amyloliquefaciens FZB42 genomic DNA was used as template DNA to PCR amplify BaeKS1 with KAPA High-Fidelity DNA polymerase (KAPA) using the forward primer 5′-GCGGCCTGGTGCCGCGCGGCTCTAGCTTTCCCGATTATTACGAGGACAG-3′ and the reverse primer 5′-GTGGTGGTGGTGGTGGTGATGTTAGTTCTTTTCTGCTTTTTTCGGCG-3′ (pGAY28 LIC cloning regions underlined). The eluted protein was concentrated to 14 mg/mL in the equilibration buffer and stored at −80ºC until needed. Crystals of BaeKS1 grew over 4–15 days by sitting drop vapor diffusion at 22 ºC. Drops were formed by mixing 2 μL protein solution (14 mg/mL BaeKS1, 150 mM NaCl, 10 mM HEPES, pH 7.5) with 1 μL crystallization buffer (32% PEG 400 (v/v), 0.2 M MgCl2, 100 mM HEPES, pH 7.5). Crystals were frozen in liquid nitrogen after a 20 min soak in crystallization buffer modified with 40% PEG 400 (v/v) and the diffraction data were collected at APS Beamline 23-ID-D. The structure was solved by molecular replacement using a single chain of the dimeric structure of PksKS2 (PDB: 4NA1) as the search model.

The gene corresponding to Pks(ACP5+KS6) was PCR amplified using primers 5′-GCGGCCTGGTGCCGCGCGGCTCTAGCGCTGAAGAAACGATTCAATATGC-3′ and 5′-GTGGTGGTGGTGGTGGTGATGTTAGCCTTCTTGAGTTGGCAGC-3′ (pGAY28 LIC cloning regions underlined). Crystals grew over a period of 2–4 days by sitting drop vapor diffusion at 22 ºC. Drops were formed by mixing 2 μL protein solution (25 mg/ml Pks(ACP5+KS6), 150 mM NaCl, 10% glycerol, 10 mM HEPES, pH 7.5) with 1 μL crystallization buffer (30% PEG 4000 (v/v), 0.2 M sodium acetate, 0.1 M Tris, pH 8.5). Crystals were frozen in liquid nitrogen after a 20 min soak in crystallization buffer modified with 20% glycerol, and the diffraction data collected at ALS Beamline 5.0.2. Phases were obtained by molecular replacement using PksKS6 (wild-type) as a search model. The structure was solved to 4.0 Å resolution by molecular replacement using a single chain from the dimeric structure of PksKS6 as the search model. Electron density for the eight ACPs in the asymmetric unit could not be located despite exhaustive inspection of the map.

The DNA encoding BaeKS5 was amplified using primers: 5′- ′GCGGCCTGGTGCCGCGCGGCTCTAGCCAGCAGCTGACAGAGCGTG-3′ and 5′-GTGGTGGTGGTGGTGGTGATGTTAATCAGCTGATGTGTCGATCCAATAAC-3′ (pGAY28 LIC cloning regions underlined). The eluted protein was concentrated to 9.9 mg/mL in the equilibration buffer. Crystals of BaeKS5 grew in 2 days by sitting drop vapor diffusion at 22 ºC. Drops were formed by mixing 1.75 μL protein solution (9.9 mg/mL BaeKS5, 150 mM NaCl, 15 mM HEPES, pH 7.5) with 0.5 μL crystallization buffer (1.1 M LiSO4, 0.1 M Tris, pH 8.0). Crystals were frozen in liquid nitrogen after a 20 min soak in the crystallization buffer modified with 10% (v/v) glycerol. Diffraction data, collected at APS Beamline 23-ID-B, were processed by HKL2000. The structure was solved to 4.20 Å resolution by molecular replacement using the KS2 monomer from the PksX synthase of B. subtilis as the search model (PDB: 4NA1).

Supplementary Material

1
Download video file (11.1MB, mp4)
2

Acknowledgments

Instrumentation and technical assistance for crystallographic work were provided by Dr. A. Monzingo and the Macromolecular Crystallography Facility, with financial support from the College of Natural Sciences, the Office of the Executive Vice President and Provost, and the Institute for Cellular and Molecular Biology at the University of Texas at Austin. The Berkeley Center for Structural Biology is supported in part by the National Institutes of Health (NIH), National Institute of General Medical Sciences, and the Howard Hughes Medical Institute. The Advanced Light Source is supported by the Director, Office of Science, Office of Basic Energy Sciences, of the US Department of Energy under contract no. DE-AC02-05CH11231. We thank the NIH (GM106112) and the Welch Foundation (F-1712) for supporting this research.

Footnotes

AUTHOR CONTRIBUTIONS

D.G. solved and refined or assisted in the solution and refinement of each crystal structure, realized that LINKS mediate megacomplex assembly, wrote the manuscript, and generated the figures. D.W. solved and refined or assisted in the solution and refinement of the crystal structures for PksKS6 and its mutant, BaeKS1, and BaeKS5. J.M., C.Z., and G.G. assisted in the solution and refinement of the BaeKS5, BaeKS1, and PksKS2 crystal structures, respectively. A.K.C. provided experimental and crystallographic insight, and assisted in composing the manuscript and figures. All authors contributed to editing the manuscript and figures, and declare no competing financial interests.

Publisher's Disclaimer: This is a PDF file of an unedited manuscript that has been accepted for publication. As a service to our customers we are providing this early version of the manuscript. The manuscript will undergo copyediting, typesetting, and review of the resulting proof before it is published in its final citable form. Please note that during the production process errors may be discovered which could affect the content, and all legal disclaimers that apply to the journal pertain.

References

  • 1.Hertweck C. The biosynthetic logic of polyketide diversity. Angew Chem Int Ed Engl. 2009;48:4688–716. doi: 10.1002/anie.200806121. [DOI] [PubMed] [Google Scholar]
  • 2.Piel J. Biosynthesis of polyketides by trans-AT polyketide synthases. Nat Prod Rep. 2010;27:996–1047. doi: 10.1039/b816430b. [DOI] [PubMed] [Google Scholar]
  • 3.Walsh CT. The chemical diversity of natural-product assembly lines. Acc Chem Res. 2008;41:4–10. doi: 10.1021/ar7000414. [DOI] [PubMed] [Google Scholar]
  • 4.Keatinge-Clay AT. The structures of type I polyketide synthases. Nat Prod Rep. 2012;29:1050–1073. doi: 10.1039/c2np20019h. [DOI] [PubMed] [Google Scholar]
  • 5.Dutta S, et al. Structure of a modular polyketide synthase. Nature. 2014;510:512–517. doi: 10.1038/nature13423. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6.Khosla C, Herschlag D, Cane DE, Walsh CT. Assembly line polyketide synthases: mechanistic insights and unsolved problems. Biochemistry. 2014;53:2875–2883. doi: 10.1021/bi500290t. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7.Smith S, Tsai SC. The type I fatty acid and polyketide synthases: a tale of two megasynthases. Nat Prod Rep. 2007;24:1041–1072. doi: 10.1039/b603600g. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8.Bretschneider T, et al. Vinylogous chain branching catalyzed by a dedicated polyketide synthase module. Nature. 2013;502:124–128. doi: 10.1038/nature12588. [DOI] [PubMed] [Google Scholar]
  • 9.Gay D, et al. A close look at a ketosynthase from a trans-acyltransferase modular polyketide synthase. Structure. 2014;22:441–451. doi: 10.1016/j.str.2013.12.016. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10.Piasecki SK, Zheng J, Axelrod AJ, Detelich ME, Keatinge-Clay AT. Structural and functional studies of a trans-acyltransferase polyketide assembly line enzyme that catalyzes stereoselective α- and β-ketoreduction. Proteins. 2014 doi: 10.1002/prot.24561. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11.Tang GL, Cheng YQ, Shen B. Leinamycin biosynthesis revealing unprecedented architectural complexity for a hybrid polyketide synthase and nonribosomal peptide synthetase. Chem Biol. 2004;11:33–45. doi: 10.1016/j.chembiol.2003.12.014. [DOI] [PubMed] [Google Scholar]
  • 12.Tang Y, Chen AY, Kim CY, Cane DE, Khosla C. Structural and mechanistic analysis of protein interactions in module 3 of the 6-deoxyerythronolide B synthase. Chem Biol. 2007;14:931–943. doi: 10.1016/j.chembiol.2007.07.012. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13.Maier T, Leibundgut M, Ban N. The crystal structure of a mammalian fatty acid synthase. Science. 2008;321:1315–1322. doi: 10.1126/science.1161269. [DOI] [PubMed] [Google Scholar]
  • 14.Aron ZD, Fortin PD, Calderone CT, Walsh CT. FenF: Servicing the mycosubtilin synthetase assembly line in trans. ChemBioChem. 2007;8:613–616. doi: 10.1002/cbic.200600575. [DOI] [PubMed] [Google Scholar]
  • 15.Patel PS, et al. Bacillaene, a novel inhibitor of procaryotic protein synthesis produced by Bacillus subtilis: Production, taxonomy, isolation, physico-chemical characterization and biological activity. J Antibiot. 1995;48:997–1003. doi: 10.7164/antibiotics.48.997. [DOI] [PubMed] [Google Scholar]
  • 16.Butcher RA, et al. The identification of bacillaene, the product of the PksX megacomplex in Bacillus subtilis. Proc Natl Acad Sci USA. 2007;104:1506–1509. doi: 10.1073/pnas.0610503104. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 17.Straight PD, Fischbach MA, Walsh CT, Rudner DZ, Kolter R. A singular enzymatic megacomplex from Bacillus subtilis. Proc Natl Acad Sci USA. 2007;104:305–310. doi: 10.1073/pnas.0609073103. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18.Broadhurst RM, Nietlispach D, Wheatcroft MP, Leadlay PF, Weissman KJ. The structure of docking domains in modular polyketide synthases. Chem Biol. 2003;10:723–731. doi: 10.1016/s1074-5521(03)00156-x. [DOI] [PubMed] [Google Scholar]
  • 19.Xu XP, et al. Localization of the ActIII actinorhodin polyketide ketoreductase to the cell wall. FEMS Microbiol Lett. 2008;287:15–21. doi: 10.1111/j.1574-6968.2008.01289.x. [DOI] [PubMed] [Google Scholar]
  • 20.Imperi F, Visca P. Subcellular localization of the pyoverdine biogenesis machinery of Pseudomonas aeruginosa: A membrane-associated “siderosome”. FEBS Lett. 2013;587:3387–3391. doi: 10.1016/j.febslet.2013.08.039. [DOI] [PubMed] [Google Scholar]
  • 21.Porter JL, et al. The cell wall-associated mycolactone polyketide synthases are necessary but not sufficient for mycolactone biosynthesis. PLoS ONE. 2013;8(7):e70520. doi: 10.1371/journal.pone.0070520. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 22.Simunovic V, et al. Myxovirescin A biosynthesis is directed by hybrid polyketide synthases/nonribosomal peptide synthetase, 3-hydroxy-3-methylglutaryl-CoA synthases, and trans-acting acyltransferases. ChemBioChem. 2006;7(8):1206–1220. doi: 10.1002/cbic.200600075. [DOI] [PubMed] [Google Scholar]
  • 23.Hantash FM, Earhart CM. Membrane association of the Escherichia coli enterobactin synthase proteins EntB/G, EntE, and EntF. J Bacteriol. 2000;182:1768–1773. doi: 10.1128/jb.182.6.1768-1773.2000. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 24.Lohman JR, et al. Structural and evolutionary relationships of “AT-less” type I polyketide synthase ketosynthases. PNAS. 2015;112:12693–12698. doi: 10.1073/pnas.1515460112. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 25.Nguyen T, et al. Exploiting the mosaic structure of trans-acyltransferase polyketide synthases for natural product discovery and pathway dissection. Nat Biotechnol. 2008;26:225–233. doi: 10.1038/nbt1379. [DOI] [PubMed] [Google Scholar]
  • 26.Ye Z, Musiol EM, Weber T, Williams GJ. Reprogramming acyl carrier protein interactions of an acyl-CoA promiscuous trans-acyltransferase. Chem Biol. 2014;21:636–646. doi: 10.1016/j.chembiol.2014.02.019. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 27.Pulsawat N, Kitani S, Nihira T. Characterization of biosynthetic gene cluster for the production of virginiamycin M, a streptogramin type A antibiotic, in Streptomyces virginiae. Gene. 2007;393:31–42. doi: 10.1016/j.gene.2006.12.035. [DOI] [PubMed] [Google Scholar]
  • 28.Wong FT, Jin X, Mathews II, Cane DE, Khosla C. Structure and mechanism of the trans-acting acyltransferase from the disorazole synthase. Biochemistry. 2011;50:6539–6548. doi: 10.1021/bi200632j. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 29.Lopanik NB, et al. In vivo and in vitro trans-acylation by BryP, the putative bryostatin pathway acyltransferase derived from an uncultured marine symbiont. Chem Biol. 2008;15:1175–1186. doi: 10.1016/j.chembiol.2008.09.013. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 30.Musiol EM, et al. Supramolecular templating in kirromycin biosynthesis: the acyltransferase KirCII loads ethylmalonyl-CoA extender onto a specific ACP of the trans-AT PKS. Chem Biol. 2011;18:438–444. doi: 10.1016/j.chembiol.2011.02.007. [DOI] [PubMed] [Google Scholar]
  • 31.Brignole EJ, Smith S, Asturias FJ. Conformational flexibility of metazoan fatty acid synthase enables catalysis. Nat Struct Mol Biol. 2009;16:190–197. doi: 10.1038/nsmb.1532. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 32.Davison J, et al. Insights into the function of trans-acyl transferase polyketide synthases from the SAXS structure of a complete module. Chem Sci. 2014;5:3081–3095. [Google Scholar]
  • 33.Gay G, Wagner D, Keatinge-Clay AT, Gay DC. Rapid modification of the pET-28 expression vector for ligation-independent cloning using homologous recombination in Saccharomyces cerevisiae. Plasmid. 2014;76:66–71. doi: 10.1016/j.plasmid.2014.09.005. [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

1
Download video file (11.1MB, mp4)
2

RESOURCES