Abstract
Bacterial capsules have critical roles in host-pathogen interactions. They provide a protective envelope against host recognition, leading to immune evasion and bacterial survival. Here we define the capsule biosynthesis pathway of Haemophilus influenzae serotype b (Hib), a Gram-negative bacterium that causes severe infections in infants and children. Reconstitution of this pathway enabled the fermentation-free production of Hib vaccine antigens starting from widely available precursors and detailed characterization of the enzymatic machinery. The X-ray crystal structure of the capsule polymerase Bcs3 reveals a multi-enzyme machine adopting a basket-like shape that creates a protected environment for the synthesis of the complex Hib polymer. This architecture is commonly exploited for surface glycan synthesis by both Gram-negative and Gram-positive pathogens. Supported by biochemical studies and comprehensive 2D nuclear magnetic resonance, our data explain how the ribofuranosyltransferase CriT, the phosphatase CrpP, the ribitol-phosphate transferase CroT and a polymer-binding domain function as a unique multi-enzyme assembly.
Subject terms: Carbohydrates, Glycobiology, Enzyme mechanisms, X-ray crystallography, Vaccines
Polymerase Bcs3, which allows the fermentation-free synthesis of Haemophilus influenzae type b capsule for vaccine development, adopts a basket-like shape with all six active sites facing the interior, creating a protected environment for catalysis.
Main
Haemophilus influenzae is a Gram-negative bacterium colonizing the human nasopharynx. It commonly causes upper and lower respiratory infections and occasionally more serious diseases like meningitis and septicemia with high incidence rates, especially in infants1. Based on the chemical properties of the linear, charged capsule polymer surrounding the pathogen, H. influenzae can be grouped into six serotypes a–f, of which serotype b (Hib) is clinically most important. The capsule polymer of Hib, consisting of a ribofuranose- (Ribf) and ribitol-5-phosphate-containing repeating unit2 ([→3)-β-d-Ribf-(1→1)-d-Ribitol-(5→OPO3→]n) (Fig. 1a), is one of the pathogen’s main virulence factors and enables it to evade the host’s immune system1,3.
Capsule polymers can be used as an antigen in vaccine formulations4. In the early 1990s, the Hib conjugate vaccine was the first glycoconjugate vaccine ever licensed and was immensely successful in reducing the disease burden5. However, the global distribution of Hib conjugate vaccines is still a challenge, partly because the high production costs associated with purifying the capsule polymer from pathogen cultures limit the introduction of Hib vaccination worldwide, including in developing countries6. Promising alternative strategies are the use of either chemical and/or enzymatic synthesis for the provision of capsule polymer7. Notably, a chemically synthesized, cost-effective Hib vaccine was introduced in the Cuban national immunization schedule in 2004 (ref. 5). However, an enzyme-based synthesis approach has not yet been realized, as the enzymatic machinery responsible for capsule biosynthesis in Hib remains elusive. Its characterization would open up exciting opportunities for the fermentation-free production of a Hib vaccine from ultrapure components, both in the context of recombinant proteins and substrates.
On the DNA level, capsule expression in Hib is organized in a conserved region named capsule gene cluster, which can be subdivided into regions 1–3 (refs. 8,9; Fig. 1b and Extended Data Fig. 1). Regions 1 and 3 are conserved between species and encode the export machinery belonging to an ABC-transporter-dependent capsule assembly system, historically referred to as group 2 biosynthesis system3. Region 2 is serotype-specific and encodes the capsule biosynthesis enzymes. In Hib, region 2 contains four open reading frames called bcs1-bcs4 (ref. 8; Fig. 1b and Extended Data Fig. 1a), of which only the product of bcs1 (Bcs1) was biochemically characterized10 and shown to encode a CDP-ribitol synthase, which putatively provides the nucleotide-activated precursor for the ribitol-5-phosphate moiety in the Hib polymer (Fig. 1c). So far, no information is available about the function of the gene products Bcs2–4.
At least the following two enzymatic functions are needed to assemble the Hib polymer ([→3)-β-d-Ribf-(1→1)-d-Ribitol-(5→OPO3→]n): a ribofuranosyltransferase and a ribitol-phosphate transferase. Despite the abundance of ribose in bacterial surface glycans11, the enzymatic machinery for ribose incorporation into capsular polysaccharides has not been identified yet. Ribitol-phosphate transferases have been described biochemically in the context of Gram-positive wall teichoic acids (WTAs)12. However, evidence of these enzymes in the Gram-negative background is lacking, and structural information is missing entirely. In this context, the capsule biosynthesis machinery of Hib might connect biosynthesis modules used in both Gram-negative and Gram-positive pathogens, and the structural characterization of the enzyme would greatly improve the understanding of surface glycan biosynthesis in both systems, thereby advancing the search for the discovery of new antimicrobials.
In this study, we defined the entire pathway for the fermentation-free synthesis of the Hib capsule polymer starting from purified enzymes and inexpensive, widely available substrates. We identified Bcs3 as the capsule polymerase of Hib and solved its crystal structure in complex with cytidine monophosphate (CMP), phosphate and Hib polymer. The enzymes of Bcs3 form a machine for heteropolymer synthesis of as yet undescribed complexity. We unveil the molecular basis of substrate specificity in each one of the three enzymes and suggest a mechanism by which they act in concert to sequentially assemble the Hib capsule.
Results
Bcs3 is the Hib capsule polymerase
From the noncharacterized genes of the Hib capsule gene cluster (bcs2–4), only bcs3 codes for an enzyme large enough (1,215 amino acids) to encompass at least two catalytic activities (Fig. 1b). Homology modeling performed with Bcs3 using Phyre2 (ref. 13) predicted (Fig. 1b) (1) an N-terminal TagF-like polymerase14,15 (Supplementary Fig. 1), (2) a central phosphatase16 and (3) a large C-terminal region of ca. 600 amino acids for which no significant homologs could be provided by Phyre2. We hypothesized that this region might harbor the ribosyltransferase domain. Bcs3 was cloned and purified, and CDP-ribitol was considered a suitable donor substrate. Because CDP-ribitol is not commercially available and its precursor ribulose-5-phosphate is comparatively expensive, the enzymes Bcs1 (ref. 10) and RpiA17 from Hib were cloned to establish a cascade synthesis for CDP-ribitol starting from d-ribose-5-phosphate, NADPH and CTP (Fig. 1c,d and Supplementary Fig. 2). As expected, the presence of Bcs1 in a reaction containing ribulose-5-phosphate led to the consumption of NADPH, while NADP+ and a peak eluting similarly to CDP-glycerol were produced. The same result can be observed if ribulose-5-phosphate is produced in situ from ribose-5-phosphate by RpiA (Fig. 1d, compare reactions 1 and 2), indicating that CDP-ribitol can be generated by RpiA and Bcs1 in a one-pot reaction.
PRPP is the ribose donor for the synthesis of Hib polymer
ADP-ribose, α-d-ribose-1-phosphate (Rib-1P) and 5-phospho-α-d-ribose-1-diphosphate (PRPP; Fig. 1e) have been discussed as putative ribose donors for Hib polymer synthesis9 and were thus tested as substrate candidates for the putative ribofuranosyltransferase activity of Bcs3. Because CDP-ribitol degraded during attempts of isolation (Supplementary Fig. 3), it was generated in situ by Bcs1 and RpiA. Interestingly, CDP-ribitol was only used up in the presence of PRPP, with concomitant production of CMP and large polymeric products detectable by Alcian blue-stained PA gel18 (Fig. 1d,f, reaction 6). In contrast, ADP-ribose and ribose-1-phosphate did not stimulate the consumption of CDP-ribitol nor the production of CMP or polymer (Fig. 1d,f, reactions 4 and 5). Thus, we concluded that PRPP is the donor substrate for Bcs3. To confirm the identity of the produced polymer, the enzyme cascade of RpiA, Bcs1 and Bcs3 was scaled up. Polymer was purified and analyzed by immunoblotting (Fig. 1g) and nuclear magnetic resonance (NMR; Extended Data Fig. 2). 1H and 13C chemical shifts agreed perfectly with spectra previously reported for natural Hib polymer (Supplementary Tables 1 and 2). Notably, 1D 31P NMR and 1H-31P HMBC spectra showed only a single phosphodiester signal and no indication of the presence of phosphomonoesters in the polymer, suggesting that the predicted phosphatase (Fig. 1b) might be required to remove the 5-phosphate present in PRPP. Of note, the ribofuranose has α-configuration in PRPP and β-configuration in the polymer (Fig. 1a,e), indicating that Bcs3 uses an inverting mechanism.
The overall architecture of Bcs3
Crystallographic studies were performed using a C-terminally truncated version of the enzyme (Supplementary Fig. 4). The crystal structure of Bcs3 (residues 1–1,115, called Bcs3-CMP hereafter) in complex with CMP was solved by molecular replacement methods using an Alphafold19 model (see Methods for details) at a maximum resolution of 2.9 Å (Fig. 1h, Supplementary Fig. 5 and Supplementary Table 3). Bcs3-CMP crystallized in the P 212121 space group, with two molecules in the asymmetric unit and residues 1–1,115 visible in the electron density map (Extended Data Figs. 3 and 4 and Supplementary Fig. 6). The following five different regions were identified from the N- to the C-terminus (Fig. 1h and Supplementary Fig. 5): (1) a TagF-like capsule polymerase14,15 called CroT (Capsule ribitol-5-phosphate Transferase (residues 1–373), (2) a phosphatase called CrpP (Capsule ribose phosPhatase (residues 374–591), (3) the ribose-5-phosphate transferase CriT (Capsule ribose-5-phosphate Transferase (residues 592–1,036) and (4) an SH3b20 domain (residues 1,037–1,115). Region 5, the amino acids C-terminally attached to the SH3b domain, was not visible in the structure (residues 1,116–1,215; Supplementary Fig. 7). However, a bundle of amphipathic α-helices (CT-HB) compatible with membrane association and/or protein–protein interactions could be predicted for this region using Alphafold19. Interestingly, these helices are not present in Bcs3 homologs identified via BLAST21 in Gram-positive bacteria (Supplementary Data 1), suggesting that they might be specific for the Gram-negative capsule biosynthesis systems3 shown in Extended Data Fig. 1.
The Bcs3 protomers build into a physiological and functional head-to-tail dimer (Fig. 1h and Supplementary Fig. 4f,5), which is assembled primarily by interactions between CriT from one protomer and CroT’/CrpP’ from the neighbor protomer, with an overall basket-like shape (Extended Data Fig. 5). The active sites of the three enzymes in each protomer, as well as each of the SH3b domains and the predicted amphipathic α-helices, are located at the concave surface of the dimer (Supplementary Fig. 5). The concave surface of Bcs3 contains several hydrophobic patches interspersed with clusters of positively charged residues, which could indicate potential binding sites for Ribf/ribitol and for phosphate of the repeating unit, respectively. In contrast, the opposite side of Bcs3 displays a negatively charged surface, which would generate a substantial electrostatic repulsion with the bacterial plasma membrane (Supplementary Fig. 5d).
Structural basis of CriT substrate specificity and catalysis
CriT comprises the following four domains from the N- to the C-terminus (Fig. 2a–c): (1) a first α-helical bundle (HB-1; residues 599–654 and 904–959), (2) a catalytic domain (CriT-CAT; residues 655–715 and 796–903), (3) a second α-helical bundle (HB-2; residues 716–795) and (4) a third α-helical bundle (HB-3; residues 960–1,036). The CriT-CAT domain adopts an α/β core structure composed of five parallel β-strands arranged in a twisted β-sheet, surrounded by six α-helices (Fig. 2a). A similar structural arrangement can be observed in phosphoribosyltransferases (PRTases) involved in the formation of N-glycosidic bonds during nucleotide synthesis22. PRTases contain a deep pocket for PRPP binding23. In Bcs3-CMP, a corresponding pocket contains two phosphate groups (PO41 and PO42; Extended Data Fig. 4d and Supplementary Fig. 8), is located at one end of the β-sheet, and is surrounded by α-helices α37, α38, α45 and α46, and the loops shown in Fig. 2. Structural comparison of the CriT-CAT domain with PRTases revealed that the 5-phosphate of PRPP superimposes well with PO42, whereas the β-phosphate of PRPP superimposes well with PO41 (Supplementary Fig. 9). Thus, molecular docking guided by homology (Methods) was used to place PRPP in an energetically favored configuration into the deep pocket of the CriT-CAT (Fig. 2d–g).
PRTases use the side chains of two conserved vicinal aspartates (DD motif) to accommodate the ribose ring of PRPP in the corresponding active sites24. In addition, the ribose C2 and C3 hydroxyl groups and the pyrophosphate moiety interact with Mg2+. The functional conservation of the DD motif in CriT-CAT is achieved by equivalent positioning of the side chains of D680 and E818, which are conserved in all homologs identified in this study (Fig. 3a, Supplementary Data 1 and Supplementary Fig. 9). The replacement of D680 and E818 by alanine led to a clear reduction of the Bcs3 enzymatic activity (Fig. 3b,c, constructs 5–7). The location of these residues in different structural elements, namely loop 3 and loop 1 (Fig. 2d), provides a more open topology of the CriT-CAT active site when compared to the PRTase homologs (Supplementary Fig. 9) and enables the Hib polymer to access the binding site. As a consequence, C1 from PRPP faces a putative entrance for the nonreducing end ribitol of the nascent polymer (Fig. 2e–g). Alongside this entrance, we hypothesize that E874 is ideally positioned to deprotonate 1-OH of ribitol for a nucleophilic attack (Supplementary Fig. 10). Based on homology with PRTases, we propose that CriT follows the mechanism proposed in Supplementary Fig. 11a.
Structural basis of CrpP substrate specificity and catalysis
The phosphatase CrpP adopts a general topology of the haloacid dehalogenase (HAD)-like enzymes, a large superfamily of phosphohydrolases found in organisms from all three superkingdoms of life16,25. CrpP comprises the following two domains from the N- to the C-terminus: a catalytic Rossmann-fold domain (CrpP-CAT; residues 374–392 and 491–591, gray) and a mostly α-helical mobile cap domain (CrpP-CAP; residues 393–490, yellow; Fig. 4a–c). The CrpP-CAT comprises a five-stranded parallel β-sheet flanked by six α-helices on both sides. CrpP-CAT displays the following two key structural signature motifs that are shared by HAD-like enzymes16: (1) a single helical turn (residues 387–390 in CrpP, tan and loop 1) and (2) a ‘flap’ in the form of a β-hairpin motif (β14 and β15 in CrpP) located immediately after the first β-strand of the Rossmann-fold domain (Fig. 4a). These two signatures are predicted to provide the necessary mobility between the ‘open’ and ‘closed’ conformations in HAD-like enzymes16.
Besides the structural motifs, HAD-like phosphohydrolases are defined by the presence of four sequence motifs16. Motif I comprises a signature DXD located at the end of the first β-strand, right at the beginning of loop 1. The first aspartate of this motif acts as a nucleophile that forms an aspartyl-intermediate during catalysis and only this first aspartic acid is conserved in CrpP (D386; loop 1 in Fig. 4d, Supplementary Figs. 12 and 13 and Supplementary Data 1). In HAD-like phosphohydrolases, a deep pocket contains the binding site for the terminal phosphomonoester substrate. In CrpP, such a binding site could be located at one end of the β-sheet (β13, β16 and β18) and would be flanked by α-helix α33 and loops 1–4 of CrpP-CAT, as well as by the flap (β14-β15), α23, α25, α26 and loops 5 and 6 of CrpP-CAP (Fig. 4a,b). In agreement with that, a phosphate and a Zn2+ found in CrpP’s deep pocket superimpose well with substrates and Mg2+ identified in structural homologs (Fig. 4a–c, Extended Data Fig. 4a,b and Supplementary Fig. 12). Guided by the ligands and homology, we performed molecular docking, which placed one Hib repeating unit ending with a 5-phosphorylated, nonreducing end Ribf ([OPO3→3)-β-d-Ribf-(1→1)-d-Ribitol-(5→OPO3]) in an energetically favored configuration into the CrpP-CAT deep pocket (Fig. 4d–g; Methods). Notably, an internal 5-phosphorylated Ribf could not be placed into the pocket, supporting an exo-cleavage of the 5-phosphate over an endo-cleavage, and suggesting that CrpP acts after CriT and before CroT in the reaction cycle. Notably, the architecture of the active sites of CrpP and other HAD-like phosphatases support a common catalytic mechanism (Supplementary Fig. 11b).
Structural basis of CroT substrate specificity and catalysis
CroT displays the typical GT-B fold of glycosyltransferases (Supplementary Fig. 14), consisting of an N-terminal (CroT-NT) and a C-terminal (CroT-CT) Rossmann-fold domain, with a deep fissure at the interface that includes the catalytic center26 (Fig. 5a,b). The core of each domain is composed of a six-stranded parallel β-sheet flanked by α-helices on both sides (Fig. 5a). The close inspection of the electron density maps revealed a CMP molecule and glycerol located in one of the CroT protomers (Fig. 5a,b, Extended Data Fig. 3a,b and Supplementary Fig. 15). The CMP binding pocket on the C-terminal side of the interdomain crevice is defined by α-helices α8, α15 and α16, loops 1–3 of the CroT-CT domain, as well as loop 4 of the CroT-NT domain (Fig. 5a–c). A structural comparison of the CroT protomer complexed with CMP and the apo CroT protomer reveals an important interdomain movement that has been proposed for other members of the GT-B superfamily during substrate binding and catalysis (Extended Data Fig. 6 and Supplementary Fig. 16).
CroT transfers ribitol-phosphate from CDP-ribitol, generating a phosphodiester bond with the 3-OH of the nonreducing end ribose of the Hib polymer. The only other bacterial polyol-phosphate transferase whose structure was solved is the glycerol-phosphate polymerase TagF from Staphylococcus epidermidis14. TagF creates a phosphodiester bridge by transferring a glycerol-3-phosphate moiety from CDP-glycerol to the 1-OH of poly-glycerol-phosphate. It uses a simple displacement mechanism and by homology, we propose a similar mechanism for CroT (Supplementary Fig. 11c). To support this, we performed molecular docking calculations of CDP-ribitol into CroT (Methods). A structural comparison of the CroT-CMP complex with the TagFH444N-CDP-glycerol complex (PDB code 3L7K14) revealed that the CMP moieties of both structures superimpose well (Supplementary Fig. 14). Therefore, the ribitol-phosphate moiety was placed into the CroT active site using the observed orientation of the glycerol-3-phosphate moiety in the TagFH444N-CDP-glycerol complex14, as well as the glycerol and CMP identified in CroT as a reference. The glycerol moiety of CDP-glycerol accommodates into a small pocket flanked by P447 in TagF, whereas in CroT, the equivalent residue G99 creates extra space, presumably facilitating the ribitol-phosphate binding. In agreement with that, the proline is conserved in homologs of CroT that transfer glycerol-phosphate15,27, whereas the glycine is conserved in homologs transferring ribitol-phosphate12 (Supplementary Data 2, highlighted in yellow). The putative base H96, as well as residue H233, are strictly conserved among all homologs identified in this study and substitution to alanine abolished or drastically reduced activity (Fig. 3a–c, constructs 2–3, and Supplementary Data 1 and2).
Structural basis of Bcs3 Hib capsule polymer recognition
To study the location of Hib capsule polymer in Bcs3, crystals were soaked with oligomer fractions having a narrowed degree of polymerization (DP) (Extended Data Fig. 7, Supplementary Note and Supplementary Table 2). A structure was solved by molecular replacement methods (3.6 Å maximum resolution) showing Bcs3 in complex with two complete repeating units (DP2) of the Hib capsule polymer and an additional nonreducing end phosphate coming from a third repeating unit (Fig. 6). Bcs3-DP2 crystallized in the P 1 21 1 space group, with four molecules in the asymmetric unit (Supplementary Table 3). The overall architecture between Bcs3-CMP and Bcs3-DP2 is essentially preserved. However, we observed more flexibility for the CroT domain in Bcs3-DP2 than for the CroT domain in Bcs3-CMP (Supplementary Figs. 6 and 16). DP2 is clearly visible in the electron density maps and is present in all four Bcs3 molecules bound to an SH3b domain (residues 1,037–1,115), with the nonreducing end facing toward the active centers in the main groove (Fig. 6a,b, Extended Data Fig. 3c,d and Supplementary Fig. 17). SH3b domains are the prokaryotic counterparts of the well-characterized SH3 domain family20,28. The SH3b domain in Bcs3-DP2 revealed a completely conserved fold—a compact β-barrel, formed by two anti-parallel, three-stranded β-sheets (Fig. 6e and Supplementary Fig. 14c).
Molecular basis of Bcs3 enzymology
As described above, the constructs Bcs3H96A/Bcs3H233A (CroT), Bcs3D386A (CrpP) and Bcs3D680A/Bcs3E818A/Bcs3D680A/E818A (CriT) were (partially) inactive (Fig. 3a–c, constructs 2–7), confirming the importance of the targeted residues for the catalytic function of the Bcs3 enzymes. Constructs 8–10, in which only one of the enzymes remained intact, could be combined in trans during polymer assembly, corroborating that the introduced substitutions did not alter the function of the remaining Bcs3 domains (Fig. 3b,c, last lane) and indicating that the polymer could be released from each construct despite the fact that processivity is observed for wild-type Bcs3 (Extended Data Fig. 8).
To determine the relevance of the two predicted amphipathic α-helices and the SH3b domain, we performed C-terminal truncations in Bcs3 (Fig. 3b, constructs 11–14). Interestingly, the deletion of the last 53 amino acids (Fig. 3b; Bcs32–1162, construct 11) comprising the entire amphipathic α-helical bundle (α56-α59 in Supplementary Data 1) improved the yield of purified protein by fourfold (20 mg l−1 (Bcs3) versus 86 mg l−1 (Bcs32–1162)). Because truncations can influence the ability of capsule polymerases to start the reaction de novo29, activity tests were performed in the presence and absence of acceptor. Both Bcs32–1162 and Bcs32–1122 maintained their enzymatic activity, indicating that the catalytic machinery is not dependent on the two predicted amphipathic α-helices (Fig. 3d). As expected, the observed polymers are slightly shorter in the presence of acceptor, indicating specific elongation and a certain control over the product length, which would be beneficial for the biotechnological use of the enzyme29,30 (Extended Data Fig. 8). Notably, Bcs32–1093 and Bcs32–1068, in which the SH3b domain is disrupted, were completely inactive, strongly supporting the mechanistic role of the SH3b domain for Hib capsule polymer recognition.
N-terminal truncations were performed to further advance our understanding of the CriT domain. CriT and CrpP were separated between α35 and α36, which are part of the N-terminal HB-1 of CriT (Fig. 2a). The loop connecting these structural elements is of variable length (Supplementary Data 1) or even missing (Fig. 3a) in CriT homologs identified by BLAST21. The resulting constructs Bcs32–631, Bcs3632–1162 and Bcs3632–1215 were inactive alone but active if combined in trans (Fig. 3b,e, constructs 15–17), demonstrating that the separated enzymes indeed maintain their ability to generate polymer and that HB-1 of CriT can be partly reduced in size. Since Bcs3632–1162 was purified with improved quality if compared to Bcs3632–1215 (Supplementary Fig. 2c), Bcs3632–1162 was N-terminally truncated to narrow down the minimal active CriT domain. The resulting Bcs3643–1162, Bcs3653–1162 and Bcs3664–1162 (Fig. 3b, constructs 18–20) were combined in trans with Bcs32–631. The ability to initiate chain elongation de novo was only observed for Bcs32–631 + Bcs3632–1162, with Bcs32–631 + Bcs3643–1162 still being able to elongate priming acceptor oligosaccharides (Fig. 3e, 15 + 17, 15 + 18).
The rationale for separating CroT and CrpP was based on a comparison of Bcs3 with the capsule biosynthesis enzymes of Escherichia coli K18 (Extended Data Fig. 1b). The Hib ([→3)-β-d-Ribf-(1→1)-d-Ribitol-(5→OPO3→]n) and E. coli K18 ([→2)-β-d-Ribf-(1→2)-d-Ribitol-(5→OPO3→]n) capsules only differ with regard to how their constituents are linked. We identified separate open reading frames for CroT and CrpP/CriT in the K18 capsule genes cluster (Fig. 3b and Extended Data Fig. 1b). An alignment of Bcs3 with CroT-K18 and CrpP-CriT-K18 demonstrates that the region comprising α21 and α22 (amino acids 348–379) of Bcs3 is missing in E. coli K18, indicating that it might not be crucial for activity (Supplementary Data 3). Consequently, we designed Bcs32–354, Bcs32–379, Bcs3354–631 and Bcs3378–631 (Fig. 3b, constructs 21–24) and tested them in all possible combinations together with the CriT domain Bcs3632–1162. The activity was only visible in the presence of the acceptor and when constructs Bcs32–378 + Bcs3354–631 were combined (Fig. 3f, 17 + 24 + 21), indicating that the truncated region is important for the function of both CroT and CrpP under in vitro conditions.
Discussion
Gram-negative bacteria use ABC-transporter-dependent assembly systems for the expression of polymers in which charged carbohydrate moieties (sialic acid, 3-deoxy-d-manno-oct-2-ulosonic acid (Kdo) and glucuronic acid) or phosphate residues alternate with noncharged carbohydrates (group 2 for E. coli nomenclature)31. Polymers exclusively containing sugar units are generated by enzymes, adopting one of the two most abundant GT folds: GT-A or GT-B3,26,32,33. Polymerases assembling a heteropolymer, in which monosaccharides are bridged by phosphodiester linkages29,34, belong to a protein family called stealth35. Polyol-phosphate-containing polymers are assembled by TagF-like enzymes15,18,27,30. Structural data for capsule polymerases embedded in group 2 systems are limited to the chondroitin polymerase K4CP from E. coli K4 (ref. 32) and the polysialyltransferase from Mannheimia haemolytica serotype A2 (ref. 33; MhPST). With Bcs3, we present the structure of a multi-enzyme machine that generates a complex heteropolymer and combines three catalytic domains that are new in the context of capsule biosynthesis.
Our structural data support a mechanistic model in which the catalytic cycle that adds one repeating unit to the nascent chain consists of three steps as follows: (1) CriT transfers Ribf-5P from PRPP to the C1’ OH group of the nonreducing end ribitol of the growing chain, (2) CrpP removes the phosphate group from the 5 position of the now terminal Ribf-5P transferred by CriT and (3) CroT catalyzes the transfer of ribitol-phosphate from CDP-ribitol to the 3 position of the ribose ring released by CrpP (Extended Data Figs. 9 and 10). Even though the dimeric structural arrangement of Bcs3 allows the nascent chain to be elongated according to two scenarios (Extended Data Fig. 9), the arrangement of CriT, CrpP, CroT and SH3b supports a model in which two reaction centers inside the concave side of Bcs3 are formed, comprising two triads of enzymes—CriT/CrpP’/CroT’ and CriT’/CrpP/CroT (Extended Data Figs. 6a and 9c). The fact that constructs 15 (CroT/CrpP) and 17 (CriT) can form a polymerase (Fig. 3e) that is even active de novo supports this hypothesis.
A polysaccharide-binding site could be identified through molecular dynamics simulations and a priming acceptor monosaccharide in the crystal structure of the hyaluronan synthase Cv-HAS from Chlorella virus36. However, no structural evidence is available for extended binding sites or binding modules complexed with native polymer for bacterial group 2 capsule polymerases32,33. The Hib capsule dimer found in Bcs3 is bound to an SH3b domain, which is located 20–40 Å away from the active sites, suggesting a tethering mechanism37 that allows Bcs3 to stay attached to its acceptor while the nonreducing end is elongated (Extended Data Fig. 9). It is tempting to speculate that the SH3b domain allows a sliding motion to guide the polymer to the outside of the concave Bcs3 basket. The concave walls of Bcs3 are decorated with scattered patches of positively charged residues we suspect to participate in the channeling of the capsule polymer during the catalytic cycle, possibly via phosphate interactions. We hypothesize that this allows efficient elongation38 and will likely contribute to the level of processivity39 observed in Extended Data Fig. 8 and discussed in Supplementary Discussion.
In the living cell, most physiological processes are coordinated in networks. Colocalization of enzymes in compartments promotes higher performance of individual enzyme activities and reaction efficiency40. The Bcs3 crystal structures reveal a complex architecture compatible with a multi-enzyme macromolecular machine that acts as a concave basket-like compartment to polymerize the Hib capsule polymer. The electrostatic surface potential of Bcs3 suggests that the concave surface—including the three active sites, the SH3b domain and the predicted amphipathic α-helices—is oriented toward the membrane. Interestingly, the exit points of the SH3b domains are approximately 70–80 Å apart, while ABC transporters exporting polymers similar to the Hib capsule have a diameter of approximately 60–70 Å41,42. Combined with the observation that capsule polymers are synthesized in rafts43, it is tempting to speculate that the Bcs3 architecture supports the bridging of two ABC-transporter complexes, thereby coupling capsule biosynthesis to export (Extended Data Fig. 10).
The identity of the activated precursor for the transfer of Ribf and the machinery for its incorporation into bacterial CPS has long been a matter of debate. We show that Ribf transfer catalyzed by Bcs3 is realized by combining a HAD-like phosphatase (CrpP) and a glycosyltransferase (CriT) of a yet undescribed GT fold. Typically, glycosyltransferases adopt Rossmann GT-A or GT-B folds and use NDP/NMP-sugars or lipid-linked donors as substrates26. In contrast, CriT resembles PRPP-transferases involved in the formation of N-glycosidic linkages22 but uses PRPP as a substrate for the transfer of O-linked ribofuranose-5-phosphate, which is dephosphorylated by CrpP. During the final preparation of this study, a study was published44 describing CrpP and CriT homologs, and the crystal structure of a HAD-like phosphatase and a putative ribofuranosyltransferase from Thermobacillus composti. A structural comparison with Bcs3 revealed considerable differences between the two structures, indicating structural variety in PRPP-dependent glycosyltransferases (Supplementary Fig. 18).
The closest homologs to the Hib polymer are complex Gram-positive WTAs, but polymerases generating complex WTA have not been identified yet12. WTAs make upto 60% of the cell wall of Gram-positive pathogens, extend beyond the peptidoglycan layer and are thus in direct contact with the host immune system12,45, making them attractive vaccine antigens7,46, and WTA biosynthesis enzymes promising drug targets47. Of the four WTA types (I–IV), only WTA I biosynthesis has been investigated thoroughly12. We identified the homologs of the entire Bcs3 sequence in Gram-positive pathogens (Supplementary Data 1). Future studies will show if these homologs are part of WTA biosynthesis gene clusters. Notably, BLAST searches performed herein using CriT and CroT as queries could not identify homologs in the human proteome, corroborating that mammalian enzymes22,48 with similar substrate usage are distinct from CriT and CroT and emphasizing the importance of the Bcs3 structure for the development of inhibitors against bacterial pathogens.
Enzymatic and chemo-enzymatic polymer synthesis is a promising strategy to provide antigens for glycoconjugate vaccine development4,7. It omits pathogen culture and starts with highly pure precursors27,49. Herein, the established multi-enzyme cascade is scalable, could be used to produce milligram amounts of polymer and allows a considerable measure of length control (Extended Data Figs. 7 and 8). The developed hydrolysis protocols yielded oligomers with free reducing ends that can be coupled through reductive amination and have the right size (for example, 10–25 repeating units) for vaccine development studies7. The single action transferases developed herein might even allow the step-wise build-up of the polymer according to established protocols30. The structural and biochemical characterization of Bcs3, the identification of homologs in both Gram-negative and Gram-positive pathogens and the development of fermentation-free synthesis protocols represent important milestones toward understanding, manipulating and exploiting multi-enzyme machines that will allow new access to structurally complex surface glycans.
Methods
General cloning
A PreScission cleavage site was inserted into the modified p-Mal-c (New England BioLabs) vector pMBP-S3N10-csxA-His6 (ref. 51) (tac) resulting in pMBP-PreScission-S3N10-csxA-His6. The gene cps3B27 was cloned via BamHI/XhoI into plasmid pMBP-PreScission-S3N10-csxA-His6, replacing the csxA sequence and resulting in pMBP-PreScission-S3N10-cps3B-His6. The genes bcs1 (OOD27571.1), bcs3 (OOD27573.1; Uniprot: Q2ERG0), rpiA (OOD27170.1), prsA (OOD25971.1) and rk (OOD27132.1) were amplified by PCR using heat-inactivated lysate of Hib strain ATCC 10211 (GenBank accession number MTGI00000000.1) as a template. The resulting PCR products were cloned into pMBP-S3N10-csxA-His6 via BamHI/XhoI restriction sites or pMBP-PreScission-S3N10-cps3B-His6 via restriction-free cloning52 replacing csxA and cps3B, respectively. Single amino acid mutants and truncations of Bcs3 were introduced according to ref. 53 or by using the Q5 Site-Directed Mutagenesis Kit (New England BioLabs) according to the manufacturer’s guidelines. Constructs and primers used in this study are listed in Supplementary Tables 4 and 5.
Expression and purification of recombinant proteins
For the expression of recombinant proteins, E. coli M15 (pREP4) cells were transformed with expression plasmids and grown in 250–500 ml of PowerBroth medium (ATHENAES) supplemented with 50 μg ml−1 carbenicillin at 37 °C. When the culture reached an optical density at 600 nm (OD600) of 1.0, the culture was incubated at 4 °C for 30–45 min, Bcs3 expression was induced by the addition of 0.5 mM isopropyl β-d-thiogalactopyranoside (IPTG) and incubation was continued at 15 °C and 200 rpm overnight. Cells were collected by centrifugation at 6,000g for 10 min and resuspended in 10 ml of purification buffer (50 mM Tris, pH 8.0, 300 mM NaCl), containing EDTA-free protease inhibitors (Complete EDTA-free; Roche). Cells were sonicated (Sonorex sonication; Bandelin) for 10 cycles of 30 s each at 25–30% power and 50% amplitude, interrupted by 30 s of cooling on ice. Cell debris was separated by centrifugation at 27,000g for 30 min. The supernatant was filtered through a 0.8-µm cutoff membrane and protein was purified by affinity chromatography using a HisTrap column (1 ml bed volume; GE Healthcare) or MBPTrap column (1 or 5 ml bed volume; GE Healthcare), previously equilibrated in purification buffer. The enzyme was eluted using a linear gradient from 35 to 500 mM imidazole over 15 column volumes (HisTrap column) or with 10 mM maltose over 5 column volumes (MBPTrap column). Protein-containing fractions were pooled and applied to a HiPrep 26/10 Desalting column (GE Healthcare) equilibrated in purification buffer. The protein was concentrated using an Amicon centrifugal device (10,000–100,000 molecular weight cutoff, depending on protein size; Merck) and aliquots were snap-frozen in liquid nitrogen and stored at −80 °C. Aliquots of 10 µl of each purification step were analyzed by SDS-PAGE as previously described51. The crystallization construct MBP-Bcs32–1162-His6 was purified using MBPTrap column as described above and shown in Supplementary Fig. 4. Protein-containing fractions were further purified by size exclusion chromatography (SEC) using a Superdex 200 10/300 GL column (GE Healthcare) equilibrated in purification buffer. PreScission digest was performed with 5.8 µg PreScission per milligram of recombinant protein overnight at 4 °C. After centrifugation at 15,000g for 5 min, the supernatant was applied to MBPTrap column to retain the MBP tag. The Bcs3-His6-containing flowthrough was further purified using a HisTrap column as described above, followed by preparative SEC. The SEC column was equilibrated using a gel filtration marker kit for protein molecular weights of 29,000–700,000 (Sigma) according to the manufacturer’s guidelines. The protein was concentrated using an Amicon centrifugal device (50,000 molecular weight cutoff; Merck) to give 10 mg ml−1 of protein, and aliquots were snap-frozen in liquid nitrogen and stored at −80 °C.
Enzymatic reactions, HPLC-AEC analysis and PAGE
Enzymatic reactions were carried out with 0.5 to 2 µM of the enzyme at 37 °C in a total volume of 20 to 30 µl of assay buffer (50 mM Tris, pH 8.0, 10 mM MgCl2, 1 mM DTT (dithiothreitol)) containing 5 mM of each required substrate (ADP-ribose (Sigma), ATP (Roche), CTP (Sigma), NADPH (Roche), PRPP (Sigma), d-ribose (Sigma), ribose-1-phosphate (Sigma), ribose-5-phosphate (Sigma), ribulose-5-phosphate (Sigma) and UTP (Roche)). After 2 h of incubation or after overnight incubation, samples were analyzed by high-performance liquid chromatography-anion-exchange chromatography (HPLC-AEC) and high-percentage (25%) polyacrylamide gel electrophoresis (PAGE) as previously described18 with minor modifications—chromatograms were recorded using 5 µl sample and a linear elution gradient from 0% to 25% of mobile phase 2 (1 M NaCl, 20 mM Tris, pH 8.0) over 42 min. Data were collected using LCsolution version 1.25 SP4 (Shimadzu). Alcian blue-/silver-stained PA gels were scanned on an Amersham Imager 680 and colors were adjusted equally across the entire image to improve the visualization of Alcian blue. If in situ-synthesized ribose-5-phosphate was used as a substrate for further enzymatic reactions, 0.5–2 µM ribokinase was preincubated in a total volume of 30–50 µl assay buffer at 37 °C for 2 h and heat-inactivated at 80 °C for 5 min. The substrate concentration of d-ribose and ATP was 20 mM, resulting in 5 mM of ribose-5-phosphate in subsequent reactions.
Upscaling of polymer synthesis and purification
For the synthesis of 5–50 mg polymer, enzymes were incubated overnight at 37 °C in 3–30 ml of assay buffer (50 mM Tris, pH 8.0, 10 mM MgCl2, 1 mM DTT (dithiothreitol)) supplemented with 5 mM of each substrate. In vitro-synthesized polymer was purified by AEC using a Mono Q 10/100 GL column (GE Healthcare) as described previously30 with the minor adjustment that polymer-containing fractions were either dialyzed against water or thoroughly washed using an Amicon centrifugal device (30,000 molecular weight cutoff; Merck).
Preparation of oligomers
For the generation of oligomers, hydrolysis conditions for the Hib polymer (2.5 mg ml−1) were tested in (1) 25 mM TFA at 37 °C or (2) 10 mM acetic acid at 70 °C. Samples of 10 µl were collected after 15, 30, 45, 60, 90, 120, 180, 255 and 330 min of reaction time and mixed with 10 µl of 2 M sucrose. The hydrolysis products were analyzed by Alcian blue/silver strained high-percentage (25%) PAGE as previously described18. Preparative hydrolysis of 50 mg polymer was performed by incubation with 25 mM TFA for 120 min at 37 °C. The reaction mixture was freeze-dried and dissolved in 20 ml buffer (50 mM Tris, pH 7.0, 100 mM NaCl, 10 mM MgCl2). Dephosphorylation of hydrolysis products was performed by the addition of 200 U alkaline phosphatase (Quick CIP; New England BioLabs) for 75 min at 37 °C, followed by the addition of another 100 U of alkaline phosphatase and incubation for 75 min at 37 °C. Alkaline phosphatase was removed using an Amicon centrifugal device (50,000 molecular weight cutoff; Merck). The oligomer-containing filtrate was diluted to 140 ml and further purified by AEC (Mono Q 10/100 GL; GE Healthcare) using a linear gradient from 0 mM to 350 mM NaCl over 40 column volumes. After analysis by high-percentage PAGE, fractions were divided into 12 pools, dialyzed against water (Spectra/Por 7, 1,000 molecular weight cutoff; Roth) and then freeze-dried (Extended Data Fig. 7).
Dot blot/agglutination assay
Dot blot analysis of purified polymer was performed as previously described51. Hib polymer was detected using a rabbit anti-Hib agglutinating serum (Remel/Thermo Fisher Scientific; 1:1,000 or 1:2,000 dilution).
NMR analysis
All spectra were measured on a 600-MHz Bruker Avance III HD equipped with a 1H/13C/15N/31P QXI probe at 298 K. Typically, samples were dissolved in 500 μl D2O (100.0 atom%; Armar Chemicals) and measured in a 5-mm NMR standard tube. 1H 1D spectra were recorded with eight transients and a recycle delay of 10 s. 31P 1D spectra were recorded with a recycle delay of 3 s and 352 transients. Standard 1H-13C HSQC experiments from the Bruker library (hsqcedetgpsisp2.2) were recorded with 32 scans, 2,048 × 230 points and a recycle delay of 1.5 s. 1H-1H TOCSY spectra were measured with four scans, 2,048 × 360 points, a recycle delay of 1.5 s and a mixing time of either 80 ms or 12 ms. 1H-13C HMBC spectra (Bruker pulse sequence: hmbclpndqf) were recorded with 32 scans, 4,096 × 512 points and a recycle delay of 1.5 s. 1H-31P HMBC spectra (Bruker pulse sequence: hmbcgpndqf) were recorded with 16 scans, 4,096 × 320 points and a recycle delay of 1.5 s. All spectra were processed using Topspin 3.6 (Bruker) and analyzed using Sparky 3.111 and 3.115 (T. D. Goddard and D. G. Kneller, SPARKY 3, University of California, San Francisco, CA, USA). Proton chemical shifts were calibrated to 2,2-dimethyl-2-silapentane-5-sulfonic acid (DSS) using an external sample of 2 mM sucrose and 0.5 mM DSS in H2O/D2O (Bruker). Indirect chemical shift referencing was applied to 13C and 31P according to IUPAB54 using the chemical shift referencing ratios of 0.251449530 and 0.404808636.
Bcs3 crystallization and data collection
The Bcs3-CMP complex was crystallized by mixing 0.25 µl of a protein solution (9.5 mg ml−1 in 50 mM Tris, pH 8.0, 300 mM NaCl, 1 mM DTT) in the presence of 5 mM CMP with 0.25 µl of 0.04 M potassium phosphate monobasic, 16% (wt/vol) PEG 8000 and 20% (vol/vol) glycerol (JCSG + commercial screening, Molecular Dimensions). Crystals were frozen under liquid nitrogen. Complete X-ray diffraction datasets were collected at the beamline i24 using a beam transmission equal to 16.33% (Diamond Light Source Synchrotron). Bcs3-CMP crystallized in the P 212121 space group with two molecules in the asymmetric unit and diffracted to a maximum resolution of 2.9 Å (Supplementary Table 3). Bcs3-DP2 was crystallized by mixing 0.25 µl of protein solution (9.5 mg ml−1 in 50 mM Tris, pH 8.0, 300 mM NaCl, 1 mM DTT) with 0.25 µl of 0.06 M magnesium chloride hexahydrate, 0.06 M calcium chloride dihydrate, 0.1 M Imidazole, 0.1 M MES monohydrate, pH 6.5, 37.5% (vol/vol) MPB, 37.5% (vol/vol) PEG 1000 and 37.5% (vol/vol) PEG 3350 (Morpheus, Molecular Dimensions). The crystal was plunged frozen in liquid nitrogen. Complete X-ray diffraction datasets were collected at the beamline iO4 using a beam transmission equal to 100% (Diamond Light Source Synchrotron). Bcs3-DP2 crystallized in the P 1 21 1 space group with four molecules in the asymmetric unit and diffracted to a maximum resolution of 3.6 Å (Supplementary Table 3). The dataset was processed in situ using the Automatic Software Pipeline available at Diamond Light Source Synchrotron, integrated and scaled through the xia2 system.
Bcs3 structure determination and refinement
The structure determination of the Bcs3-CMP complex was performed by molecular replacement methods implemented in Phaser55 and the PHENIX suite56. The initial coordinates of Bcs3 were generated by using a model predicted by AlphaFold as a search model19. The model rebuilding was carried out with Buccaneer57 and the CCP4 suite58. The final manual building was performed with Coot59 and refinement with phenix.refine60 and Refmac5 (ref. 61). The structure was validated by MolProbity62. The Bcs3-DP2 complex was solved by molecular replacement using the Bcs3-CMP complex as a search model. Data collection and refinement statistics are presented in Supplementary Table 3. Molecular graphics and structural analyses were performed with the UCSF Chimera package63.
Structural analysis and sequence alignment
Homologs of Bcs3 were identified by BLAST21, aligned using Clustal Omega64, MUSCLE65 or Cobalt66 and annotated using Jalview67. Structure-based sequence alignment analysis was performed using Chimera63. Protein pocket volume was calculated using HOLLOW68. Z-score values were produced using DALI69. Domain interface analysis was performed using PISA70. Conserved and similar residues were labeled using the BoxShade server (http://embnet.vital-it.ch/software/BOX_form.html).
Molecular substrate docking
The substrate binding analysis of the active sites has been performed for CroT, CrpP and CriT using a mixed approach based on (1) the experimental location of phosphate groups and/or metals at the active site, (2) placement of the substrate by homology and (3) the minimization of the substrate docked ligand using USCF Chimera. For CroT, the modeling of CDP-ribitol in the active site was achieved firstly by placing CDP-glycerol from the complex TagF mutant N444H (PDB 3L7K)14 by structural superposition. Then, the docked CDP-glycerol was modified to CDP-ribitol using UCSF Chimera editing tools and following the orientation of CMP and glycerol found in the CroT active site. Finally, docking was performed using a tight search box to minimize the position of CDP-ribitol using Vina. For CrpP, the terminal 5-phospho-d-ribose end of a 5-phospho-d-Ribf-β-(1→1)-d-Ribitol-5-phosphate was placed in the active site using the phosphate found in the structure as a reference. A guided docking was performed following the orientation of substrates in CrpP homologs: (1) the structure of pyridoxal phosphate phosphatase in complex with a non-hydrolyzable pyridoxal 5-phosphate analog and Mg2+ (PDB code 5AES) and (2) the structure of phosphoserine phosphatase in complex with phosphoserine (PDB 1L7P). The docking of the moiety was oriented according to the substrates in the homologous structures and minimized in Chimera. Finally, for CriT, the placing of a PRPP-Mg2+ complex at the active site was guided by the position of the two phosphates found in the active site, and docked by homology extracting a ligand displaying an overall consensus orientation from the following four PRTase homologs in complex with PRPP-Mg2+: hypoxanthine-guanine phosphoribosyltransferase (PDB code 1FSG), adenine phosphoribosyltransferase (PDB code 1ZN7), orotate phosphoribosyltransferase (PDB code 1LH0) and uracil phosphoribosyltransferase (PDB code 1JLS). All dockings were thoroughly inspected and confronted with the protein surfaces to verify the absence of clashes.
Statistics and reproducibility
Alignments were performed once with each algorithm, but separate algorithms, for example, MUSCLE, Cobalt and Clustal omega, yielded highly similar results regarding the critical amino acids targeted in this study. Phyre2 homology modeling was performed in 2015, 2018, 2019, 2020 and 2021 to reevaluate the results and include potential new entries to the PDB. All submissions yielded highly similar results. The detection of Hib polymer (Fig. 2f) using agglutination sera was repeated two times with similar results and the polymer structure was confirmed by NMR. Qualitative analysis of polymer production by different Bcs3 constructs or construct combinations (Fig. 3c–f and Supplementary Fig. 10) was performed at least three times with similar results. Constructs were purified once, each purification was documented by SDS-PAGE and separate Coomassie-stained gels displaying all constructs were included in this study (Supplementary Figs. 2 and 10a). CDP-ribitol was purified at least three times with similar results. Bcs3 activity in the presence of purified CDP-ribitol was analyzed twice by HPLC and a representative sample was chosen to be displayed by PAGE (Supplementary Fig. 3d). The crystallization construct was purified twice with similar results using the protocol shown in Supplementary Fig. 4. Four enzymatic syntheses of Hib polymer (yielding 5 mg, 2 × 30 mg and 50 mg of Hib polymer) were performed with similar results, and representative documentation of these reactions and the purification of the polymer are shown in Extended Data Fig. 2 and 7. The effect of ATP and UTP on the enzyme cascade was analyzed twice by HPLC and Alcian blue staining after PAGE (Extended Data Fig. 7g). Test hydrolysis of Hib polymer (Extended Data Fig. 7j) and purification and visualization of oligomers (Extended Data Fig. 7k) were performed once. To analyze the elongation mechanism of polymerase constructs, at least five reactions were performed at different donor-to-acceptor ratios with highly consistent results (Extended Data Fig. 8).
Reporting summary
Further information on research design is available in the Nature Portfolio Reporting Summary linked to this article.
Online content
Any methods, additional references, Nature Portfolio reporting summaries, source data, extended data, supplementary information, acknowledgements, peer review information, details of author contributions and competing interests, and statements of data and code availability are available at 10.1038/s41589-023-01324-3.
Supplementary information
Acknowledgements
This study was funded by Deutsche Forschungsgemeinschaft (DFG, German Research Foundation) under project 412824531 (to T.F.), MHH internal impact orientated funds (LOM) to the Institute of Clinical Biochemistry, the MINECO/FEDER EU under contracts PID2019-105649RB-I00 and Severo Ochoa Excellence Accreditation SEV-2016-0644 (to M.E.G.), the Basque Government under contracts KK-2021-00034 and KK-2022/00107 (to M.E.G.), and NIH R01AI149297 (to M.E.G.). We acknowledge Diamond Light Source (DLS; proposal MX28360), ALBA synchrotron beamline BL13-XALOC (proposal MX2018093013) and Swiss Light Source (proposal 20201980) for providing access to synchrotron radiation facilities. We gratefully acknowledge all members of the Structural Glycobiology Group (IIS-BioCruces, Spain) and the Institute of Clinical Biochemistry for valuable scientific discussions. We thank Jan Faix for kindly providing PreScission protease and Jana Führing for feedback on the manuscript.
Extended data
Source data
Author contributions
J.O.C, V.D.D. and A.M. crystallized proteins and collected X-ray data; J.O.C. and V.D.D. performed alpha fold modeling; J.O.C., V.D.D. and M.E.G. solved and analyzed the crystal structure; J.O.C. and C.D. performed structural and sequence analyses; J.S. and T.F. analyzed capsule gene clusters to identify homologs of Bcs3; A.B. purified all proteins, polymer and substrates; J.S., A.B., C.L., I.B., I.R.R. and M.B. generated mutants and performed activity assays; L.E. and H.T. purified CDP-ribitol, H.C. provided the Hib reference strain, agglutination sera and expertise regarding Hib biology; M.S. performed and analyzed NMR experiments; T.F. identified Bcs3, performed homology modeling and sequence analysis, and designed mutants, truncations and the cascade synthesis; J.O.C., V.D.D. and M.E.G. proposed the mechanism for Bcs3; R.G.S. helped with data interpretation and manuscript drafting. M.E.G. and T.F. conceived the project and oversaw the experimental work; J.O.C., J.S., V.D.D., C.D., M.S., M.E.G. and T.F. generated figures, tables and artwork; J.O.C., J.S., M.S., M.E.G. and T.F. wrote the initial draft of the manuscript. All authors discussed the results, gave feedback on the manuscript and made substantial contributions to the final version.
Peer review
Peer review information
Nature Chemical Biology thanks Mark von Itzstein, Helge Dorfmueller and the other, anonymous, reviewer(s) for their contribution to the peer review of this work.
Funding
Open access funding provided by Medizinische Hochschule Hannover (MHH).
Data availability
The atomic coordinates and structure factors have been deposited in the Protein Data Bank, accession codes 8A0C for the Bcs3-CMP complex and 8A0M for the Bcs3-DP2 complex. Data collection and refinement statistics are presented in Supplementary Table 3. PDB IDs used in the analysis of this work include 1ZN7, 1LH0, 1FSG, 1JLS, 1L7N, 5AES, 3VAY, 1L7P, 3L7K, 4ZHT, 6JDT, 4X1T, 5LEO, 6RJE, 1R77, 7BNH and 7SHG. Accession codes for sequences used in this study are available in Supplementary Table 4 and Extended Data Fig. 1. NMR chemical shifts are presented in Supplementary Tables 1 and 2. Source data are provided in separate files with this paper.
Competing interests
All the authors declare no competing interests.
Footnotes
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
These authors contributed equally: Javier O. Cifuente, Julia Schulze, Andrea Bethe, Valerio Di Domenico.
Contributor Information
Marcelo E. Guerin, Email: mrcguerin@gmail.com
Timm Fiebig, Email: fiebig.timm@mh-hannover.de.
Extended data
is available for this paper at 10.1038/s41589-023-01324-3.
Supplementary information
The online version contains supplementary material available at 10.1038/s41589-023-01324-3.
References
- 1.Hallström T, Riesbeck K. Haemophilus influenzae and the complement system. Trends Microbiol. 2010;18:258–265. doi: 10.1016/j.tim.2010.03.007. [DOI] [PubMed] [Google Scholar]
- 2.Branefors-Helander P, et al. Structural studies of the capsular antigen from Haemophilus influenzae type b. Acta Chem. Scand. 1976;30b:276–277. doi: 10.1016/s0008-6215(00)83845-8. [DOI] [PubMed] [Google Scholar]
- 3.Willis LM, Whitfield C. Structure, biosynthesis, and function of bacterial capsular polysaccharides synthesized by ABC transporter-dependent pathways. Carbohydr. Res. 2013;378:35–44. doi: 10.1016/j.carres.2013.05.007. [DOI] [PubMed] [Google Scholar]
- 4.Micoli F, et al. Glycoconjugate vaccines: current approaches towards faster vaccine design. Expert Rev. Vaccines. 2019;18:881–895. doi: 10.1080/14760584.2019.1657012. [DOI] [PubMed] [Google Scholar]
- 5.Verez-Bencomo V, et al. A synthetic conjugate polysaccharide vaccine against Haemophilus influenzae type b. Science. 2004;305:522–525. doi: 10.1126/science.1095209. [DOI] [PubMed] [Google Scholar]
- 6.Ojo LR, et al. Global use of Haemophilus influenzae type b conjugate vaccine. Vaccine. 2010;28:7117–7122. doi: 10.1016/j.vaccine.2010.07.074. [DOI] [PubMed] [Google Scholar]
- 7.del Bino L, et al. Synthetic glycans to improve current glycoconjugate vaccines and fight antimicrobial resistance. Chem. Rev. 2022;122:15672–15716. doi: 10.1021/acs.chemrev.2c00021. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Satola SW, Schirmer PL, Farley MM. Complete sequence of the cap locus of Haemophilus influenzae serotype b and nonencapsulated b capsule-negative variants. Infect. Immun. 2003;71:3639–3644. doi: 10.1128/IAI.71.6.3639-3644.2003. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.van Eldere J, et al. Region II of the Haemophilus influenzae type b capsulation locus is involved in serotype-specific polysaccharide synthesis. Mol. Microbiol. 1995;15:107–118. doi: 10.1111/j.1365-2958.1995.tb02225.x. [DOI] [PubMed] [Google Scholar]
- 10.Zolli, M., Kobric, D. J. & Brown, E. D. Reduction precedes cytidylyl transfer without substrate channeling in distinct active sites of the bifunctional CDP-ribitol synthase from Haemophilus influenzae. Biochemistry40, 5041–5048 (2001). [DOI] [PubMed]
- 11.Kunduru BR, Nair SA, Rathinavelan T. EK3D: An E. coli K antigen 3-dimensional structure database. Nucleic Acids Res. 2016;44:D675–D681. doi: 10.1093/nar/gkv1313. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Brown S, Santa Maria JP, Walker S. Wall teichoic acids of gram-positive bacteria. Annu. Rev. Microbiol. 2013;67:313–336. doi: 10.1146/annurev-micro-092412-155620. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Kelley LA, Mezulis S, Yates CM, Wass MN, Sternberg MJE. The Phyre2 web portal for protein modeling, prediction and analysis. Nat. Protoc. 2015;10:845–858. doi: 10.1038/nprot.2015.053. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Lovering AL, et al. Structure of the bacterial teichoic acid polymerase TagF provides insights into membrane association and catalysis. Nat. Struct. Mol. Biol. 2010;17:582–589. doi: 10.1038/nsmb.1819. [DOI] [PubMed] [Google Scholar]
- 15.Litschko C, et al. A new family of capsule polymerases generates teichoic acid-like capsule polymers in gram-negative pathogens. mBio. 2018;9:16017. doi: 10.1128/mBio.00641-18. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16.Burroughs AM, Allen KN, Dunaway-Mariano D, Aravind L. Evolutionary genomics of the HAD Superfamily: understanding the structural adaptations and catalytic diversity in a superfamily of phosphoesterases and allied enzymes. J. Mol. Biol. 2006;361:1003–1034. doi: 10.1016/j.jmb.2006.06.049. [DOI] [PubMed] [Google Scholar]
- 17.Rangarajan ES, Sivaraman J, Matte A, Cygler M. Crystal structure of d-ribose-5-phosphate isomerase (RpiA) from Escherichia coli. Proteins. 2002;48:737–740. doi: 10.1002/prot.10203. [DOI] [PubMed] [Google Scholar]
- 18.Litschko C, Budde I, Berger M, Fiebig T. Exploitation of capsule polymerases for enzymatic synthesis of polysaccharide antigens used in glycoconjugate vaccines. Methods Mol. Biol. 2021;2183:313–330. doi: 10.1007/978-1-0716-0795-4_16. [DOI] [PubMed] [Google Scholar]
- 19.Jumper J, et al. Highly accurate protein structure prediction with AlphaFold. Nature. 2021;596:583–589. doi: 10.1038/s41586-021-03819-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20.Whisstock JC, Lesk AM. SH3 domains in prokaryotes. Trends Biochem. Sci. 1999;24:132–133. doi: 10.1016/s0968-0004(99)01366-3. [DOI] [PubMed] [Google Scholar]
- 21.Boratyn GM, et al. BLAST: a more efficient report with usability improvements. Nucleic Acids Res. 2013;41:W29–W33. doi: 10.1093/nar/gkt282. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.Hove-Jensen B, et al. Phosphoribosyl diphosphate (PRPP): biosynthesis, enzymology, utilization, and metabolic significance. Microbiol Mol. Biol. Rev. 2017;81:e00040-16. doi: 10.1128/MMBR.00040-16. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.Scapin G, Grubmeyer C, Sacchettini JC. Crystal structure of orotate phosphoribosyltransferase. Biochemistry. 1994;33:1287–1294. doi: 10.1021/bi00172a001. [DOI] [PubMed] [Google Scholar]
- 24.González-Segura L, Witte JF, McClard RW, Hurley TD. Ternary complex formation and induced asymmetry in orotate phosphoribosyltransferase. Biochemistry. 2007;46:14075–14086. doi: 10.1021/bi701023z. [DOI] [PubMed] [Google Scholar]
- 25.Kuznetsova E, et al. Functional diversity of haloacid dehalogenase superfamily phosphatases from saccharomyces cerevisiae: biochemical, structural, and evolutionary insights. J. Biol. Chem. 2015;290:18678–18698. doi: 10.1074/jbc.M115.657916. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26.Albesa-Jové D, Giganti D, Jackson M, Alzari PM, Guerin ME. Structure-function relationships of membrane-associated GT-B glycosyltransferases. Glycobiology. 2014;24:108–124. doi: 10.1093/glycob/cwt101. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27.Litschko C, et al. Mix-and-match system for the enzymatic synthesis of enantiopure glycerol-3-phosphate-containing capsule polymer backbones from Actinobacillus pleuropneumoniae, Neisseria meningitidis, and Bibersteinia trehalosi. mBio. 2021;12:e0089721. doi: 10.1128/mBio.00897-21. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28.Mayer BJ, Eck MJ. SH3 domains: minding your p’s and q’s. Curr. Biol. 1995;5:364–367. doi: 10.1016/s0960-9822(95)00073-x. [DOI] [PubMed] [Google Scholar]
- 29.Fiebig T, et al. Efficient solid-phase synthesis of meningococcal capsular oligosaccharides enables simple and fast chemoenzymatic vaccine production. J. Biol. Chem. 2018;293:953–962. doi: 10.1074/jbc.RA117.000488. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30.Budde I, et al. An enzyme-based protocol for cell-free synthesis of nature-identical capsular oligosaccharides from Actinobacillus pleuropneumoniae serotype 1. J. Biol. Chem. 2020;295:5771–5784. doi: 10.1074/jbc.RA120.012961. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31.Whitfield C, Wear SS, Sande C. Assembly of bacterial capsular polysaccharides and exopolysaccharides. Annu. Rev. Microbiol. 2020;74:521–543. doi: 10.1146/annurev-micro-011420-075607. [DOI] [PubMed] [Google Scholar]
- 32.Osawa T, et al. Crystal structure of chondroitin polymerase from Escherichia coli K4. Biochem. Biophys. Res. Commun. 2009;378:10–14. doi: 10.1016/j.bbrc.2008.08.121. [DOI] [PubMed] [Google Scholar]
- 33.Lizak C, et al. X-ray crystallographic structure of a bacterial polysialyltransferase provides insight into the biosynthesis of capsular polysialic acid. Sci. Rep. 2017;7:5842. doi: 10.1038/s41598-017-05627-z. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34.Muindi KM, et al. Characterization of the meningococcal serogroup X capsule N-acetylglucosamine-1-phosphotransferase. Glycobiology. 2014;24:139–149. doi: 10.1093/glycob/cwt091. [DOI] [PubMed] [Google Scholar]
- 35.Sperisen P, Schmid CD, Bucher P, Zilian O. Stealth proteins: in silico identification of a novel protein family rendering bacterial pathogens invisible to host immune defense. PLoS Comput. Biol. 2005;1:0492–0499. doi: 10.1371/journal.pcbi.0010063. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36.Maloney FP, et al. Structure, substrate recognition and initiation of hyaluronan synthase. Nature. 2022;604:195–201. doi: 10.1038/s41586-022-04534-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 37.May JF, Splain RA, Brotschi C, Kiessling LL. A tethering mechanism for length control in a processive carbohydrate polymerization. Proc. Natl Acad. Sci. USA. 2009;106:11851–11856. doi: 10.1073/pnas.0901407106. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 38.Kari J, et al. Sabatier principle for interfacial (heterogeneous) enzyme catalysis. ACS Catal. 2018;8:11966–11972. [Google Scholar]
- 39.Yakovlieva L, Walvoort MTC. Processivity in bacterial glycosyltransferases. ACS Chem. Biol. 2020;15:3–16. doi: 10.1021/acschembio.9b00619. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 40.Nussinov R, Tsai CJ, Ma B. The underappreciated role of allostery in the cellular network. Annu. Rev. Biophys. 2013;42:169–189. doi: 10.1146/annurev-biophys-083012-130257. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 41.Chen L, et al. Cryo-electron microscopy structure and transport mechanism of a wall teichoic acid ABC transporter. mBio. 2020;11:e02749-19. doi: 10.1128/mBio.02749-19. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 42.Bi Y, Mann E, Whitfield C, Zimmer J. Architecture of a channel-forming O-antigen polysaccharide ABC transporter. Nature. 2018;553:361–365. doi: 10.1038/nature25190. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 43.Phanphak S, et al. Super-resolution fluorescence microscopy study of the production of K1 Capsules by Escherichia coli: evidence for the differential distribution of the capsule at the poles and the equator of the cell. Langmuir. 2019;35:5635–5646. doi: 10.1021/acs.langmuir.8b04122. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 44.Kelly SD, et al. The biosynthetic origin of ribofuranose in bacterial polysaccharides. Nat Chem. Biol. 2022;18:530–537. doi: 10.1038/s41589-022-01006-6. [DOI] [PubMed] [Google Scholar]
- 45.Guerardel Y, et al. Complete structure of the enterococcal polysaccharide antigen (EPA) of vancomycin-resistant enterococcus faecalis V583 reveals that EPA decorations are teichoic acids covalently linked to a rhamnopolysaccharide backbone. mBio. 2020;11:e00277–20. doi: 10.1128/mBio.00277-20. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 46.van Dalen R, et al. Do not discard Staphylococcus aureus WTA as a vaccine antigen. Nature. 2019;572:E1–E2. doi: 10.1038/s41586-019-1416-8. [DOI] [PubMed] [Google Scholar]
- 47.Gerlach D, et al. Methicillin-resistant Staphylococcus aureus alters cell wall glycosylation to evade immunity. Nature. 2018;563:705–709. doi: 10.1038/s41586-018-0730-x. [DOI] [PubMed] [Google Scholar]
- 48.Kuwabara N, et al. Crystal structures of fukutin-related protein (FKRP), a ribitol-phosphate transferase related to muscular dystrophy. Nat. Commun. 2020;11:303. doi: 10.1038/s41467-019-14220-z. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 49.Oldrini D, et al. Combined chemical synthesis and tailored enzymatic elongation provide fully synthetic and conjugation-ready neisseria meningitidis serogroup X vaccine antigens. ACS Chem. Biol. 2018;13:984–994. doi: 10.1021/acschembio.7b01057. [DOI] [PubMed] [Google Scholar]
- 50.Fiebig T, et al. Molecular cloning and functional characterization of components of the capsule biosynthesis complex of Neisseria meningitidis serogroup A: toward in vitro vaccine production. J. Biol. Chem. 2014;289:19395–19407. doi: 10.1074/jbc.M114.575142. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 51.Fiebig T, et al. Functional expression of the capsule polymerase of Neisseria meningitidis serogroup X: a new perspective for vaccine development. Glycobiology. 2014;24:150–158. doi: 10.1093/glycob/cwt102. [DOI] [PubMed] [Google Scholar]
- 52.Bond SR, Naus CC. RF-Cloning.org: an online tool for the design of restriction-free cloning projects. Nucleic Acids Res. 2012;40:W209–W213. doi: 10.1093/nar/gks396. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 53.Liu H, Naismith JH. An efficient one-step site-directed deletion, insertion, single and multiple-site plasmid mutagenesis protocol. BMC Biotechnol. 2008;8:91. doi: 10.1186/1472-6750-8-91. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 54.Markley JL, et al. Recommendations for the presentation of NMR structures of proteins and nucleic acids. IUPAC-IUBMB-IUPAB Inter-Union Task Group on the Standardization of Data Bases of Protein and Nucleic Acid Structures Determined by NMR Spectroscopy. J. Biomol. NMR. 1998;12:1–23. doi: 10.1023/a:1008290618449. [DOI] [PubMed] [Google Scholar]
- 55.McCoy AJ, et al. Phaser crystallographic software. J. Appl. Crystallogr. 2007;40:658–674. doi: 10.1107/S0021889807021206. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 56.Adams PD, et al. PHENIX: a comprehensive Python-based system for macromolecular structure solution. Acta Crystallogr. D Biol. Crystallogr. 2010;66:213–221. doi: 10.1107/S0907444909052925. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 57.Cowtan K. The Buccaneer software for automated model building. 1. Tracing protein chains. Acta Crystallogr. D Biol. Crystallogr. 2006;62:1002–1011. doi: 10.1107/S0907444906022116. [DOI] [PubMed] [Google Scholar]
- 58.Winn MD, et al. Overview of the CCP4 suite and current developments. Acta Crystallogr. D Biol. Crystallogr. 2011;67:235–242. doi: 10.1107/S0907444910045749. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 59.Emsley P, Lohkamp B, Scott WG, Cowtan K. Features and development of Coot. Acta Crystallogr D. Biol. Crystallogr. 2010;66:486–501. doi: 10.1107/S0907444910007493. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 60.Afonine PV, et al. Towards automated crystallographic structure refinement with phenix.refine. Acta Crystallogr. D Biol. Crystallogr. 2012;68:352–367. doi: 10.1107/S0907444912001308. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 61.Murshudov GN, et al. REFMAC5 for the refinement of macromolecular crystal structures. Acta Crystallogr. D Biol. Crystallogr. 2011;67:355–367. doi: 10.1107/S0907444911001314. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 62.Chen VB, et al. MolProbity: all-atom structure validation for macromolecular crystallography. Acta Crystallogr D. Biol. Crystallogr. 2010;66:12–21. doi: 10.1107/S0907444909042073. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 63.Pettersen EF, et al. UCSF Chimera–a visualization system for exploratory research and analysis. J. Comput. Chem. 2004;25:1605–1612. doi: 10.1002/jcc.20084. [DOI] [PubMed] [Google Scholar]
- 64.Sievers F, et al. Fast, scalable generation of high-quality protein multiple sequence alignments using Clustal Omega. Mol. Syst. Biol. 2011;7:539. doi: 10.1038/msb.2011.75. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 65.Madeira F, et al. Search and sequence analysis tools services from EMBL-EBI in 2022. Nucleic Acids Res. 2022 doi: 10.1093/nar/gkac240. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 66.Papadopoulos JS, Agarwala R. COBALT: constraint-based alignment tool for multiple protein sequences. Bioinformatics. 2007;23:1073–1079. doi: 10.1093/bioinformatics/btm076. [DOI] [PubMed] [Google Scholar]
- 67.Waterhouse AM, Procter JB, Martin DMA, Clamp M, Barton GJ. Jalview version 2–a multiple sequence alignment editor and analysis workbench. Bioinformatics. 2009;25:1189–1191. doi: 10.1093/bioinformatics/btp033. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 68.Ho BK, Gruswitz F. HOLLOW: generating accurate representations of channel and interior surfaces in molecular structures. BMC Struct. Biol. 2008;8:49. doi: 10.1186/1472-6807-8-49. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 69.Holm L, Rosenström P. Dali server: conservation mapping in 3D. Nucleic Acids Res. 2010;38:W545–W549. doi: 10.1093/nar/gkq366. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 70.Krissinel E, Henrick K. Inference of macromolecular assemblies from crystalline state. J. Mol. Biol. 2007;372:774–797. doi: 10.1016/j.jmb.2007.05.022. [DOI] [PubMed] [Google Scholar]
- 71.Willis LM, Whitfield C. KpsC and KpsS are retaining 3-deoxy-D-manno-oct-2-ulosonic acid (Kdo) transferases involved in synthesis of bacterial capsules. Proc. Natl Acad. Sci. USA. 2013;110:20753–20758. doi: 10.1073/pnas.1312637110. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 72.McNulty C, et al. The cell surface expression of group 2 capsular polysaccharides in Escherichia coli: the role of KpsD, RhsA and a multi-protein complex at the pole of the cell. Mol. Microbiol. 2006;59:907–922. doi: 10.1111/j.1365-2958.2005.05010.x. [DOI] [PubMed] [Google Scholar]
- 73.Rigg GP, Barrett B, Roberts I. The localization of KpsC, S and T, and KfiA, C and D proteins involved in the biosynthesis of the Escherichia coli K5 capsular polysaccharide: evidence for a membrane-bound complex. Microbiology. 1998;144:2905–2914. doi: 10.1099/00221287-144-10-2905. [DOI] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
The atomic coordinates and structure factors have been deposited in the Protein Data Bank, accession codes 8A0C for the Bcs3-CMP complex and 8A0M for the Bcs3-DP2 complex. Data collection and refinement statistics are presented in Supplementary Table 3. PDB IDs used in the analysis of this work include 1ZN7, 1LH0, 1FSG, 1JLS, 1L7N, 5AES, 3VAY, 1L7P, 3L7K, 4ZHT, 6JDT, 4X1T, 5LEO, 6RJE, 1R77, 7BNH and 7SHG. Accession codes for sequences used in this study are available in Supplementary Table 4 and Extended Data Fig. 1. NMR chemical shifts are presented in Supplementary Tables 1 and 2. Source data are provided in separate files with this paper.