Abstract
Biofilms are community structures of bacteria enmeshed in a self-produced matrix of exopolysaccharides. The biofilm matrix serves numerous roles, including resilience and persistence, making biofilms a subject of research interest among persistent clinical pathogens of global health importance. Our current understanding of the underlying biochemical pathways responsible for biosynthesis of these exopolysaccharides is largely limited to Gram-negative bacteria. Clostridia are a class of Gram-positive, anaerobic and spore-forming bacteria and include the important human pathogens Clostridium perfringens, Clostridium botulinum and Clostridioides difficile, among numerous others. Several species of Clostridia have been reported to produce a biofilm matrix that contains an acetylated glucan linked to a series of hypothetical genes. Here, we propose a model for the function of these hypothetical genes, which, using homology modelling, we show plausibly encode a synthase complex responsible for polymerization, modification and export of an O-acetylated cellulose exopolysaccharide. Specifically, the cellulose synthase is homologous to that of the known exopolysaccharide synthases in Gram-negative bacteria. The remaining proteins represent a mosaic of evolutionary lineages that differ from the described Gram-negative cellulose exopolysaccharide synthases, but their predicted functions satisfy all criteria required for a functional cellulose synthase operon. Accordingly, we named these hypothetical genes ccsZABHI, for the Clostridial cellulose synthase (Ccs), in keeping with naming conventions for exopolysaccharide synthase subunits and to distinguish it from the Gram-negative Bcs locus with which it shares only a single one-to-one ortholog. To test our model and assess the identity of the exopolysaccharide, we subcloned the putative glycoside hydrolase encoded by ccsZ and solved the X-ray crystal structure of both apo- and product-bound CcsZ, which belongs to glycoside hydrolase family 5 (GH-5). Although not homologous to the Gram-negative cellulose synthase, which instead encodes the structurally distinct BcsZ belonging to GH-8, we show CcsZ displays specificity for cellulosic materials. This specificity of the synthase-associated glycosyl hydrolase validates our proposal that these hypothetical genes are responsible for biosynthesis of a cellulose exopolysaccharide. The data we present here allowed us to propose a model for Clostridial cellulose synthesis and serves as an entry point to an understanding of cellulose biofilm formation among class Clostridia.
Introduction
Biofilms are communities of microorganisms that reside in an extracellular matrix. This extracellular matrix is produced by the community itself and is composed largely of secreted exopolysaccharides. Biofilms are among the most successful and widely distributed forms of life on Earth, enabling bacteria to adhere to, colonize and persist on a wide variety of surfaces or interfaces. The production of the biofilm extracellular matrix represents a significant resource cost to the organisms producing it, rationalized by the many benefits the biofilm matrix confers. For example, the biofilm matrix serves in resource capture [1,2], accelerated cell growth [3] and the tolerance of stressors or disinfectants, such as shearing forces [2,3], desiccation [2,4], antimicrobial compounds [2,5], extreme temperature [2,5] or sanitizing agents [2,5,6]. Production of an extracellular matrix composed of the exopolysaccharide cellulose has been described in many species of clinical or economic importance, including Acetobacter [7], Clostridium (previously Sarcina) [7,8], Rhizobium [7], Agrobacterium [7], Cronobacter [9], Salmonella [10] and Pseudomonas [11], among others. The production of a linear β-(1–4)-glucan polymer by these bacteria is accompanied by the derivatization of the polysaccharide into novel biomaterials suited to the needs of the particular organism producing it and the niche being colonized [12]. These exopolysaccharide modifications enhance bacterial persistence and have been identified as a virulence factor under some circumstances [13].
The biosynthesis and export of exopolysaccharides in bacteria is carried out by synthase-dependent systems, encoded on partially conserved operons [14]. A survey of bacterial genomes encoding putative exopolysaccharide synthase operons demonstrated that the evolution of these operon-encoded processes apparently involves multiple gain, loss, duplication, and horizontal transfer events [15]. As a result, these operons are often highly mosaic and frequently involve the acquisition of their components from other biochemical pathways, making their annotation challenging without detailed structural and functional characterization of each synthase component.
At present, an understanding of the synthase responsible for bacterial cellulose biosynthesis has been largely limited to the study of Gram-negative model organisms. The Gram-negative bacterial cellulose synthesis (Bcs) complex is encoded by the bcsABCZ locus and represents the essential biosynthetic machinery required for production of the β-(1–4)-glucan polymer [16] that is subsequently derivatized by at least three distinct accessory cellulose modification systems described in Gram-negative bacteria [11,17–19]. The proposed functions of the bcsABCZ gene products were largely inferred from structural and functional studies of other Gram-negative exopolysaccharide systems (e.g., the poly-β-(1–6)-N-acetyl-D-gluccoasmine (PNAG) biosynthetic system from Escherichia coli [20] and Staphylococcal species [21–23] or the alginate biosynthesis pathway from Pseudomonas aeruginosa [24]). Subsequently, structural and functional studies of the Bcs gene products have since provided insight into their functional roles. In these systems, biosynthesis of the glucan polymer is carried out by the family 2 glycosyltransferase (GT-2) BcsA [25]. The processive action of BcsA uses UDP-glucose as a substrate to successively add the sugars to the growing chain, while concomitantly transporting the elongated chain through a pore in BcsA to the periplasm, where it interacts with the carbohydrate-binding module (CBM) contained within BcsB [25,26]. Although the mechanism whereby the nascent cellulose chain is exported from the cell has not been directly elucidated, the BcsC protein is proposed to contain both a β-barrel domain and a tetratricopeptide repeat (TPR) domain to facilitate both the assembly of the Bcs complex and the efficient export of the polymer to the extracellular space [14]. This model is in line with the alginate biosynthesis pathway, where AlgE forms a β-barrel necessary for exopolysaccharide export and AlgK is a TPR-containing protein [27,28]. In the alginate system, release of the polymer from the cell is accomplished by the action of an alginate lyase AlgL [29]. In the Bcs system, this role is accomplished by the family 8 glycoside hydrolase (GH-8) BcsZ [30–32].
Evidence for the production of a cellulose biofilm by the Gram-positive and strictly anaerobic bacterium Clostridium ventriculi (formerly Sarcina ventriculi) was first reported by Canale-Parola and coworkers [8]. Cultures of C. ventriculi were observed to form dense microaggregates embedded in an extracellular matrix composed of cellulose and this cellulosic matrix was absent on the surface of planktonic cells not participating in these microaggregates. More recently, a structurally similar biofilm composed of cellular microaggregates was described in C. difficile 630Δerm [33]. Transcriptomic analysis of these microaggregates suggest they are responsive to c-di-GMP because they are associated with the change in expression of a series of diguanylate cyclases and phosphodiesterases, which is a hallmark of the motile-to-sessile transition culminating in biofilm formation. Further, the biofilm-associated transcriptomic changes observed could be partially abrogated by a polar deletion affecting two putative transcriptional regulators. These regulators in C. difficile share homology with the Bacillus subtilis SinR/SinI pair, a known repressor/anti-repressor pair controlling the transcriptional landscape underlying biofilm formation. To date, numerous other Clostridia have been reported to form similar biofilms, including the important human pathogens Clostridium perfringens and Clostridium botulinum, a phenomenon that has been reviewed by others [34]. Only recently was the Pel polysaccharide biosynthetic gene cluster identified in Gram-positive bacteria, including Clostridia [35]. However, the operon encoding the cellulose exopolysaccharide synthase components has not been identified in any of these species and the specific requirements for biofilm formation in Clostridia remain largely unexplored.
The class Clostridia contains numerous pathogens of global health importance. C. difficile is one of many such pathogens that is a global threat to public health. C. difficile infection poses significant morbidity and mortality to all populations worldwide and among individuals beyond the groups traditionally recognized as at-risk (e.g. the elderly, those under hospital care or those undergoing antimicrobial therapy) [36–38]. The burden of C. difficile diarrheal disease is also not limited to developing nations; in the United States, C. difficile caused an estimated 453,000 infections resulting in 29,300 deaths in 2012 alone [37]. Although healthcare-associated costs are challenging to estimate, this burden of C. difficile infection results in an estimated cost of approximately $6 billion US dollars a year in the United States alone. This data is notwithstanding the other important human pathogens belonging to class Clostridia.
To identify the Clostridial cellulose synthase operon, we surveyed available sequence databases for predicted orthologues of the Gram-negative cellulose synthase components. We identified five hypothetical genes sharing an immediate genomic context that satisfy all known criteria for a cellulose exopolysaccharide synthase, where the cellulose polymerase shares direct homology with the Gram-negative operon, while the remaining subunits represent a mosaic of evolutionary lineages. Furthermore, the composition of this hypothetical gene cluster is conserved within some members of Clostridia. We designate this gene cluster herein as the Clostridial cellulose synthase (ccsZABHI) to distinguish it from the Gram-negative Bcs locus. Based on sequence annotation and homology modelling, the locus appears to encode a glycosyltransferase (CcsA), but demonstrates a divergent glycoside hydrolase (CcsZ) and a carbohydrate-binding domain that may aid in polymer translocation (CcsB), with the notable absence of an equivalent to the Gram-negative outer membrane/TPR component (BcsC). Additionally, sequence homology and modelling predicted that the adjacent ccsH and ccsI may encode a complex for O-acetylation of the cellulose exopolysaccharide, with a distinct evolutionary origin from the otherwise functionally equivalent proteins in Gram-negative bacteria. To investigate if ccsZABHI is responsible for cellulose synthesis, we selected the putative glycoside hydrolase (CcsZ) for cloning and expression, as the specificity of this enzyme would provide information about the polymer produced by the operon and for which a structural and functional characterization would be straightforward. The model of the Clostridial cellulose synthesis system we propose here, which was directly corroborated by our studies on CcsZ, represents an entry point to an understanding of cellulose biofilm formation by Gram-positive bacteria and enables future structural and functional studies of other Clostridial cellulose synthesis subunits.
Materials and methods
Identification of ccsZABHI
Initial identification of the Clostridial cellulose synthase was carried out by searching the NCBI database (https://www.ncbi.nlm.nih.gov) for sequences annotated as BcsA orthologues or by BlastP search (https://blast.ncbi.nlm.nih.gov/Blast.cgi) using the E. coli K12 BcsA sequence as a query and filtering results for sequences from Clostridia only. We then identified other putative ccs ORFs based upon their genomic proximity to BcsA orthologues and through functional annotation, BlastP searches and homology modelling on the PHYRE2 (http://www.sbg.bio.ic.ac.uk/phyre2/) or SWISS-MODEL (https://swissmodel.expasy.org/docs/references) servers [39,40] to ascertain their likely biochemical roles. Sequence alignments and identities were calculated using the BlastP tool. The topology of CcsB was predicted using the TMHHM server, v.2.0 (http://www.cbs.dtu.dk/services/TMHMM/) [41].
Cloning
The full-length sequence of the hypothetical exopolysaccharide synthesis-associated glycoside hydrolase from C. difficile (GenPept accession WP_077724661.1), which we designated ccsZ, was predicted to have an N-terminal signal sequence according to the PSort server [42] that remains as a single transmembrane anchor in the mature protein. A ccsZ plasmid construct lacking the transmembrane region (amino acids 1–29) was synthesized by BioBasic, Inc. (Markham, ON, Canada) in a pET21 vector background so that the expressed soluble portion of the protein (amino acids 30–340) would contain a C-terminal hexahistidine tag (pCcsZ-His6).
CcsZ expression and purification
The expression host, E. coli BL21-CodonPlus, was transformed with pCcsZ-His6 and cells were grown in rich media (32 g tryptone, 20 g yeast extract and 10 g NaCl) containing ampicillin (100 μg mL-1) at 37°C with shaking. Expression was induced with IPTG (1 mM) after cultures reached an OD600 of at least 0.6 and growth was allowed to continue 16–20 h at 23°C. The cells were collected by centrifugation at 4300 x g for 15 min and 4°C. The pellet was resuspended in buffer A (50 mM Tris-HCl pH 7.5, 300 mM NaCl) and cells were lysed with a cell disruptor (Constant Systems) operating at 17 kpsi. Cell lysate was separated by centrifugation at 28000 x g for 45 min at 4°C and the supernatant was loaded onto a column containing 2 mL of Ni-NTA agarose (Qiagen) equilibrated with buffer A for 1 h for purification by nickel affinity chromatography. The Ni-NTA column was washed in 50 mL each of buffer A and buffer A with the addition of 20 mM imidazole-HCl, followed by a wash in 50 mL of buffer A with the addition of 50 mM imidazole-HCl. The protein was eluted from nickel affinity chromatography columns using 25 mL of buffer A with the addition of 500 mM imidazole-HCl. Secondary purification was performed by anion exchange chromatography using a HiTrap Q FF 5 mL prepacked column (GE Healthcare) installed on an Akta Start instrument (GE Healthcare). Protein eluted from Ni-NTA resin was dialyzed against anion buffer A (50 mM sodium phosphate pH 7.5) for 12–16 h and passed over the column three times to promote binding. CcsZ-His6 was eluted from the column using a gradient of 0–100% anion buffer B (50 mM sodium phosphate pH 7.5, 1 M NaCl). The purity of protein obtained from both nickel affinity chromatography and anion exchange chromatography was routinely assessed using SDS-PAGE. Where needed, purified protein was concentrated in a centrifugal filter unit (Pall Corporation) with a nominal MWCO of 10,000 Da.
Crystallization and structure determination
Crystallization of CcsZ was carried out at 20°C using the hanging drop vapour diffusion method. Apo-CcsZ crystals were obtained from drops containing 100 mM Tris-HCl pH 8.5, 200 mM calcium acetate and 25% (v/v) poly-ethylene glycol 4000 in a 1:1 ratio with CcsZ protein concentrated to 12.5 mg/mL. Product-bound crystals were obtained from drops containing 100 mM MES pH 6.0, 200 mM calcium acetate and 20% (v/v) poly-ethylene glycol 8000 (PEG 8000) in a 1:1 ratio with CcsZ concentrated to 12.5 mg/mL. Cellotriose was introduced to the drop at a concentration of 5 mM following crystal growth and to cryoprotectants (crystallization buffer supplemented to 30% (v/v) with PEG 8000) just prior to harvesting and vitrification in liquid nitrogen. Data were collected on beamline 08-ID1 at the Canadian Light Source synchrotron. A total of 1250 images of 0.2° Δφ oscillations were collected using incident radiation with a wavelength of 1.0 Å for the apo-CcsZ crystal and a total of 1800 images of 0.2° Δφ oscillations were collected using incident radiation with a wavelength of 1.0 Å for the product-bound crystal. Collected data was processed with XDS [43]. A P 21 space group was determined with a single copy of CcsZ in the asymmetric unit using POINTLESS, then scaled using SCALA and data reduction performed using CTRUNCATE [44]. The processed data was solved using the molecular replacement technique with the Phaser tool in PHENIX [45] using TmCel5A (PDB ID 3AMD) as a search model. Structural refinement was performed using iterative rounds of automated refinement using the Refine tool in PHENIX, followed by manual refinement and checking of waters using Coot [46]. Refinement progress was monitored by the reduction and convergence of Rwork and Rfree. Ligand fitting was performed by LigandFit in PHENIX, followed by calculation of an omit map by Polder in PHENIX. Ligand real-space refinement was performed in Coot using the Polder map and validation was performed using MolProbity in PHENIX. Structure interface analysis was performed using the PDBePISA server (https://www.ebi.ac.uk/pdbe/pisa/pistart.html) [47]. Figures were prepared with PyMOL.
Glycoside hydrolase activity assays
Activity was initially assessed qualitatively by a plate-based zymogram analysis. Zymogram matrices in agar plates were prepared by heating a solution of 1.5% (w/v) agar with individual carbohydrates (0.8% (w/v) for CMC, xylan and hydroxyethylcellulose; 0.4% for curdlan) to 100°C for 10 min. Spotted on the agar plates were 0.5 mg each of purified CcsZ, cellulase cocktail from Trichoderma reesei (Sigma; C8546) and bovine serum albumin (BSA; BioShop). BSA served as the negative control and the T. reesei cellulase cocktail served as the positive control. The zymograms were incubated either 1 h or 24 h, as specified, at 37°C. Following incubation, plates were stained in 5 mL of a 0.1% (w/v) solution of Congo Red for 1 h and destained in three 5 mL volumes of a 1 M solution of NaCl for 1 h total.
The activity of CcsZ was quantitatively monitored using the dinitrosalicylic acid (DNS) method. Substrates tested included CMC, xylan, arabinoxylan, β-glucan, lichenin, xylan and curdlan. All carbohydrates not purchased from Sigma as noted above were supplied by Megazyme. All substrates were tested at 8.3 g/L, with the exception of curdlan that was tested at 0.75 g/L due to solubility constraints. CcsZ (20 μM) was mixed with substrate in assay buffer (50 mM sodium acetate pH 4.5) for 30 min at 37°C. Reactions were stopped with the addition of 56.3 μL of DNS reagent (1% w/v dinitrosalicylic acid, 4 mM sodium sulfite, 250 mM NaOH) and were heated at 90°C for 15 min. A volume of 18.75 μL of (40% w/v) sodium potassium tartrate was added to stabilize the DNS reaction. The resulting solutions were measured for absorbance at 575 nm in a Cytation5 imaging microplate reader (BioTek). Absorbance values were related to millimolar reducing sugar equivalents using a standard curve of D-glucose (BioShop) measured using the same DNS method. Buffers used for the pH assay included sodium acetate (pH 4.0–5.5), MES (pH 5.5–7), sodium phosphate (pH 7.0–8.0), Tris-HCl (pH 8.0–9.0) and CHES (pH 9.0–10) in increments of 0.5 pH units with each buffer at 50 mM final concentration.
The endo- vs exo- preference of CcsZ was evaluated using the Megazyme Cellulase Assay Kit (K-CELLG5-4V). Thermostable exo-β-glucosidase (provided by the manufacturer) was added to a solution containing 4,6-(3-ketobutylidene)-4-nitrophenyl-β-D-cellopentaoside in 10% (v/v) dimethylsulfoxide and 0.02% (w/v) sodium azide. A 100 μL amount of this solution was pre-incubated at 40°C then inoculated with 100 μL of CcsZ solution (prewarmed to 40°C and containing 15 μM enzyme) in acetate buffer (100 mM, pH 4.5) containing 1 mg/mL bovine serum albumin. This process was performed in parallel with the cellulase cocktail from Trichoderma reesei as a positive control and CcsZ without exo-β-glucosidase as a negative control. All reactions were incubated at 40°C for 10 min and then 3 mL of stopping reagent (2% (w/v) Tris buffer, pH 10) was added. A volume of 100 μL of the solution was then transferred to a 96-well plate where the absorbance was recorded at 400 nm using a Citation 5 Imaging Reader (BioTek). All model fitting and statistical analyses were performed using GraphPad Prism v8.4.3.
LC-MS
Liquid chromatography–mass spectrometry analyses were performed at the Mass Spectrometry Facility of the Advanced Analysis Centre, University of Guelph on an Agilent 1200 HPLC liquid chromatograph interfaced with an Agilent UHD 6530 Q-TOF mass spectrometer. A C18 column (Agilent Extend-C18 50 mm x 2.1 mm 1.8 μm) was used for chromatographic separation. The following solvents were used for separation: water with 0.1% (v/v) formic acid (A) and acetonitrile with 0.1% (v/v) formic acid (B). The initial mobile phase conditions were 10% B, hold for 1 min and then increase to 100% B in 29 min. These steps were followed by a column wash at 100% B for 5 min and a 20 min re-equilibration. The mass spectrometer electrospray capillary voltage was maintained at 4.0 kV and the drying gas temperature at 250°C with a flow rate of 8 L/min. Nebulizer pressure was 30 psi and the fragmentor was set to 160. Nozzle, skimmer and octapole RF voltages were set at 1000 V, 65 V and 750 V, respectively. Nitrogen (> 99%) was used as the nebulizing, drying and collision gas. The mass-to-charge ratio was scanned across the m/z range of 50–3000 m/z in 4 GHz (extended dynamic range) positive and negative ion modes. Data were collected by data independent MS/MS acquisition with an MS and MS/MS scan rate of 1.41 spectra/sec. The acquisition rate was set at 2 spectra/s. The mass axis was calibrated using the Agilent tuning mix HP0321 (Agilent Technologies) prepared in acetonitrile. Mass spectrometer control, data acquisition and data analysis were performed with MassHunter® Workstation software (B.04.00).
Results and discussion
Identification of the Clostridial cellulose synthase
We searched the NCBI Protein database for hypothetical proteins that were functionally annotated as cellulose synthases, or which showed homology to known Gram-negative BcsA proteins among the class Clostridia. We identified sequences functionally annotated as putative cellulose synthase catalytic subunits in C. difficile (GenBank CDIF630_RS14050), Clostridium vincentii (PRR81681.1), Clostridium chromireducens (OPJ65957.1) and Clostridium oryzae (OPJ56113.1), among others (Fig 1; orange). The functional annotations are based on sequence similarity to the GT-2 domain of E. coli BcsA (UniProt P37653) and the presence of a predicted PilZ domain for binding c-di-GMP, which is a hallmark of synthase-dependent exopolysaccharide catalytic subunits. These features were a promising indication that these proteins were orthologues of BcsA. Homology modelling of these hypothetical proteins also suggested they belong to GT-2, with the structure of BcsA from Rhodobacter sphaeroides (PDB ID 4HG6) selected as the top scoring homologue and the template for homology modelling in all cases. To distinguish this locus from the Gram-negative cellulose synthase, we named it the Clostridial cellulose synthase subunit A, or ccsA. However, homologs of the remaining Bcs proteins from Gram-negative bacteria (BcsB, BcsC, and BcsZ) were notably absent. Instead, we assessed the genomic context of ccsA to identify hypothetical genes that may be functionally equivalent to these proteins, but could have arisen from divergent evolution of the Gram-positive cellulose synthase operon. Exopolysaccharide synthase operon evolution has been shown to involve multiple loss, duplication, rearrangement, and horizontal transfer events, resulting in highly mosaic operon classes distributed among bacterial phyla. Accordingly, we did not find it surprising that the Clostridial cellulose synthase shared no apparent evolutionary relationship with the known Gram-negative operons given the significant phylogenetic distance between the species in question [15].
A survey of the genomic context of ccsA sequences demonstrated a series of four other hypothetical genes that always occurred with ccsA (Fig 1). These genes included those that are annotated as a glycosyl hydrolase (endocellulase) (Fig 1; red), a gene of unknown function (Fig 1; yellow), a membrane-bound O-acyltransferase family protein (Fig 1; green), and a DHHW family protein (Fig 1; blue). Each of these discrete genes we identified in the conserved ccs locus represented pairwise reciprocal best-hits of each other in the other respective genomes we identified them within, suggesting co-evolution of these genes as functionally co-operative operon subunits among the Clostridia.
Homology modelling of the predicted glycosyl hydrolases suggests they belong to the CAZy family glycosyl hydrolase family 5 (GH5) with a variety of enzymes from this family occurring as the top scoring alignments and modelling templates. Glycosyl hydrolases are highly conserved components of exopolysaccharide synthase operons and are thought to be required for release of the polysaccharides into the extracellular space, as well as possibly playing a role in release or dispersion from the biofilm matrix [14,31,48]. We named this hypothetical protein CcsZ, to distinguish it from its functional equivalent in the Gram-negative cellulose synthase operon, BcsZ, which is itself a member of glycosyl hydrolase family 8 [30].
BLAST searches and homology modelling of the protein of unknown function encoded adjacent to ccsA returned no known homologs outside of Clostridia. Homology modelling of this sequence suggested the N-terminal region aligns to structurally resolved family 11 carbohydrate binding module (CBM11) domains and probably adopts a similar β-jelly roll fold. These CBM11 proteins are found exclusively among Gram-positive bacteria and are known to bind mixed linkage β-glucans with high specificity [49]. We named this gene ccsB because it is not homologous to the Gram-negative BcsB, but its predicted three-dimensional fold and putative function align with that of BcsB in aiding the translocation of the nascent exopolysaccharide across the cytoplasmic membrane [25,50]. Our predictive model would then imply that CcsB (Fig 2; yellow) would be localized to the cytoplasmic membrane and would form a stable complex with CcsA that is both necessary and sufficient for cellulose synthesis (as observed for BcsA/BcsB). To predict the topology of CcsB, we analyzed its sequence with the TMHMM bioinformatics tool, and found CcsB contains two predicted transmembrane helices of 19 and 22 residues in length. Each of these predicted transmembrane helices reside near the termini of the protein; thereby bracketing the region that aligned to known CBM11 domains. In addition, the region between these helices with the putative CBM domain is predicted to be extracellular, which further positions CcsB as functionally and topologically analogous to BcsB.
We also identified two additional genes that were always encoded in the immediate genomic context of ccsA, ccsB and ccsZ. BLAST and homology modelling analysis of these two genes provided strong evidence that the hypothetical proteins encoded by these genes belong to the membrane-bound O-acyltransferase (MBOAT) family and the DHHW family (Figs 1 and 2; green and blue, respectively). The former is a family of enzymes found in all domains of life, which are responsible for cell surface modification in bacteria [51] and in some vertebrates are responsible for lipid biosynthesis and/or remodelling [52,53]. The latter DHHW family are found exclusively in bacteria and have demonstrated roles in the O-acetylation of cell surface polysaccharides [54]. Taken together, these findings suggest a role for these proteins in O-acetylation of the cellulosic material produced by the adjacent hypothetical operon subunits. The enzymatic modification of many exopolysaccharides have been described and have been reviewed expertly by others [12]. Among those operons that produce modified exopolysaccharides are at least three Gram-negative cellulose operons, which produce crystalline cellulose [18,55], phosphoethanolamine cellulose [17,19,56] and O-acetylated cellulose [11,57]. In Gram-negative bacteria, the operon subunits of Pseudomonas fluorescens responsible for O-acetylation of cellulose exopolysaccharides (Fig 1) are homologous to those involved in O-acetylation of alginate [11], a distinct exopolysaccharide found in Pseudomonas aeruginosa [58,59]. Bundalovic-Torma and colleagues have proposed that the acetylated cellulose operon class emerged because of the genomic co-occurrence of alginate operons among Pseudomonads, thereby resulting in subsequent duplication and operonic acquisition of the subunits required for O-acetylation [57]. Since no alginate operons have been identified in Clostridia from which these operon subunits could be acquired, this evolution by duplication and operonic acquisition of alginate machinery would account for the lack of homology between Gram-negative acetylcellulose operons and the MBOAT/DHHW pair we identified as part of the ccs locus. Instead, we propose that these putative O-acetyltransferase proteins evolved by duplication and operonic acquisition of secondary cell wall modifying complexes that are known to involve the functional contribution of MBOAT/DHHW family proteins in tandem [54]. In keeping with the naming of operon subunits based on their functional equivalence, rather than their evolutionary relationships to the Gram-negative cellulose operon, we named the MBOAT/DHHW pair CcsH and CcsI, respectively, as our model predicts them to be functionally equivalent to WssH and WssI (Figs 1 and 2) [11].
Surprisingly, the systematic pipeline for classifying bacterial operons reported by Bundalovic-Torma and colleagues did not identify the ccs operon we describe here. This was a direct result of considering only those loci that contained both a polysaccharide synthase subunit and one additional locus involved in exopolysaccharide modification or transport as defined previously [15]. As our findings above suggest, the additional ccs operon components are not homologous to their Gram-negative functional equivalents and no Gram-positive operon subunits were used in the search because none had been identified a priori to that study. Instead, Bundalovic-Torma and coworkers identified a unique putative cellulose “operon” in C. difficile 630 composed of only homologs of BcsA and BcsB, which shared genomic context. To investigate the possible existence of multiple cellulose operons in C. difficile 630, we assessed the genomic context of this BcsA/B pair proposed by Bundalovic-Toma and coworkers using our multiple-homology approach. We identified that this putative cellulose synthase BcsA from C. difficile 630 in fact aligns most closely with a related family 2 mannosyltransferase, although it does appear to share more distant homology with BcsA as well. Further, in the genomic context of this putative BcsA/B pair, we identified a putative glycosyl hydrolase belonging to family GH113, a WecB UDP-GlcNAc 2-epimerase homolog and a predicted polysaccharide deacetylase with distant homology to the carbohydrate esterase family 4. Given that i) family GH113 is exclusively composed of endo-mannanases, ii) the polysaccharide polymerase is most closely related to a mannosyltransferase and iii) that a UDP-GlcNAc 2-epimerase is required to produce UDP-ManNAc as an enzymatic precursor to polymer synthesis, these findings suggest that this predicted operon in fact encodes a complex for the polymerization, export, and modification of a partially deacetylated ManNAc-rich exopolysaccharide. This result underpins the challenge of identifying new exopolysaccharide-associated operons and operon classes, owing to their highly mosaic nature and the multiple evolutionary origins of each functionally equivalent subunit found within them, coupled with the lack of a robust technique to accurately identify and functionally annotate them in a high-throughput manner.
Dannheim and colleages have anecdotally proposed that the hypothetical genes we named ccsZABHI may be responsible for the secretion of an acetylated glucan by C. difficile [60]. This work cited the observation of a similar acetylated glucan in Clostridium acetobutyliticum, which was noted to possess these same putative genes [60,61]. However, the polymer was proposed to be involved in either exopolysaccharide or capsular biosynthesis. The biosynthesis of capsular polysaccharides in bacteria is carried out by structurally and functionally distinct systems from exopolysaccharide synthases that have been expertly reviewed by others [62]. Here, the structural and functional model we have proposed for ccsZABHI (Fig 2) does not support their role in capsule biosynthesis, as none of the predicted structures, functions, or homology of these proteins are in line with the requirements for the described capsular biosynthesis systems. Thus, our analysis suggests that the ccsZABHI locus independently identified by Dannheim and colleagues encodes a series of proteins functionally similar to known cellulose exopolysaccharide synthases in Gram-negative bacteria. Furthermore, we have expanded these findings to show that these genes are widely distributed among other pathogenic and non-pathogenic species of class Clostridia.
The overall structure of CcsZ
We engineered and synthesized the soluble domain of ccsZ from C. difficile, corresponding to residues 30–340 of the GenPept entry WP_077724661.1, into the pET21 expression vector to generate a recombinant His-tagged expression construct of CcsZ. Protein from this construct expressed well, yielding up to 75 mg/L of culture and purified to near homogeneity following nickel affinity chromatography and anion exchange chromatography, making it an excellent candidate for structure determination. Our CcsZ crystals grew in space group P 21 and diffracted X-rays well to 1.75 Å resolution with a single copy of CcsZ in the asymmetric unit. We solved the crystal structure of CcsZ using the molecular replacement technique based upon the model of apo-TmCel5A (PDB ID 3AMD) as a search template. Model building and subsequent refinement resulted in an improved Rwork and Rfree of 0.227 and 0.260, respectively. The model covers almost the entirety of the CcsZ construct, from residues 38 to 337 in the sequence, although residue Ser91 could not be reliably built into the model due to a lack of electron density. The complete data collection and refinement statistics are available in Table 1. The completed structure of CcsZ was deposited to the Protein Data Bank (PDB) under accession code 6UJE.
Table 1. Diffraction data processing, refinement statistics and model validation.
CcsZ apo | CcsZ-BGC | |
---|---|---|
Data collection | ||
Wavelength (Å) | 0.9795 | 0.9795 |
Space group | P 1 21 1 | P 1 21 1 |
Unit cell parameters; Å,° | a = 56.95 b = 45.63 c = 59.94 α = 90, β = 94.99, γ = 90 | a = 57.02 b = 46.53 c = 61.09 α = 90, β = 94.50, γ = 90 |
Resolution, Å | 43.04–1.75 (1.813–1.75) | 31.69–2.00 (2.071–2.00) |
Total no. reflections | 60071 (5845) | 43557 (4313) |
No. of unique reflections | 30543 (2983) | 25294 (2157) |
Redundancy | 2.0 (2.0) | 2.0 (2.0) |
Completeness, % | 96.85 (87.20) | 99.50 (99.58) |
Average I/σ(I) | 9.0 (0.99) | 17.97 (2.4) |
Rmerge, * % | 0.08 (3.62) | 0.02 (0.28) |
CC1/2 | 0.998 (0.229) | 1 (0.903) |
CC* | 1 (0.611) | 1 (0.974) |
Refinement | ||
Rwork/Rfree† | 0.227 / 0.260 | 0.238 / 0.287 |
No. of non-H atoms | ||
Total | 2676 | 2650 |
Protein | 2556 | 2556 |
Ligands | 1 | 12 |
Water | 120 | 82 |
Average B-factors, ‡ Å2 | ||
Protein | 43.83 | 44.22 |
Ligands | 30.0 | 30.0 |
Water | 45.89 | 42.17 |
rmsd | ||
Bond lengths, Å | 0.05 | 0.009 |
Bond angles,° | 1.16 | 1.15 |
Ramachandran plot‡ | ||
Total favoured, % | 94.95 | 93.94 |
Total allowed, % | 4.71 | 5.72 |
PDB accession | 6UJE | 6UJF |
Values in parentheses correspond to the highest resolution shell. I/σ(I): intensity of a group of reflections divided by the standard deviation of those reflections.
*Rmerge = ∑∑ |I(k) − <I>|/∑ I(k), where I(k) and <I> represent the diffraction intensity values of the individual measurements and the corresponding mean values. The summation is over all unique measurements.
†Rwork = ∑ ||Fobs|− k|Fcalc||/|Fobs|, where Fobs and Fcalc are the observed and calculated structure factors, respectively. Rfree is the sum extended over a subset of reflections excluded from all stages of the refinement.
‡As calculated using MolProbity [63].
In agreement with typical GH-5 enzymes, CcsZ folds into an overall structure adopting a distorted TIM barrel fold, in which an (α/β)8 barrel is formed at the core of the fold by eight parallel β-strands (Fig 3A and 3B). This β-barrel motif is flanked by a series of eight partially distorted α-helices packed against the core β-strands, which are connected by extended loops along the C-terminal face of the β-barrel motif. The extended C-terminal loops shape a deep cleft that is typical of GH-5, where both the active site and the cleft for substrate accommodation are located. Interestingly, we observed this groove to be strongly negatively charged in CcsZ, which is not a typical feature of GH-5 (Fig 3C). The GH-5 consensus catalytic acid/base and nucleophile residues are present in CcsZ as Glu-164 and Glu-281, respectively (Fig 3A). Our structure of apo-CcsZ superimposed excellently on apo-TmCel5A, with a root mean square deviation (r.m.s.d.) of 0.677 Å across 252 equivalent Cα atoms, as expected given the reasonably high sequence identity (43%) between these proteins and the highly conserved GH-5 fold.
Comparison to known GH-5 structures
Carbohydrate-Active Enzyme family 5 (CAZy GH-5) is an exceptionally large family containing >15,000 known and predicted enzymes. In keeping with the size of GH-5, the substrate specificity of this family is widely variable and encompasses numerous carbohydrate structures, with some members demonstrating strict specificity for single glycan structures, while other members have demonstrated promiscuity with activity on many distinct glycans [64]. The diversity of substrate preference in the GH-5 family is also matched by the wide variety of chemical properties among GH-5 enzymes, such as temperature or pH optima [64–68]. In efforts to delineate the diversity of this large family of glycosyl hydrolases, phylogenetic analyses were performed and numerous sequence-based phylogenetic subfamilies with characteristic substrate preferences were created [64]. Under this classification, Aspeborg and colleagues placed CcsZ into GH-5 subfamily 25 (GH-5_25), a small subfamily of enzymes belonging primarily to thermophilic bacteria, although two other GH-5_25 enzymes from C. difficile were also classified in GH-5_25 [64]. It was not surprising that, based upon the molecular phylogenetics of GH-5_25, the TmCel5A enzyme we used as the search model for molecular replacement of CcsZ is also among the most closely related to CcsZ of all GH-5 sequences [64].
GH-5_25 is reported as a polyspecific subfamily of GH-5 that possesses multiple activities [64]. Notably, TmCel5A was reported to exhibit activity on both mannan and glucan polymers with β-(1,4) linkage, including linear and branched polysaccharides [69]. Supporting this finding, the crystal structure of apo-TmCel5A was also reported alongside structures of TmCel5A ligand complexes, including TmCel5A bound to cellotetraose, mannotriose, glucose and cellobiose. We compared the structure of TmCel5A in complex with cellotetraose (PDB ID 3AZT) to CcsZ in an effort to understand how their substrate accommodation might differ. The TmCel5A-cellotetraose complex also superimposed well over CcsZ, with an r.m.s.d. of 0.620 Å over 242 equivalent Cα atoms. A comparison of the substrate-binding cleft, informed by the positioning of the cellotetraose ligand of TmCel5A, demonstrated a clear difference between the subsite architectures of the two enzymes (Fig 4). The residues that contribute to the -1 and -2 subsites of TmCel5A appear to be partially conserved in CcsZ, present as H123, H124, N163, Y225 and N48. However, in TmCel5A, W210 shapes the face of the -2 subsite and stacks with glucopyranose rings, but in CcsZ, the equivalent Y237 occupies a position that would serve as the -3 subsite in TmCel5A; thereby resulting in a substrate binding cleft in CcsZ that terminates at the equivalent -2 subsite.
TmCel5A was also reported to be highly thermostable [68], a feature rationalized by a larger fraction of buried atoms, a smaller accessible surface area and the presence of shorter unstructured loops as compared to the structure of a mesophilic GH-5 cellulase from Clostridium cellulolyticum (CcCel5A; PDB id 1EDG) [65]. However, for the purpose of comparison, it is worth noting that CcCel5A does not belong to GH-5_25 but instead to GH-5_4, which includes endo-β-(1,4) glucanases specific for xyloglucan, as well as licheninases and xylanses. To assess if CcsZ possessed similar features attributed to the thermal stability of TmCel5A, we uploaded our structure to the PDBePISA server. This analysis revealed both a solvent-accessible surface area for CcsZ of 13497 Å2 and a proportion of buried atoms of 0.50 that was most similar to TmCel5A (13720 Å2 and 0.51) rather than CcCel5A (15,410 Å2 and 0.46). Furthermore, CcsZ possessed unstructured loops more similar to TmCel5A (r.m.s.d. 0.677 Å across 252 Cα atoms) than CcCel5A (r.m.s.d. 1.053 Å across 172 Cα atoms) based on superimposition of the structures.
After comparison of our structure of CcsZ to other related GH-5 structures, we found the relatively high sequence identity and structural similarity between CcsZ and TmCel5A, rather than CcCel5A, (i.e., 41% vs 18% identity) surprising, based on our expectation that CcsZ would be highly specific for an O-acetylated cellulose exopolysaccharide, in contrast to the broad specificity of TmCel5A for varied mannan and glucan structures. We also did not anticipate CcsZ to possess features associated with extended thermal stability given that C. difficile is a mesophilic bacterium and that the biofilm phenotype has been reported at temperatures of 25–37°C [8,34]. Accordingly, we set out to biochemically characterize CcsZ to further explore these properties.
The product-bound structure of CcsZ
Next, in order to experimentally resolve the subsite architecture and mechanism of substrate accommodation by CcsZ, we attempted to solve the structure of CcsZ in complex with cello-oligosaccharides. We were able to grow crystals in a distinct but chemically similar condition to our apo-CcsZ crystals and successfully introduced cellotriose following crystal growth but prior to crystal harvesting. These crystals diffracted X-rays to 1.65 Å resolution and also grew in the space group P 21 containing a single polypeptide in the asymmetric unit (Table 1). We solved the structure of our ligand complex with molecular replacement using the structure of the apo-form as the search model. The ligand-bound structure was in complete agreement with the apo-form and although the experimental resolution was higher for the ligand complex crystals than the apo-form, low completeness was observed for the high-resolution data and so the ligand complex structure was refined only against data with a maximum resolution of 2.0 Å. Refinement of the ligand complex structure resulted in an Rwork and Rfree of 0.238 and 0.287, respectively (Table 1). The final model of the CcsZ-ligand complex also covered almost the entirety of the protein, from residues 39–340 and included the missing Ser91 from the apo-form.
Following refinement of the complex structure, we observed a Fourier electron density peak, clearly interpretable in both the Fourier 2 Fo—Fc electron density map at 1.0 σ contour level and the Fo—Fc map at 3.0 σ contour level. This density corresponded well to an apparent glucopyranose ring near the active site. Subsequent ligand fitting placed glucose into this density with a promising real-space correlation coefficient (RSCC) of 0.714 in an unambiguous orientation, with placement aided by the strong electron density observed for the oxygen atom of the C6 hydroxyl. We then calculated a Polder omit map for the glucose ligand, resulting in an improved electron density that allowed refinement of the glucose ligand to a final RSCC of 0.861 (Fig 5C). The bound glucose makes polar contacts with the side-chain N atom of N48 via the C2 hydroxyl, the N atom of the H123 side chain via the C6 hydoxyl and the N atom of the W314 side chain via the C6 hydroxyl, with approximate distances of 3 Å in all cases (Fig 5B). The final refined structure of the CcsZ-glucose complex was deposited to the PDB under accession code 6UJF.
The observation of a single glucose residue bound to CcsZ when a trisaccharide was soaked into the crystals suggested to us that the bound ligand represented an enzymatic product rather than a bound substrate or glucosyl intermediate, which was not surprising given the observed activity of our CcsZ construct specifically on glucan substrates (noted below). Accordingly, this glucose molecule was bound at the equivalent of the -2 subsite in TmCel5A, but appears spatially confined by the side chain of Y237 and the loop presenting W314, both of which shifted only trivially between our apo- and glucose-bound structures with an all-atom r.m.s.d. of 0.206 Å across 2232 equivalent atoms (Fig 5A). Comparatively, the Gram-negative cellulose hydrolase BcsZ (a GH-8 enzyme), which was resolved both in apo-form (3QXF) and as a complex with cellopentaose (3QXQ), displays a substrate-binding architecture whereby the +1 and +2 subsites are angled at approximately 60° from the -1 to -4 subsites at the nonreducing side of the catalytic center [30]. Thus, structural plasticity of the substrate-binding groove and induced-fit distortion of longer saccharide substrates is certainly possible in these enzymes and is also conceivable in CcsZ, given the structure we observed.
CcsZ is an endo-β-(1,4)-glucanase
To test if CcsZ was in fact capable of hydrolytic activity on cellulose, we assessed its activity on the soluble substrate analogue carboyxmethylcellulose (CMC) first using the plate-based zymogram method reported by Mazur and Zimmer for BcsZ [30]. Following staining with Congo Red, a zone of clearing was observed suggesting CcsZ was capable of cleaving the glucosidic bonds of CMC after 1 h incubation (Fig 6A). We also performed this same plate-based zymogram experiment using 0.8% (w/v) concentrations of either CMC, hydroxyethyl cellulose (HEC) and beechwood xylan, or 0.4% (w/v) curdlan over a 24 h incubation. As expected, we observed that CcsZ was active on CMC and HEC but had only limited cleavage of xylan or curdlan (S1 Fig).
Subsequently, we sought to measure CcsZ activity on CMC quantitatively using the dinitrosalicylic acid (DNS) reducing sugar assay, where enzyme activity was calculated from our raw data using a standard curve of glucose prepared using the DNS method. We observed an increase in reducing sugar concentration following incubation with CcsZ in a concentration-dependent manner, further indicating CcsZ is capable of using CMC as a substrate (Fig 6B). We also tested CcsZ with CMC under a range of pH buffer conditions to measure pH stability. We found CcsZ to have a pH optimum of approximately 4.5, which is typical of other characterized GH-5 enzymes (Fig 7A) [65–67]. Although CcsZ was most active under acidic conditions, we found CcsZ was tolerant of a wide variety of pH and buffer conditions and still demonstrated detectable hydrolase activity on CMC across all pH values we tested (pH 4–10). CcsZ activity was reduced two-fold between pH 4.5 and 7.5, with a particular loss in activity under alkaline conditions, showing a six-fold reduction in activity at pH 9.5.
To assess substrate specificity, we also tested the common GH-5 polysaccharide substrates arabinoxylan, xylan, lichenin and β-glucan, along with β-(1,3)-linked curdlan, using the same DNS reducing sugar assay. We did not observe a significant increase in reducing sugar concentration following prolonged CcsZ incubation (i.e., 24 h) with arabinoxylan, xylan, or lichenin under our conditions (Fig 7B), consistent with the expected activity of CcsZ on cellulose exopolysaccharides and its classification in GH-5_25. However, we observed CcsZ exhibited 5-fold greater activity on β-glucan, a polymer with mixed β-(1,3) and β-(1,4) linkages. In agreement with our structural data, this observation suggested that carboxymethylcellulose was actually a poor CcsZ substrate owing to the stearic constraints of the carboxymethyl groups. Instead, activity on a β-glucan is likely more representative of the minimal, unbranched structure of cellulose exopolysaccharides and others have also reported that β-glucan is a superior substrate mimetic to CMC for GH-5 strict endo-β-glucanases [66]. In a follow-up analysis with curdlan, we did not observe a significant increase in reducing sugar concentration with this solely β-(1,3)-linked substrate; thereby indicating that the activity on β-glucan must have been limited to the β-(1,4) linkages, which is consistent with cellulose as the proposed preferred substrate for CcsZ.
Exo-acting activities are rare among GH-5 enzymes, with the majority of characterized GH-5 enzymes characterized as endo-acting [64], although it is worth noting that this is a challenging biochemical distinction to establish experimentally and that these activities may not be mutually exclusive. In the case of CcsZ, an unusual substrate-accommodation cleft in addition to our structure of CcsZ bound to a glucose monosaccharide suggested that CcsZ may exhibit strict exo-glucanase activity, which warranted further investigation. To assess the regioselectivity of CcsZ, we incubated the enzyme with the mock substrate cellopentaose (G5) and analyzed the enzymatic products by liquid chromatography-mass spectrometry (LC-MS; Fig 8). Our enzyme-free control contained only the m/z species for the intact G5 starting material, indicating the substrate was not solvent-labile. In the mass spectra of the CcsZ enzymatic products, we observed predominantly m/z species that corresponded to sodiated ions for glucose monosaccharide (G1), cellobiose disaccharide (G2), cellotriose trisaccharide (G3) and cellotetraose (G4) species. In addition, we also observed in very low abundance the unreacted and intact starting material (G5) among the CcsZ products, which were completely digested after 90 min elapsed incubation. This data suggested CcsZ is capable of mixed exo- and endo-glucanase activity on substrates at least 4 saccharide units in length, at least for the minimal substrate cellopentaose. Although glucose monosaccharides were released, the accumulation of di- and tri-saccharide species after prolonged enzymatic digestion suggests the exo-glucanase activity of CcsZ is negligible compared to its endo-glucanase activity. This mixed activity also accounted for the existence of a glucose monosaccharide as an enzymatic product, which we were able to structurally resolve bound to CcsZ.
To further asses CcsZ, we analyzed the endo- vs exo- activity using a K-CELLG5-4V Cellulase Kit (Megazyme, Ireland), which experimentally tests for endocellulase activity using a bimodified cellopentaose substrate that contains terminal p-nitrophenyl and 3-ketobutylidene groups. Upon incubation with endocellulase, the modified glucan is cleaved internally; thereby producing exposed terminal glucose subunits that are cleaved by an ancillary β-glucosidase to yield free glucose and p-nitrophenolate (pNP). The coupled enzyme assay (i.e., containing the tested endocellulase plus the manufacturer supplied exo-β-glucosidase) is terminated and the pNP colour is assessed upon addition of Tris stopping reagent. Thus, concentrations of pNP are directly proportionate to endocellulase activity as the chromophore cannot be released without the initial endo-glucanase activity that cleaves the internal β-(1,4)-linkages. In our analysis of CcsZ, we observed formation of pNP that verified CcsZ has endocellulase activity (Fig 9). A parallel reaction without exo-β-glucosidase yielded no formation of the colorimetric pNP, which suggests that CcsZ possesses no observable exo-activity over the time course of the reaction (i.e., exocellulase activity was only observed in the coupled assay with exo-β-glucosidase present). Thus, our results are in keeping with the literature on GH-5 enzymes, of which the majority have endo-acting activity and rarely exhibit exo-activity [64] and are also consistent with other cellulases found in cellulose synthase clusters that are proposed to release nascent chains from the cell into the biofilm [30,32,70].
Conclusions
Here, we demonstrate the presence of the ccsZABHI locus that bears sequence similarities to known Gram-negative cellulose exopolysaccharide biosynthesis systems and propose for the first time a model of cellulose synthesis in Gram positives (Fig 2). Our subsequent analysis of the cognate glycosyl hydrolase, CcsZ, reveals the Clostridial cellulose synthase contains a functionally analogous, but structurally distinct, glycosyl hydrolase from the known Gram-negative cognate glycosyl hydrolase BcsZ. The findings we present here represent an entry point to an understanding of the molecular mechanisms governing cellulose exopolysaccharide production in the important and well-studied human pathogens Clostridioides difficile, Clostridium botulinum and Clostridium perfringens, as well as other Gram-positive bacteria.
Supporting information
Acknowledgments
We thank Dr. Dyanne Brewer and Dr. Armen Charcoglycan at the Mass Spectrometry facility of the Advanced Analysis Center, University of Guelph for their technical assistance and expertise with mass spectrometry experiments. AA was previously supported by an Ontario Graduate Scholarship via Wilfrid Laurier University. Research described in this paper was performed using beamline 08ID-1 at the Canadian Light Source, which is supported by the Canada Foundation for Innovation, Natural Sciences and Engineering Research Council of Canada, the University of Saskatchewan, the Government of Saskatchewan, Western Economic Diversification Canada, the National Research Council Canada and the Canadian Institutes of Health Research.
Data Availability
All PDB coordinate files have been deposited in the Worldwide protein databank (https://www.rcsb.org) under accession codes 6UJE and 6UJF and will be made available upon acceptance of the manuscript. Copies of the validation reports have been submitted as supporting information alongside this manuscript.
Funding Statement
JTW is supported by the Natural Sciences and Engineering Research Council of Canada (NSERC) in the form of a grant (#418310) - https://www.nserc-crsng.gc.ca/index_eng.asp The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
References
- 1.Valentini M, Filloux A. Biofilms and Cyclic di-GMP (c-di-GMP) Signaling: Lessons from Pseudomonas aeruginosa and Other Bacteria. J Biol Chem. 2016;291: 12547–55. 10.1074/jbc.R115.711507 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2.Flemming H-C, Wingender J, Szewzyk U, Steinberg P, Rice SA, Kjelleberg S. Biofilms: an emergent form of bacterial life. Nat Rev Microbiol. 2016;14: 563–575. 10.1038/nrmicro.2016.94 [DOI] [PubMed] [Google Scholar]
- 3.Flickinger ST, Copeland MF, Downes EM, Braasch AT, Tuson HH, Eun Y-J, et al. Quorum Sensing between Pseudomonas aeruginosa Biofilms Accelerates Cell Growth. J Am Chem Soc. 2011;133: 5966–5975. 10.1021/ja111131f [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.Stewart PS, William Costerton J. Antibiotic resistance of bacteria in biofilms. Lancet. 2001;358: 135–138. 10.1016/s0140-6736(01)05321-1 [DOI] [PubMed] [Google Scholar]
- 5.Fux CA, Costerton JW, Stewart PS, Stoodley P. Survival strategies of infectious biofilms. Trends Microbiol. 2005;13: 34–40. 10.1016/j.tim.2004.11.010 [DOI] [PubMed] [Google Scholar]
- 6.Marsh P. Dental Plaque as a Microbial Biofilm Dental Plaque–Existing Perspective. Caries Res. 2004;38: 204–211. 10.1159/000077756 [DOI] [PubMed] [Google Scholar]
- 7.Ross P, Mayer R, Benziman M. Cellulose biosynthesis and function in bacteria. Microbiol Rev. 1991;55: 35–58. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Canale-Parola E, Borasky R, Wolfe RS. Studies on Sarcina ventriculi III. Localization of Cellulose. J Bacteriol. 1961;81: 311–318. 10.1128/JB.81.2.311-318.1961 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Aly MA, Reimhult E, Kneifel W, Domig KJ. Characterization of Biofilm Formation by Cronobacter spp. Isolates of Different Food Origin under Model Conditions. J Food Prot. 2018;82: 65–77. 10.4315/0362-028x.jfp-18-036 [DOI] [PubMed] [Google Scholar]
- 10.Saldaña Z, Xicohtencatl-Cortes J, Avelino F, Phillips AD, Kaper JB, Puente JL, et al. Synergistic role of curli and cellulose in cell adherence and biofilm formation of attaching and effacing Escherichia coli and identification of Fis as a negative regulator of curli. Environ Microbiol. 2009;11: 992–1006. 10.1111/j.1462-2920.2008.01824.x [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Spiers AJ, Bohannon J, Gehrig SM, Rainey PB. Biofilm formation at the air-liquid interface by the Pseudomonas fluorescens SBW25 wrinkly spreader requires an acetylated form of cellulose. Mol Microbiol. 2003;50: 15–27. 10.1046/j.1365-2958.2003.03670.x [DOI] [PubMed] [Google Scholar]
- 12.Whitfield GB, Marmont LS, Howell PL. Enzymatic modifications of exopolysaccharides enhance bacterial persistence. Front Microbiol. 2015;6: 1–21. 10.3389/fmicb.2015.00001 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Hollenbeck EC, Antonoplis A, Chai C, Thongsomboon W, Fuller GG. Phosphoethanolamine cellulose enhances curli-mediated adhesion of uropathogenic Escherichia coli to bladder epithelial cells. Proc Natl Acad Sci. 2018;115: 10106–10111. 10.1073/pnas.1801564115 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Low KE, Howell PL. Gram-negative synthase-dependent exopolysaccharide biosynthetic machines. Curr Opin Struct Biol. 2018;53: 32–44. 10.1016/j.sbi.2018.05.001 [DOI] [PubMed] [Google Scholar]
- 15.Bundalovic-Torma C, Whitfield GB, Marmont LS, Lynne Howell P, Parkinson J. A systematic pipeline for classifying bacterial operons reveals the evolutionary landscape of biofilm machineries [Internet]. PLoS Computational Biology. 2020. 10.1371/journal.pcbi.1007721 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16.Römling U, Gaperin MY. Bacterial cellulose biosynthesis: diversity of operons, subunits, products, and functions. Trends Microbiol. 2015;23: 545–557. Available: http://resolver.scholarsportal.info/resolve/0966842x/v23i0009/545_bcbdoospaf.xml [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17.Thongsomboon W, Serra DO, Possling A, Hadjineophytou C, Hengge R, Cegelski L. Phosphoethanolamine cellulose: A naturally produced chemically modified cellulose. Science (80-). 2018;359: 334–338. 10.1126/science.aao4096 [DOI] [PubMed] [Google Scholar]
- 18.Maeda K, Tamura J, Okuda Y, Narikawa R, Midorikawa T, Ikeuchi M. Genetic identification of factors for extracellular cellulose accumulation in the thermophilic cyanobacterium Thermosynechococcus vulcanus: proposal of a novel tripartite secretion system. Mol Microbiol. 2018;109: 121–134. 10.1111/mmi.13977 [DOI] [PubMed] [Google Scholar]
- 19.Anderson AC, Burnett AJN, Hiscock L, Maly KE, Weadge JT. The Escherichia coli cellulose synthase subunit G (BcsG) is a Zn2+-dependent phosphoethanolamine transferase. J Biol Chem. 2020;295: 6225–6235. 10.1074/jbc.RA119.011668 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20.Wang X, Iii JFP, Romeo T. The pgaABCD Locus of Escherichia coli Promotes the Synthesis of a Polysaccharide Adhesin Required for Biofilm Formation. J Bacteriol. 2004;186: 2724–2734. 10.1128/jb.186.9.2724-2734.2004 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Arciola CR, Campoccia D, Ravaioli S, Montanaro L. Polysaccharide intercellular adhesin in biofilm: structural and regulatory aspects. Front Celluar Infect Microbiol. 2015;5: 1–10. 10.3389/fcimb.2015.00007 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.Heilmann C, Heilmann C, Schweitzer O, Schweitzer O, Mack D, Mack D, et al. Molecular basis of intercellular adhesion in the biofilm-forming Staphylococcus epidermidis. Mol Microbiol. 1996;20: 1083–1091. 10.1111/j.1365-2958.1996.tb02548.x [DOI] [PubMed] [Google Scholar]
- 23.Atkin KE, Macdonald SJ, Brentnall AS, Potts JR, Thomas GH. A different path: Revealing the function of staphylococcal proteins in biofilm formation. FEBS Lett. 2014;588: 1869–1872. 10.1016/j.febslet.2014.04.002 [DOI] [PubMed] [Google Scholar]
- 24.Whitney JC, Hay ID, Li C, Eckford PDW, Robinson H, Amaya MF, et al. Structural basis for alginate secretion across the bacterial outer membrane. Proc Natl Acad Sci U S A. 2011;108: 13083–8. 10.1073/pnas.1104984108 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.Omadjela O, Narahari A, Strumillo J, Mélida H, Mazur O, Bulone V, et al. BcsA and BcsB form the catalytically active core of bacterial cellulose synthase sufficient for in vitro cellulose synthesis. Proc Natl Acad Sci U S A. 2013;110: 17856–61. 10.1073/pnas.1314063110 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26.McNamara JT, Morgan JLW, Zimmer J. A molecular description of cellulose biosynthesis. Annu Rev Biochem. 2015;84: 895–921. 10.1146/annurev-biochem-060614-033930 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27.Keiski C-L, Harwich M, Jain S, Neculai AM, Yip P, Robinson H, et al. AlgK Is a TPR-Containing Protein and the Periplasmic Component of a Novel Exopolysaccharide Secretin. Structure. 2010;18: 265–273. 10.1016/j.str.2009.11.015 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28.Hay ID, Rehman ZU, Rehm BHA. Membrane topology of outer membrane protein AlgE, which is required for alginate production in Pseudomonas aeruginosa. Appl Environ Microbiol. 2010;76: 1806–12. 10.1128/AEM.02945-09 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29.Wolfram F, Arora K, Robinson H, Neculai AM, Yip P, Howell PL. Expression, purification, crystallization and preliminary X-ray analysis of Pseudomonas aeruginosa AlgL. Acta Crystallogr Sect F Struct Biol Cryst Commun. 2012;68: 584–587. 10.1107/S1744309112012808 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30.Mazur O, Zimmer J. Apo- and cellopentaose-bound structures of the bacterial cellulose synthase subunit BcsZ. J Biol Chem. 2011;286: 17601–6. 10.1074/jbc.M111.227660 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31.Ahmad I, Rouf SF, Sun L, Cimdins A, Shafeeq S, Le Guyon S, et al. BcsZ inhibits biofilm phenotypes and promotes virulence by blocking cellulose production in Salmonella enterica serovar Typhimurium. Microb Cell Fact. 2016;15: 177 10.1186/s12934-016-0576-6 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32.Yasutake Y, Kawano S, Tajima K, Yao M, Satoh Y, Munekata M, et al. Structural characterization of the Acetobacter xylinum endo-β-1,4- glucanase CMCax required for cellulose biosynthesis. Proteins Struct Funct Genet. 2006;64: 1069–1077. 10.1002/prot.21052 [DOI] [PubMed] [Google Scholar]
- 33.Poquet I, Saujet L, Canette A, Monot M, Mihajlovic J, Ghigo JM, et al. Clostridium difficile Biofilm: Remodeling metabolism and cell surface to build a sparse and heterogeneously aggregated architecture. Front Microbiol. 2018;9: 1–20. 10.3389/fmicb.2018.00001 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34.Pantaléon V, Bouttier S, Soavelomandroso AP, Janoir C, Candela T. Biofilms of Clostridium species. Anaerobe. 2014;30: 193–198. 10.1016/j.anaerobe.2014.09.010 [DOI] [PubMed] [Google Scholar]
- 35.Whitfield GB, Marmont LS, Bundalovic-Torma C, Razvi E, Roach EJ, Khursigara CM, et al. Discovery and characterization of a Gram-positive Pel polysaccharide biosynthetic gene cluster. bioRxiv. 2019; 768473 10.1101/768473 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36.Balsells E, Shi T, Leese C, Lyell I, Burrows J, Wiuff C, et al. Global burden of Clostridium difficile infections: A systematic review and meta-analysis. J Glob Health. 2019;9 10.7189/jogh.09.010407 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 37.Lessa FC, Mu Y, Bamberg WM, Beldavs ZG, Dumyati GK, Dunn JR, et al. Burden of Clostridium difficile infection in the United States. N Engl J Med. 2015;372: 825–834. 10.1056/NEJMoa1408913 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 38.Zhang S, Palazuelos-Munoz S, Balsells EM, Nair H, Chit A, Kyaw MH. Cost of hospital management of Clostridium difficile infection in United States-a meta-analysis and modelling study. BMC Infect Dis. 2016;16 10.1186/s12879-016-1786-6 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 39.Waterhouse A, Bertoni M, Bienert S, Studer G, Tauriello G, Gumienny R, et al. SWISS-MODEL: Homology modelling of protein structures and complexes. Nucleic Acids Res. 2018;46: W296–W303. 10.1093/nar/gky427 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 40.Kelley LA, Mezulis S, Yates CM, Wass MN, Sternberg MJE. The Phyre2 web portal for protein modeling, prediction and analysis. Nat Protoc. 2015;10: 845–858. 10.1038/nprot.2015.053 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 41.Krogh A, Larsson B, Von Heijne G, Sonnhammer ELL. Predicting transmembrane protein topology with a hidden Markov model: Application to complete genomes. J Mol Biol. 2001;305: 567–580. 10.1006/jmbi.2000.4315 [DOI] [PubMed] [Google Scholar]
- 42.Nakai K, Horton P. PSORT: A program for detecting sorting signals in proteins and predicting their subcellular localization. Trends Biochem Sci. 1999;24: 34–35. 10.1016/s0968-0004(98)01336-x [DOI] [PubMed] [Google Scholar]
- 43.Kabsch W. XDS. Acta Crystallogr Sect D Biol Crystallogr. 2010;66: 125–132. 10.1107/S0907444909047337 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 44.Evans PR. An introduction to data reduction: Space-group determination, scaling and intensity statistics. Acta Crystallogr Sect D Biol Crystallogr. 2011;67: 282–292. 10.1107/S090744491003982X [DOI] [PMC free article] [PubMed] [Google Scholar]
- 45.Adams PD, Afonine P V., Bunkóczi G, Chen VB, Davis IW, Echols N, et al. PHENIX: A comprehensive Python-based system for macromolecular structure solution. Acta Crystallogr Sect D Biol Crystallogr. 2010;66: 213–221. 10.1107/S0907444909052925 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 46.Emsley P, Lohkamp B, Scott WG, Cowtan K. Features and development of Coot. Acta Crystallogr Sect D Biol Crystallogr. 2010;66: 486–501. 10.1107/S0907444910007493 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 47.Krissinel E, Henrick K. Inference of macromolecular assemblies from crystalline state. J Mol Biol. 2007;372: 774–797. 10.1016/j.jmb.2007.05.022 [DOI] [PubMed] [Google Scholar]
- 48.Whitney JC, Howell PL. Synthase-dependent exopolysaccharide secretion in Gram-negative bacteria. Trends Microbiol. 2013;21: 63–72. 10.1016/j.tim.2012.10.001 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 49.Carvalho AL, Goyal A, Prates JAM, Bolam DN, Gilbert HJ, Pires VMR, et al. The family 11 carbohydrate-binding module of Clostridium thermocellum Lic26A-Cel5E accommodates β-1,4- and β-1,3–1,4-mixed linked glucans at a single binding site. J Biol Chem. 2004;279: 34785–34793. 10.1074/jbc.M405867200 [DOI] [PubMed] [Google Scholar]
- 50.Morgan JLW, Strumillo J, Zimmer J. Crystallographic snapshot of cellulose synthesis and membrane translocation. Nature. 2013;493: 181–186. 10.1038/nature11744 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 51.Ma D, Wang Z, Merrikh CN, Lang KS, Lu P, Li X, et al. Crystal structure of a membrane-bound O-acyltransferase. Nature. 2018;562: 286–290. 10.1038/s41586-018-0568-2 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 52.Chang TY, Chang CCY, Ohgami N, Yamauchi Y. Cholesterol sensing, trafficking, and esterification. Annu Rev Cell Dev Biol. 2006;22: 129–157. 10.1146/annurev.cellbio.22.010305.104656 [DOI] [PubMed] [Google Scholar]
- 53.Liu Q, Siloto RMP, Lehner R, Stone SJ, Weselake RJ. Acyl-CoA:diacylglycerol acyltransferase: Molecular biology, biochemistry and biotechnology. Prog Lipid Res. 2012;51: 350–377. 10.1016/j.plipres.2012.06.001 [DOI] [PubMed] [Google Scholar]
- 54.Sychantha D, Little DJ, Chapman RN, Boons G-JJ, Robinson H, Howell PL, et al. PatB1 is an O-acetyltransferase that decorates secondary cell wall polysaccharides. Nat Chem Biol. 2018;14: 79–85. 10.1038/nchembio.2509 [DOI] [PubMed] [Google Scholar]
- 55.Saxena IM, Brown RM. Identification of a Second Cellulose Synthase Gene (acsAII) in Acetobacter xylinum. J Bacteriol. 1995;177: 5276–5283. 10.1128/jb.177.18.5276-5283.1995 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 56.Sun L, Vella P, Schnell R, Polyakova A, Bourenkov G, Li F, et al. Structural and Functional Characterization of the BcsG Subunit of the Cellulose Synthase in Salmonella typhimurium. J Mol Biol. 2018;430: 3170–3189. 10.1016/j.jmb.2018.07.008 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 57.Koza A, Jerdan R, Cameron S, Spiers AJ. Three biofilm types produced by a model pseudomonad are differentiated by structural characteristics and fitness advantage. Microbiology. 2020; 10.1099/mic.0.000938 [DOI] [PubMed] [Google Scholar]
- 58.Riley LM, Weadge JT, Baker P, Robinson H, Codée JDC, Tipton PA, et al. Structural and Functional Characterization of Pseudomonas aeruginosa AlgX: Role of AlgX in Alginate Acetylation. J Biol Chem. 2013;288: 22299–22314. 10.1074/jbc.M113.484931 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 59.Baker P, Ricer T, Moynihan PJ, Kitova EN, Walvoort MTC, Little DJ, et al. P. aeruginosa SGNH Hydrolase-Like Proteins AlgJ and AlgX Have Similar Topology but Separate and Distinct Roles in Alginate Acetylation. PLoS Pathog. 2014;10 10.1371/journal.ppat.1004334 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 60.Dannheim H, Will SE, Schomburg D, Neumann-Schaal M. Clostridioides difficile 630Δerm in silico and in vivo–quantitative growth and extensive polysaccharide secretion. FEBS Open Bio. 2017;7: 602–615. 10.1002/2211-5463.12208 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 61.Häggström L, Förberg C. Significance of an extracellular polymer for the energy metabolism in Clostridium acetobutylicum: A hypothesis. Appl Microbiol Biotechnol. 1986;23: 234–239. 10.1007/BF00261921 [DOI] [Google Scholar]
- 62.Whitfield C, Wear SS, Sande C. Assembly of Bacterial Capsular Polysaccharides and Exopolysaccharides. Annu Rev Microbiol. 2020;74: 521–543. 10.1146/annurev-micro-011420-075607 [DOI] [PubMed] [Google Scholar]
- 63.Chen VB, Arendall WB, Headd JJ, Keedy DA, Immormino RM, Kapral GJ, et al. MolProbity: all-atom structure validation for macromolecular crystallography. Acta Crystallogr Sect D Biol Crystallogr. 2010;66: 12–21. 10.1107/S0907444909042073 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 64.Aspeborg H, Coutinho PM, Wang Y, Brumer H, Henrissat B. Evolution, substrate specificity and subfamily classification of glycoside hydrolase family 5 (GH5). BMC Evol Biol. 2012;12 10.1186/1471-2148-12-186 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 65.Pereira JH, Chen Z, McAndrew RP, Sapra R, Chhabra SR, Sale KL, et al. Biochemical characterization and crystal structure of endoglucanase Cel5A from the hyperthermophilic Thermotoga maritima. J Struct Biol. 2010;172: 372–379. 10.1016/j.jsb.2010.06.018 [DOI] [PubMed] [Google Scholar]
- 66.McGregor N, Morar M, Fenger TH, Stogios P, Lenfant N, Yin V, et al. Structure-function analysis of a mixed-linkage β-glucanase/xyloglucanase from the key ruminal bacteroidetes prevotella bryantii B14. J Biol Chem. 2016;291: 1175–1197. 10.1074/jbc.M115.691659 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 67.Wu TH, Huang CH, Ko TP, Lai HL, Ma Y, Chen CC, et al. Diverse substrate recognition mechanism revealed by Thermotoga maritima Cel5A structures in complex with cellotetraose, cellobiose and mannotriose. Biochim Biophys Acta—Proteins Proteomics. 2011;1814: 1832–1840. 10.1016/j.bbapap.2011.07.020 [DOI] [PubMed] [Google Scholar]
- 68.Datta S, Holmes B, Park JI, Chen Z, Dibble DC, Hadi M, et al. Ionic liquid tolerant hyperthermophilic cellulases for biomass pretreatment and hydrolysis. Green Chem. 2010;12: 338–345. 10.1039/b916564a [DOI] [Google Scholar]
- 69.Chhabra SR, Shockley KR, Ward DE, Kelly RM. Regulation of endo-acting glycosyl hydrolases in the hyperthermophilic bacterium Thermotoga maritima grown on glucan- and mannan-based polysaccharides. Appl Environ Microbiol. 2002;68: 545–554. 10.1128/aem.68.2.545-554.2002 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 70.Tan K, Joachimiak G, Endres M, Joachimiak A, Midwest Center For Structural Genomics. PDB ID 4Q2B: The crystal structure of an endo-1,4-D-glucanase from Pseudomonas putida KT2440 [Internet]. 2014. 10.2210/pdb4Q2B/pdb [DOI] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
All PDB coordinate files have been deposited in the Worldwide protein databank (https://www.rcsb.org) under accession codes 6UJE and 6UJF and will be made available upon acceptance of the manuscript. Copies of the validation reports have been submitted as supporting information alongside this manuscript.