Abstract
The Cellulosome is an intricate macromolecular protein complex that centralizes the cellulolytic efforts of many anaerobic microorganisms through the promotion of enzyme synergy and protein stability. The assembly of numerous carbohydrate processing enzymes into a macromolecular multiprotein structure results from the interaction of enzyme-borne dockerin modules with repeated cohesin modules present in noncatalytic scaffold proteins, termed scaffoldins. Cohesin–dockerin (Coh-Doc) modules are typically classified into different types, depending on structural conformation and cellulosome role. Thus, type I Coh-Doc complexes are usually responsible for enzyme integration into the cellulosome, while type II Coh-Doc complexes tether the cellulosome to the bacterial wall. In contrast to other known cellulosomes, cohesin types from Bacteroides cellulosolvens, a cellulosome-producing bacterium capable of utilizing cellulose and cellobiose as carbon sources, are reversed for all scaffoldins, i.e., the type II cohesins are located on the enzyme-integrating primary scaffoldin, whereas the type I cohesins are located on the anchoring scaffoldins. It has been previously shown that type I B. cellulosolvens interactions possess a dual-binding mode that adds flexibility to scaffoldin assembly. Herein, we report the structural mechanism of enzyme recruitment into B. cellulosolvens cellulosome and the identification of the molecular determinants of its type II cohesin–dockerin interactions. The results indicate that, unlike other type II complexes, these possess a dual-binding mode of interaction, akin to type I complexes. Therefore, the plasticity of dual-binding mode interactions seems to play a pivotal role in the assembly of B. cellulosolvens cellulosome, which is consistent with its unmatched complexity and size.
Keywords: cellulosome, cohesin, dockerin, dual-binding, protein complex, crystal structure, cellulose, cellulase
Abbreviations: A. cellulolyticus, Acetivibrio (Hungateiclostridium) cellulolyticus; B. cellulosolvens, Bacteroides (Pseudobacteroides) cellulosolvens; C. cellulolyticum, Clostridium (Hungateiclostridium) cellulolyticum; C. thermocellum, Clostridium (Hungateiclostridium) thermocellum; CAZymes, Carbohydrate Active enZymes; Coh, Cohesin; Doc, Dockerin; IMAC, Immobilized Metal-ion Affinity Chromatography; ITC, Isothermal Titration Calorimetry; NGE, Nondenaturing Gel Electrophoresis; R. flavefaciens, Ruminococcus flavefaciens; rmsd, root-mean-square deviation; SLH, S-layer homology
Recycling of photosynthetically fixed carbon is a crucial microbial process, critical to the cycling of carbon between plants, herbivores, and microbes. Bacteroides (Pseudobacteroides) cellulosolvens is a mesophilic, anaerobic bacterium capable of degrading crystalline cellulose (1, 2). Similar to other bacteria such as Clostridium (Hungateiclostridium) thermocellum and Acetivibrio (Hungateiclostridium) cellulolyticus, Bacteroides cellulosolvens produces an extracellular multimodular cellulolytic complex—the cellulosome—responsible for the degradation of the plant cell wall (3). Noteworthy, B. cellulosolvens has the most intricate cellulosome so far described, conceivably capable of congregating up to 110 carbohydrate-active enzymes in a cell-associated mega-Dalton complex (Fig. 1) (4).
Figure 1.
Schematic representation of two possible B. cellulosolvens cellulosomal architectures. The cohesins and dockerins are color-coded to highlight the different Coh-Doc specificities. Doc-containing enzymes are incorporated into the ScaA1 scaffoldin through interaction with the 11 ScaA1 Cohs (light green). ScaA1 can then bind to anchoring scaffoldins ScaF or ScaB, tethering the cellulosome to the wall (yellow interactions). ScaF can only interact with a single ScaA1 scaffoldin, originating a cellulosomal unit with 11 enzymes. On the other hand, ScaB can integrate up to ten different ScaA1 units, generating a cellulosome with 110 enzymes. ScaA1 possesses an internal family 3 CBM unit (light blue) for substrate targeting. ScaF and ScaB both possess a SLH module (orange) for cell surface attachment, with an adjacent X-module (gray) with unknown function. The interaction here described (BcCohScaA111-DocCel48) is highlighted with a dashed red circle.
Cellulosomes are built around a primary noncatalytic protein scaffold, named scaffoldin, bearing reiterated cohesin (Coh) modules that serve as protein–protein interaction targets to dockerin (Doc) modules found in an extensive repertoire of independent Carbohydrate-Active enZymes (CAZymes). The ability to gather a large number of diverse enzymes into the cellulosome presumably provides these anaerobic microorganisms an important advantage in their competitive ecological niches, through complementary and synergic enzyme cooperation, while also promoting enzyme stability (5). The cellulosome’s Coh-Doc protein:protein interactions constitute the primary driving force for cellulosomal assembly and are among nature’s strongest protein:protein interactions (Ka > 109 M−1). Also, due to a distinctive twofold internal symmetry, Doc modules can potentially bind their cognate Cohs in two different orientations, by rotating 180° with respect to its protein ligand (6). This is known as a dual-binding mode, as opposed to a single-binding mode that occurs when only one of the Doc interfaces supports the formation of the Coh-Doc complex (7). A sequence-based classification of Cohs and Docs distinguishes type I and type II interactions as major categories (8). In the archetypal cellulosome of Clostridium thermocellum, the assembly of the different enzymes into the main scaffoldins is mediated by type I Coh-Doc interactions with a dual-binding mode, whereas the anchoring of the scaffoldin to the bacterial cell wall is achieved through type II single-binding mode Coh-Doc interactions (5, 9). Although there are noteworthy exceptions, such as a generalized Coh-Doc dual-binding mode in the cellulosome of Acetivibrio cellulolyticus (8, 10) or, conversely, a ubiquitous single-binding mode found in Ruminococcus flavefaciens (11), this has been considered the rule.
Based on the almost complete genome sequence of B. cellulosolvens (12), Zhivin et al. (4) performed a seminal study on the architecture and functional organization of the most complex cellulosome described so far. Their work has identified 31 different scaffoldins, many of which lack any known cell-surface binding domains, thus supporting an extensive putative cell-free cellulosome system. Besides a novel classification of Coh-Doc pairings, named type R, another striking feature of this cellulosome is a reversal of Coh types found in scaffoldins. Unlike in other species, B. cellulosolvens’primary scaffoldin recruitment of Doc-bearing enzymes is mediated by type II Coh-Doc interactions, while anchoring scaffoldins rely on type I Coh-Doc complexes for cell wall attachment. The primary scaffoldin subunit of B. cellulosolvens, termed ScaA1, like other primary scaffoldin proteins, incorporates a carbohydrate-binding module (family 3 CBM), but is unusually composed of 11 type II Cohs and a C-terminal type I Doc that does not have an associated X module, which is a known type II Coh-Doc interaction stabilizer (3). Cell-surface anchoring of ScaA1 occurs via interaction of its single type I Doc with one of the ten type I Cohs found in the ScaB anchoring scaffoldin. Anchoring to the peptidoglycan-associated polymers from the bacterium cell surface likely results from a noncovalent interaction with ScaB’s S-layer homology (SLH) domain (13). This SLH-mediated interaction is likely aided by an adjacent X module, another B. cellulosolvens peculiarity (4). While the functional implications of the reversed specificity of Coh types in B. cellulosolvens cellulosome architecture remain unclear, the actual structural determinants of specificity of its Coh-Doc type II interactions are also currently unknown. This knowledge gap is relevant considering that, although known cellulosome Coh-Doc structures share a remarkable overall conservation of structural topology, the major determinants for interspecies and intraspecies barriers, as well as for the binding mode, depend on very subtle amino acid residue differences (14).
The structure of B. cellulosolvens isolated 11th type II Coh module of BcScaA1 (PDB code: 1tyj (15)), BcScaA1-CBM3 (PDB code: 2xbt, (16)), and seventh type I Coh module of BcScaB (PDB code: 4ums, (10)) were previously reported. The latter report suggested a dual-binding mode for B. cellulosolvens cellulosomal cell anchoring. This presumed plasticity for cell anchoring binding mode is similar to that found on A. cellulolyticus Coh-Doc interactions involving the primary scaffoldin (AcScaA), a unique adaptor scaffoldin (AcScaB) and several anchoring scaffoldins (AcScaC, AcScaD, and AcScaF) (PDB codes 4u3s/4wi0 (8), and 4uyp/4uyq, (10). All of these are in contradiction to the canonical single-binding mode found for cellulosome cell anchoring in C. thermocellum (PDB code 2b59, (17). The 11th type II Coh structure of BcScaA1 shows an overall fold similarity with several type II Cohs from A. cellulolyticus, namely with AcCohScaB3 (PDB code 4u3s), with a Q-score of 0.79 and root-mean-square deviation (rmsd) of 1.22 Å over 163 aligned Cα residues. It also bears close homology to the type II CtCohScaF (PDB code 2b59) with a Q-score of 0.77 and rmsd of 1.40 Å against 164 aligned Cα residues. They all share the characteristic α-helical crowning between strands 6 and 7 and the two singular β-flaps that disrupt strands 4 and, particularly, strand 8 where its 12 residues are flanking the type II Coh Doc-binding plateau.
The elucidation of B. cellulosolvens cellulosome assembly and the reversal of Coh types found in its scaffoldins hinge upon the availability of type II CohScaA-Doc complex structures to understand the consequences of this unusual arrangement. Here is reported the crystal structure of the type II 11th Coh of the primary scaffoldin of B. cellulosolvens in complex with the Doc module of a glycoside hydrolase of family 48, BcCohScaA111-DocCel48 (PDB code: 2y3n). A detailed binding characterization informed by the structural data has also been carried out, which allowed the identification of the molecular determinants of Coh-Doc interaction and suggested a typical dual-binding mode for cellulosome enzyme assembly, thus agreeing with the common paradigm for a flexible arrangement, as found on C. thermocellum (6). In light of this report and considering the putative dual-binding mode for scaffoldin assembly in both B. cellulosolvens and A. cellulolyticus (10), it is fair to conclude that the single versus dual-binding mode in Coh-Doc complexes is independent of type classification. It also does not seem to be a matter of enzyme assembly versus cell-anchoring/scaffoldin assembly. Rather, it might be related to the size and complexity that is possible to achieve in a single unit within a cellulosomal system, with larger cellulosomes requiring a higher degree of flexibility for proper assembly and access to substrate.
Results and discussion
A critical factor to understand the mechanism of cellulosome assembly of B. cellulosolvens is the availability of an X-ray crystal structure of the type II Coh-Doc interaction, central to CAZyme assembly around the primary BcScaA1 scaffoldin. Escherichia coli coexpression strategies for the production and purification of Coh-Doc complexes were thus used to obtain good-quality crystals of highly pure protein complexes of the type II 11th Coh of the primary scaffoldin of B. cellulosolvens (BcCohScaA111), in complex with the Doc module of a glycoside hydrolase of family 48 (DocCel48). A previous study described the structure of this Coh in its unbound form (PDB code: 1tyj (15)), and solving its structure in complex with a bound Doc would allow probing structural differences arising from Coh binding to a Doc partner. The chosen Doc module belongs to one of the most abundant cellulosomal CAZymes, GH48 cellobiohydrolase, and was previously reported to bind Cohs from ScaA1 (4, 18).
Expression and crystallization of B. cellulosolvens Coh-Doc complex
The high degree of sequence conservation between the DocCel48’s two dockerin repeats suggests the existence of two cohesin-binding interfaces, thus supporting a dual-binding mode. This implies that two different complex conformations could be present in solution, which would likely compromise protein crystallization due to a lack of unit cell homogeneity. It is well established that residues at relative positions 10 and 11 of each of the two Doc duplicated segments play a key role in Coh recognition and act as specificity determinants (residues #17, #18, #50, and #51 of the construct used in this work) (9). Thus, a DocCel48 mutant was designed to force binding through a single interface, promoting homogeneity of the purified protein. The mutations used for the crystallization experiments were designed to replace the putative recognition residues in relative positions 10 and 11 of the C-terminal Doc repeat (Met50 and Ala51) with those of the B. cellulosolvens ScaA type I Doc (Ser-Asp), rather than the commonly applied alanine substitution. These amino acid changes were chosen based on the lack of cross-reaction between type I and type II Coh-Doc complexes. The sequence of the resulting Doc is displayed in Table S2. This strategy allowed us to obtain large yields of highly pure Coh-Doc complexes for crystallization, which led to the production of well-diffracting crystals.
Structure of a type II Coh-Doc complex from B. cellulosolvens
A molecular replacement strategy was used to solve the complex’s structure, using the available BcCohScaA111 structure as an input model (PDB code: 1tyj (15)). This yielded a solution with two Cohs in the asymmetric unit. Successive rounds of automated ARP/wARP (19) and manual COOT (20) adjustments to build the Doc modules in both crystallographic Coh-Doc dimer complexes resulted in a final REFMAC5 (21) refined structure at 1.90 Å resolution. The two molecules of the BcCohScaA111-DocCel48 heterodimer share 299 water molecules and each Doc is coordinating two calcium (Ca2+) ions. The Coh-Doc complex includes residues 2073 to 2242 from BcCohScaA111 (Nonredundant RefSeq accession number AAG01230), 683 to 752 from BcDocCel48A (Nonredundant RefSeq accession number WP_050753099). The structure belongs to the monoclinic space group P1211 with unit cell dimensions of a = 43.4 Å, b = 116.1 Å, c = 45.2 Å, and β = 112.5°. Due to disorder, some of the dockerin’s helix 2 residues could not be modeled, namely those between Gly32 and Asn37 of chain B and 15 residues from Ala30 to Asn44 in chain D. Some C- and N-terminal residues, including the 6-histidine tag, are also absent in all four chains. The final model was deposited in the Protein Data Bank under accession code 2y3n. Data collection and refinement statistics are shown in Table 1.
Table 1.
X-ray crystallography data collection and refinement statistics for BcCohScaA1-DocCel48
| Data quality | BcCohScaA1-DocCel48 |
|---|---|
| Cell dimensions, Å | a = 43.4 |
| b = 116.1 | |
| c = 45.2 | |
| β = 112.5° | |
| Space group | P1211 |
| X-ray source | ESRF, ID14-EH1 |
| Wavelength, Å | 0.934 |
| Resolution of data (outer shell), Å | 41.74–1.90 (2.00–1.90) |
| Rpim (outer shell)a | 0.073 (0.278) |
| Rmerge (outer shell)a | 0.090–0.051 (0.329) |
| Mean I/σ (I) (outer shell) | 15.0 (3.9) |
| Completeness (outer shell), % | 83.9 (66.4) |
| Multiplicity (outer shell) | 2.40 (2.2) |
| Structure quality | |
| N° of protein atoms (AU) | 3765 |
| N° calcium atoms | 4 |
| N° solvent waters | 299 |
| Resolution used in refinement, Å | 1.90 |
| Rwork/Rfree,%b | 16.3/22.5 |
| Average temperature factors, Å2 | |
| Main chain (CohA, DocB, CohC, DocD) | 21.3, 31.0, 21.2, 44.3 |
| Side chain (CohA, DocB, CohC, DocD) | 24.4, 33.0, 24.2, 47.7 |
| Calcium atoms (B1, B2, D1, D2) | 25.6, 22.9, 27.3, 61.84 |
| Solvent waters | 39.3 |
| RMS deviations | |
| Bond lengths, Å | 0.022 |
| Bond angles, ° | 1.696 |
| Ramachandran’s plot analysis | |
| Favorable, % | 96.1 |
| Allowed, % | 3.6 |
| Outlier, % | 0.2 |
| PDB accession code | 2y3n |
Values in parenthesis are for the highest resolution shell.
, where I is the observed intensity, and is the statistically-weighted average intensity of multiple observations. , a redundancy-independent version of Rmerge.
, where |Fcalc| and |Fobs| are the calculated and observed structure factor amplitudes, respectively. (Rfree is calculated for a randomly chosen 5% of the reflections).
BcCohScaA111 structure in the complex
The cohesin domain of the type II Coh11-Doc complex of B. cellulosolvens shows the typical flattened, elongated 9-stranded β-barrel jelly-roll topology (Fig. 2A) with a highly hydrophobic core. Similar to the C. thermocellum structure, the nine β-strands define two β-sheets—the first β-sheet is defined by strands 8-3-6-5 (front face) and the second is defined by strands 9-1-2-7 (back face). The common α-helical crowning observed between strands 6 and 7 and the two β-flap regions that disrupt the normal progression of strands 4 and 8 are maintained (10, 11, 15), with the latter making further contact with the dockerin counterpart. Comparing this structure with that of the unbound BcCohScaA111 (1tyj) shows that, globally, the Coh does not undergo significant conformational changes upon binding, as revealed by the low rmsd value (0.66 Å for 166 Cα atoms) between both structures. Nonetheless, some differences can be found on the binding plateau that better accommodates the ligand partner, namely on the β-strand 8 loop (defined by residues 136 and 145) as indicated by a larger rmsd value of 1.35 Å between 11 atom pairs. The closest BcCohScaA111 functionally relevant structural homologues according to the PDBeFold server (http://www.ebi.ac.uk/msd-srv/ssm/) are type II cohesins from A. cellulolyticus ScaB (PDB accession codes 3bwz and 4u3s), responsible for the anchoring between adaptor scaffoldin ScaB and main scaffoldin ScaA through a dual-binding mode interaction. These protein modules were matched with a Z-score of 11.4, rmsd of 1.18 Å, sequence identity of 44% over 163 aligned residues and Z-score of 12.6, rmsd of 1.22 Å, sequence identity of 40% over 162 aligned residues, respectively. The type II cohesin from C. thermocellum’s ScaF (SdbA) also shares a high degree of structural homology with BcCohScaA111 (PDB 2b59), Z-score of 12.3, rmsd of 1.40 Å, sequence identity of 30% over 164 aligned residues. Thus, the dual- and single-binding Coh structures show a high degree of sequence and structural conservation (Fig. 2D), which is curious but not unexpected as the dual versus Single-binding mode is determined by the nature of the dockerin partner.
Figure 2.
BcCohScaA111-DocCel48’s structure.A, bird’s-eye view of the Bacteroides cellulosolvens type II complex BcCohScaA111-DocCel48 in ribbon representation, with the dockerin color-ramped from N-terminus (blue) to C-terminus (red) and the cohesin in gold. The van der Waals surfaces of both modules are depicted in transparent coloring. B, detailed view of the N-terminal calcium coordinating loop. C, detailed view of the C-terminal calcium coordinating loop. Calcium coordinating residues are depicted in stick representation. Hydrogen bonds are shown as dashed lines, water molecules as red spheres, and calcium ions as green spheres. The coordinating residues and calcium ions are surrounded by a mesh representation of the Refmac5 maximum-likelihood σA-weighted 2Fo−Fc electron density map contoured at 1σ (0.46 electrons/A3). D, multiple sequence alignment of BcCohScaA111 with other type II cohesins from A. cellulolyticus (AcCohScaB) and C. thermocelum (CtCohScaF). A cartoon representation of BcCohScaA111’s secondary structure is displayed in orange, above the alignment. The sequences were aligned using the Clustal Omega tool and further processed with ALINE. Colouring according to similarity was implemented with ALINE (37): dark green, identical residues; green to white, lowering color-ramped scale of conservation. Residues involved in hydrogen bonding with BcDocCel48 are marked with a blue triangle and those establishing hydrophobic contacts with yellow circles.
Type II Doc structure in the complex
The dockerin domain of the B. cellulosolvens type II Coh-Doc complex reveals a classic Doc structure, composed of two loop α-helix motifs (EF-hand like motifs) each with a bound calcium ion, separated by a 12-residue unstructured linker (7, 11, 22, 23) (Fig. 2A). Helix 1 is formed by residues Asn16–Ser26, and helix 3 is formed by residues Ser50–Phe60. Helices 1 and 3 are arranged in an antiparallel orientation that places the two calcium ions in opposite sides of the dockerin module, similar to that observed for other dockerins. The calcium ions are coordinated in a typical octahedral geometry in both EF-hand motif loops (Fig. 2, B and C). The first calcium ion is located near the N-terminus of the dockerin and is coordinated by residues Asp8 (Oδ1), Asn10 (Oδ1), Asp12 (Oδ1), Val14 (backbone carbonyl), Asp19 (Oδ1 and Oδ2), and a water molecule. The second calcium is coordinated by Asp41 (Oδ1), Asn43 (Oδ1), Asp45 (Oδ1), Val47 (backbone carbonyl), Asp52 (Oδ1 and Oδ2), and a water molecule. The 12-residue linker region between helices 1 and 3 (Phe27-Asn49) shows a large degree of mobility, making it difficult to define several residues. This is particularly evident in the dockerin at chain D, where the calcium ion has a B-factor of 62 Å2 and calcium coordinating residues Asp41 and Asn43 could not be defined. Since the type II complex of B. cellulosolvens lacks the X module, which is thought to help stabilize the cohesin–dockerin interaction in type II complexes, it is possible that for the correct assembly of this complex, the presence of an adjacent module is required.
The structure of B. cellulosolvens type II Doc presents a functionally relevant internal symmetry, as illustrated in Figure 3. This internal symmetry is reflected by the low rmsd values between both helices (0.62 Å for 24 Cα atoms). When B. cellulosolvens dockerin structure was compared with both Xyn10 B type I (PDB code: 1ohz) and the CipB type II (PDB code: 5k39) dockerins from C. thermocellum, it is clear that helices 1 and 3 are highly homologous, with low rmsd values (1.20 Å for 26 Cα atoms and 0.75 Å for 39 Cα atoms for the type I and type II modules, respectively) (Fig. 4A). Interestingly, the first calcium-binding region is shorter than in the CipB complex, bringing it closer to the type I structure. Moreover, the internal symmetry between helices 1 and 3 exists not only at the tertiary structure level, as in the case of the CipB type II complex, but also at the primary sequence level (Fig. 3C), as in the type I Xyn10B module. This is a crucial feature for the dual-binding mode mechanism.
Figure 3.
Highly symmetric nature of BcDocCel48 supports a dual-binding mode of interaction.A, BcDocCel48’s structure overlaid with a 180° rotated version of itself (gray), showing conservation of key Coh interacting residues. B, overlaying BcCohScaA111-DocCel48 with a complex with a rotated dockerin (gray), shows that the overall structure of the complex as well as key contacts are maintained. The dockerin is shown color-ramped from N-terminus (blue) to C-terminus (red) and the cohesin in gold. The van der Waals surface of the cohesin is depicted in transparent coloring. C, primary structure alignment of the N-terminal half of the dockerin with its C-terminal half, showing a remarkable symmetry provided by a high degree of conservation. A cartoon representation of BcDocCel48’s secondary structure is displayed in orange, above the alignment. The sequences were aligned using the Clustal Omega tool and further processed with ALINE (37). Conserved residues are colored in green. Residues involved in hydrogen bonding with BcCohScaA111 are marked with a blue triangle and those establishing hydrophobic contacts with yellow circles. The second alpha-helix could not be modeled in the structure due to disorder but is shown in gray in panel C, as it is a highly conserved feature of bacterial dockerins.
Figure 4.
BcCohScaA111-DocCel48 type II interaction shares traits with both canonical type I and type II complexes.A, overlay of BcDocCel48 (light green) with a type I dockerin (blue) and a type II dockerin (pink) from C. thermocellum showing that, despite some subtle differences, there is an overall tertiary structure conservation. B, position of the “non-dominant” helix (the helix which is further removed from the binding interface) of BcDocCel48 (light green), a type I dockerin from C. thermocellum (blue), and a type II dockerin from C. thermocellum (pink), relatively to their cohesin partners. The image was obtained by superposing the cohesins of BcCohScaA111-DocCel48, a type I C. thermocellum complex (PDB: 1ohz), and a type II C. thermocellum complex (PDB: 5k39). The cohesin shown is BcCohScaA111 (in gold). The angle of the rotation was determined by measuring the angles formed by the helices and the longitudinal orientation of the cohesin’s β-strands at the binding interface. This reveals that the angle formed between BcDocCel48 nondominant helix and the cohesin stands between those seen in the canonical type I and type II complexes of C. thermocellum.
The complex interface—an alternative binding mode
The BcCohScaA1-DocCel48 cohesin–dockerin interface comprises mainly one face of the cohesin (defined by strands 8-3-6-5) and helices 1 and 3 of the dockerin. The contacts were calculated using the PDBSum server. Among these interactions there are several hydrophobic contacts and a few hydrogen bonds (Fig. 5), which occur mainly between helix 1 of the dockerin and the cohesin (Table 2 and Table S1). This indicates a preferential helix for the formation of the complex as in the case of the type I C. thermocellum complex. When compared with other Coh-Doc complexes, namely with the type I (PDB code: 1ohz) and type II (PDB code: 5k39) complexes from C. thermocellum, the position of the dockerin relatively to the cohesin lays midway between both structures. In C. thermocellum’s type I complex, the dockerin helix that is further away from the binding interface is tilted by about 40° relative to the longitudinal orientation of the cohesin’s β-strands, while in its type II counterpart they run almost parallel, with a slight 4° rotation of the helix. In comparison, the same helix in B. cellulosolvens’type II DocCel48 is tilted by about 12° (Fig. 4B). Consequently, this helix forms fewer contacts with the cohesin than in the C. thermocellum’s type II complex but a more than in the type I. Interestingly, there are no hydrogen bond contacts between side chains of both modules. All H-bonds present in the binding interface are either water-mediated or involve a backbone atom of either the cohesin or the dockerin. Thus, the data suggest that hydrophobic interactions are highly dominant in the formation of this complex (Table 2 and Table S1).
Figure 5.
Cohesin–dockerin interface of BcCohScaA111-DocCel48. Structure of BcCohScaA111-DocCel48 complex with a detailed view of the Coh-Doc interface showing the main polar interactions (panel A) and main hydrophobic contacts (panel B). In both panels, the most important residues involved in Coh-Doc recognition are depicted in stick representation. Dashed black lines represent hydrogen-bond interactions. The Doc is shown color-ramped from N-terminus (blue) to C-terminus (red). The Coh is shown in gold. Ca2+ ions are depicted as green spheres. In both panels, the transparent gray disk marks the plane defined by the 8-3-6-5 β-sheet, where the β-strands form a distinctive Doc-interacting plateau.
Table 2.
Main polar contacts between BcCohScaA1 and BcDocCel48 M2 mutant
| Direct hydrogen bonds | |||||
|---|---|---|---|---|---|
| # | CohScaA11 |
Distance (Å) | Dockerin |
||
| Residue | Atom | Residue | Atom | ||
| 1 | Gln37 | Oε1 | 2.74 | Met17 | N |
| 2 | Lys81 | NZ | 3.06 | Phe60 | O |
| 3 | Ser139 | O | 2.96 | Asn16 | Nδ2 |
| 4 | Asp153 | O | 2.82 | Gln25 | Nε2 |
| Water-mediated hydrogen bonds | ||||||||
|---|---|---|---|---|---|---|---|---|
| # | CohScaA11 |
Distance (Å) | H2O |
Distance (Å) | Dockerin |
|||
| Residue | Atom | Atom | Residue | Atom | ||||
| 1 | Gln37 | Nε2 | 3.09 | H2O11 | O | 3.17 | Gly61 | N |
| 2 | Gln37 | Nε2 | 3.09 | H2O11 | O | 2.70 | Ile15 | O |
| 3 | Ser99 | N | 3.58 | H2O50 | O | 2.80 | Ala24 | O |
| 4 | Asn141 | Oδ1 | 2.73 | H2O100 | O | 2.71 | Asn16 | Nδ2 |
| 5 | Asn141 | Oδ1 | 2.73 | H2O100 | O | 3.11 | Asn12 | Oδ2 |
| 6 | Thr77 | Oγ1 | 2.78 | H2O239 | O | 2.73 | Ser50 | Oγ |
Table was made using data from the PDBSum server.
BcCohScaA111-DocCel48’s binding mechanism: Dual-binding mode in a type II Coh-Doc complex
The contribution of BcDocCel48’s binding surface residues for Coh recognition was initially probed through nondenaturing gel electrophoresis. Residues establishing direct hydrogen bonds as well as those with the most extensive hydrophobic contacts were mutated to alanine and the resulting variants were tested against BcCohScaA1. Because of the possible dual-binding mode, all mutations were performed on both helices, at equivalent positions. The data revealed that out of all BcDocCel48 derivatives, the M17A/M50A mutant and the F27A/F60A mutant had the most impact on binding. To gain further insights into the driving forces of Coh-Doc recognition, the binding thermodynamics of BcDocCel48 to BcCohScaA111 were assessed by isothermal titration calorimetry (ITC) at 308 K (Table 3, Fig. S1). The results, presented in Table 3, revealed that indeed Met17/Met50 and Phe27/Phe60 are the most important dockerin residues for cohesin interaction, as their mutation completely abolishes binding of the two modules. The structure of the complex also shows the dockerin’s Met21 establishing a considerable number of hydrophobic interactions (Table S1) with the cohesin interface. This residue’s placement near key contacting residue Met17, in a highly hydrophobic pocket formed by Coh’s residues Leu94, Leu98, and Phe150, would suggest that Met21 also plays a significant role in the interaction between the two modules. Significantly, this residue is not conserved on helix 3, where it is replaced by an isoleucine (Ile54). Whether this contact is indeed a key component for the interaction (which would contradict the dual-binding mode hypothesis) or not was assessed using the M21A/I54A mutant variant. ITC experiments against the cohesin gave similar affinity to that of the wild-type dockerin, suggesting that these residues are not essential for the interaction. Finally, the N16A/N49A and Q25A/Q58A dockerin mutations have little to no effect on binding, reinforcing the modest role that direct hydrogen bond contacts assume in this interaction, when compared with nonbonded contacts. When comparing BcCohScaA111-DocCel48’s X-ray structure with a modeled one where the Doc was rotated 180° relative to the cohesin, it was observed that most contacts are maintained and no significant clashes were found (Fig. 3B) suggesting that, similarly to type I dockerins in C. thermocellum and B. cellulosolvens itself (6, 10, 22), this type II dockerin can interact with the cohesin in two distinct orientations. To test this hypothesis, the critical methionine and phenylalanine were replaced in only one of the dockerin repeats (BcDocCel48 mutants M1, M2, F27A, and F60A). ITC assays performed using these variants against the cohesin revealed that binding was still possible (Table 3), underpinning the presence of the dual-binding mode, which compensates for the mutations.
Table 3.
Thermodynamics of interaction between wild-type BcCohScaA1 and various variants of BcDocCel48
| Dockerin | Ka M−1 | ΔG° kcal mol−1 | ΔH kcal mol−1 | −TΔS° kcal mol−1 | N |
|---|---|---|---|---|---|
| WT | 3.31E7 ± 1.15E6 | −10.63 | −15.17 ± 0.27 | 4.54 | 0.98 |
| M1 (M17S/A18D) | 1.11E7 ± 7.39E6 | −9.97 | −16.02 ± 0.68 | 6.05 | 0.98 |
| M2 (M50S/A51D) (Structure) | 1.41E6 ± 1.49E5 | −8.7 | −17.73 ± 0.28 | 9.03 | 0.99 |
| M1 + M2 | Nb | Nb | Nb | Nb | Nb |
| M17A/M50A | Nb | Nb | Nb | Nb | Nb |
| N16A/N49A | 1.36E5 ± 1.66E4 | −7.27 | −14.09 ± 1.01 | 6.82 | 0.99 |
| M21A/I54A | 1.20E6 ± 2.85E5 | −8.57 | −9.30 ± 0.24 | 0.73 | 1.00 |
| Q25A/Q58A | 1.35E6 ± 2.25E5 | −8.64 | −8.86 ± 0.39 | 0.22 | 0.99 |
| F27A/F60A | Nb | Nb | Nb | Nb | Nb |
| M1 + F27A | Nb | Nb | Nb | Nb | Nb |
| M1 + F60A | 5.84E4 ± 6.99E3 | −6.89 | −60.92 ± 33.91 | 54.03 | 1.01 |
| M2 + F27A | 4.45E5 ± 3.03E4 | −7.98 | −13.23 ± 0.31 | 5.25 | 1.01 |
| M2 + F60A | Nb | Nb | Nb | Nb | Nb |
Nb, No binding.
All thermodynamic parameters were determined at 308 K.
The structure reported here shows that, by mutating the methionine at position 50, it is possible to force the dockerin to interact through a single interface, resulting in the protein homogeneity required for crystallization. Also, ITC experiments performed with BcDocCel48 mutant derivatives that had only one of the repeated methionine residues changed (mutants M1 and M2) have resulted in affinity values similar to those obtained with the wild-type dockerin. Besides supporting the dual-binding mode, this suggests that methionine residues 17 and 50 participate in binding in an alternating fashion, with either one or the other establishing important contacts with the cohesin, depending on the dockerin orientation. On the other hand, the data suggest that both Phe27 and Phe60 interact with the cohesin. ITC experiments with the F27A and F60A mutants show a drop in affinity, even though binding is still observed, suggesting that they both contribute to the two binding modes. To evaluate whether the relative importance of each phenylalanine is also dependent on dockerin orientation, four dockerin mutants were produced (M17S/F27A, M17S/F60A, M50S/F27A, and M50S/F60A) and had their cohesin affinity tested by ITC. The reasoning behind the mutant design was to use the methionine mutations to force the dockerin into a specific orientation and then test each phenylalanine independently. The results have shown that, when the dockerin interacts with its N-terminal critical methionine (Met17), changing the N-terminal phenylalanine causes a drop in affinity similar to that of the single phenylalanine mutants, while mutating the C-terminal phenylalanine (Phe60) completely abrogates binding. The inverse is true for a Met50-dominated interaction. This means that the phenylalanine on the contralateral helix to that of the binding methionine assumes a prominent role, providing a vital contribution to binding or, in other words, cohesin binding is not possible without the presence of both Met17 and Phe60 or both Met50 and Phe27 (Fig. S2). Rather than having a clear dominant helix interaction, like in classical dual-binding mode type I complexes, or two equally contributing helices as in C. thermocellum’s single-binding mode type II interactions, DocCel48’s binding surface seems to have a dominating “side” for each binding orientation, with major contributions from the N-terminal end of helix-1 and the C-terminal end of helix-3, or vice versa.
The ubiquity of the dual-binding mode
The role of the dual-binding mode in the cohesin–dockerin interaction has long been a topic of discussion. The most consensual hypothesis is that it provides an extra degree of flexibility that allows the accommodation of numerous cellulosomal components in close proximity, avoiding steric hindrance by placing enzymes and structural cellulosomal elements in the correct orientation (6, 23). When it was first reported in the cellulosome of C. thermocellum, the dual-binding mode was thought to be exclusive to type I Coh-Doc complexes. In this species, type I interactions are responsible for assembling the enzymatic components into the main scaffoldin via a dual-binding mode, whereas type II interactions occur between the main scaffoldins and cell-anchoring scaffoldins through single-binding mode contacts. The same was observed in other cellulosomes, such as that of Clostridium (Hungateiclostridium) cellulolyticum, leading to the assumption that enzyme recruiting would require more flexibility than cell-wall anchoring (23). Later, it was revealed that in the cellulosome of B. cellulosolvens there is a role reversal in the Coh-Doc types, with type I complexes working in scaffoldin assembling rather than enzyme recruiting. Nonetheless, B. cellulosolvens’type I dockerins are also able to interact with their cognate partners in two distinct orientations (10). In this context, the meaning of “reversed” types in B. cellulosolvens would indicate a higher requirement in flexibility for scaffoldin assembly rather than for enzyme recruiting. The results presented here seem to suggest that the dual-binding mode exists also in type II B. cellulosolvens Coh-Doc complexes. This contradicts previous notions that the dual-binding mode is either type- or function-dependent, as there are both type I and II complexes with dual-binding mode as well as enzyme recruiting and cell-surface anchoring/scaffoldin assembling complexes displaying this property. In fact, excluding the somewhat obscure “Group-R” Coh and Docs whose function and binding mechanism remain unknown, it seems that B. cellulosolvens cellulosome is assembled exclusively through dual-binding mode interactions.
Although there is a comprehensive structural and mechanistic understanding of Coh-Doc protein–protein interactions, the biological relevance of the dual-binding mode remains elusive. One hypothesis is that the intrinsic symmetry in Docs may result from gene duplication without functional relevance, apart from extending the Coh-Doc contact surface. The fact that R. flavefaciens can assemble its cellulosome exclusively through single-binding mode interactions (11, 24, 25) and the identification of functional nonduplicated “half-dockerins” in this species (26) support this notion and are strong arguments against the relevance of the dual-binding mechanism. Nonetheless, R. flavefaciens, unlike other cellulosome secreting species, inhabits a relatively stable environment with little temperature and pH fluctuations. Furthermore, it assembles a relatively small, albeit diverse, cellulosome, meaning that steric hindrance might not impose significant selective pressure for the evolution of highly dynamic Coh-Doc interactions. Thus, the hypothesis that the dual-binding mode could persist throughout evolution without presenting selective advantage is a controversial one. Also, recent evidence suggests the presence of regulatory mechanisms that dictate dockerin-binding orientation (27, 28). Whether by the influence of external factors, such as pH, or through the action of enzymes that alter the isomerization state of key dockerin residues, cellulosomes appear to be able to control the switch between dockerin-binding platforms and even display preference for one Doc-binding platform over the other (27, 28). This active regulation of cellulosome plasticity points to a functional relevance for the dual-binding mode, although further work is still required to address the premise that the dual-binding mode ultimately serves as a decluttering mechanism. Notwithstanding, given the unprecedented complexity of B. cellulosolvens system, with over 200 dockerin-bearing components and 78 cohesin modules scattered across 31 scaffoldins supporting the assembly of up to 110 enzymes in a single unit, it is likely that the added flexibility in both enzyme recruiting and scaffoldin assembly would result in an evolutionary advantage.
Conclusions
The present study revealed the structure and organization of the type II Coh-Doc complex interaction responsible for enzyme incorporation into the main scaffoldin ScaA of B. cellulosolvens cellulosome. The structure of this complex displays typical folds for both the cohesin and dockerin modules, similar to that found in the vast majority of Coh-Doc structures, although the binding interface presents some significant particularities. Typically, in the classic type I complexes, one Doc helix is further away from the Coh, while the other dominates the interaction. Contrastingly, in type II complexes both helices make equal contribution to the binding. The present complex seems to stand somewhere in between the two, with both helices making important contributions to cohesin recognition while there is still a clear dominant “side.” Computational modeling as well as binding studies indicates that the two modules interact via a dual-binding mode, thus imparting the dual-binding mode as a general feature of all type I and II interactions in B. cellulosolvens. This means that the assembly of B. cellulosolvens cellulosome relies solely on cohesin–dockerin complexes with a dual-mode of binding. Considering the incredible complexity of this system and its capacity to assemble up to 110 enzymes in a single unit, it is not surprising that this cellulosome has evolved mechanisms toward acquiring a higher degree of flexibility and plasticity that would allow the accommodation of such a large number of enzymes and structural components in close proximity. Recent evidence suggests that dockerin-binding orientation can be regulated through changes in the biochemistry of the surrounding environment or by altering the isomerization of certain proline residues. This active regulation of cellulosome plasticity suggests that, indeed, the dual-binding mode plays a fundamental role in cellulosome function. Nonetheless, the premise that avoiding steric hindrance of tightly packed cellulosomal enzymes is the ultimate goal for the dual-binding mode is one that requires further exploration.
Experimental procedures
Gene synthesis and DNA cloning
Docs are inherently unstable when produced in E. coli. To promote Doc stability, B. cellulosolvens Doc of protein WP_050753099 (residues 683–752) was coexpressed in vivo with the 11th Coh of ScaA1, BcCohScaA111 (AAG01230; residues 2073–2242) (29, 30). The immediate binding of BcDocCel48 to BcCohScaA111 is believed to confer the necessary Doc stabilization. The genes encoding the two proteins were designed with a codon usage optimized to maximize expression in E. coli, synthesized in vitro (GenScript Ltd), and cloned into pET28a (Merck Millipore) under the control of separate T7 promoters. The BcDocCel48-encoding gene was positioned at the 5’ end and the BcCohScaA111-encoding gene at the 3’ end of the artificial DNA. A T7 terminator sequence (to terminate transcription of the Doc gene) and a T7 promoter sequence (to control transcription of the Coh gene) were incorporated between the sequences of the two genes. This construct contained specifically tailored NheI and NcoI recognition sites at the 5’ end and XhoI and SalI at the 3’ end to allow subcloning of the nucleic acid into pET-28a (Merck Millipore) such that the sequence encoding a six-residue His tag could be introduced either at the N-terminus of the Doc (through digestion with NheI and SalI, incorporating the additional sequence MGSSHHHHHHSSGLVPRGSHMAS at the N-terminus of the BcDocCel48) or at the C-terminus of the BcCohScaA111 (by cutting with NcoI and XhoI, which incorporates the additional sequence LEHHHHHH at the C-terminus of the Coh). To block the dual-binding mode and promote the structural homogeneity required for protein crystallization, two different genes were synthetized, each with a distinct Doc mutant: mutant M1 with the M17S and A18D amino acid changes and mutant M2 with the M50S and A51D replacements. These substitutions mimic the Coh recognition residues present in Type I Docs of B. cellulosolvens, which do not cross-bind with type II Cohs such as the ones in ScaA1 (4). Thus, as a result of this strategy, four pET28a plasmid derivatives were produced: M1 and M2 variants with the engineered tag either in the Doc or the Coh module. The four plasmids were used to express BcCohScaA111-DocCel48 M1 and M2 complexes in E. coli. Recombinant BcDocCel48 and BcCohScaA111 primary sequences are presented in Table S2.
To produce recombinant BcDocCel48 and BcCohScaA111 individually, the sequences encoding each of the two modules were amplified from B. cellulosolvens genomic DNA by PCR, using NZYProof polymerase (NZYTech Ltd) and the primers shown in Table S3. Following gel purification, the BcDocCel48 encoding amplicon was inserted into the pHTP8 plasmid by homologous recombination (NZYTech Ltd). The resulting expressed product consists of His-tagged BcDocCel48 fused to TrxA for increased solubility and stability. The BcCohScaA111 encoding gene was cloned into pET28a after digestion with NcoI and XhoI restriction enzymes.
To produce the mutants used in the binding experiments, several TrxA-BcDocCel48 protein derivatives were produced using site-directed mutagenesis (Table S3). Each of the newly generated gene sequence was fully sequenced to verify that only the desired mutation accumulated in the nucleic acid chain.
Protein expression and purification
As higher yields were obtained with the cohesin-tagged complexes, these were the ones used in further crystallography experiments. The complex was purified in three steps using an AKTA FPLC machine. The first step was Immobilized Metal-ion Affinity Chromatography (IMAC) purification in a HisTrap HP 5 ml column (GE Healthcare). The column was equilibrated with 50 mM NaHepes buffer, pH 7.5, containing 1 M NaCl, 10 mM imidazole, and 5 mM CaCl2. Proteins were eluted from the column with an imidazole gradient ranging from 10 mM to 300 mM over 25 column volumes. The fractions containing the protein–protein complexes were selected based on SDS-PAGE analysis. The IMAC-purified proteins were then buffer-exchanged in PD-10 Sephadex G25 M gel filtration columns (GE Healthcare) into 20 mM Tris-HCl buffer, pH 8.0, and 2 mM CaCl2. The excess unbound cohesin in the sample was removed with a subsequent purification step by anion exchange chromatography using a column loaded with Source 30Q media (GE Healthcare). The separation of the individual proteins from the complex was achieved through the application of a 0 to 1 M NaCl elution gradient. Prior to filtration chromatography, the protein fractions were buffer-exchanged into 20 mM NaHepes buffer, pH 7.5, containing 200 mM NaCl and 2 mM CaCl2. The purity of the protein was confirmed by SDS-PAGE analysis of the collected fractions. The purified protein was concentrated with Amicon centricons with 10-kDa molecular-mass cutoff centrifugal membranes (Millipore) by centrifuging at 5000 rpm at 4 °C. The final concentration of the protein was kept around 25 mg/ml. A final purification step was performed through size-exclusion chromatography using a HiLoad 16/60 Superdex 75 column (GE Healthcare) previously equilibrated with 20 mM NaHepes buffer, pH 7.5, containing 200 mM NaCl and 2 mM CaCl2. Pure complexes were buffer-exchanged by washing with 2 mM CaCl2 and concentrated to 50 mg/ml.
Complex crystallization
The Type II complex Coh-Doc of B. cellulosolvens was crystallized at 293K by the hanging drop vapor diffusion method and obtained by mixing an equal volume (1 μl) of protein (50 mg/ml in water) and reservoir solution (30% (m/v) polyethylene glycol (PEG) 4000, 0.1 M Tris-HCl, pH 8.5, and 0.2 M magnesium chloride). The crystals were grown over a period of 5 to 6 days at room temperature. Single crystals were harvested in a solution containing 35% (m/v) PEG 4000 and 0.2 M magnesium chloride, and flash-frozen in a liquid nitrogen stream at 100K, using 30% (vol/vol) of glycerol as a cryoprotectant added to the harvesting solution. Only the complex involving the M2 mutant (C-terminal) of BcDocCel48 produced good-quality crystals.
X-ray diffraction, 3D structure determination and refinement
The data were collected at 100 K, using 0.934 Å wavelength radiation at the European Synchrotron Radiation Facility (ESRF), ID14-EH1 to a maximum resolution of 1.90 Å. Diffraction data were processed and scaled, respectively, with programs MOSFLM (31) and SCALA (32) from the CCP4 suite (33). The Matthews coefficient of the ScaA type II Coh-Doc crystal is 1.91 Å3 Da−1 for two 27.5 kDa heterodimers in the asymmetric unit, with a solvent content of 35.7%. The space group was determined to be P1211 with unit cell dimensions: a = 43.4 Å, b = 116.1 Å, c = 45.2 Å, with β = 112.45° (Table 1).
Considering the calculated Matthews coefficient and because there was no dockerin structure available from the cellulosome of B. cellulosolvens, molecular replacement attempts were performed searching just for two copies of the cohesin module in the monoclinic P2 cell. The previously described and available crystal structure of the ScaA type II cohesin module from B. cellulosolvens, with accession code 1tyj (15), was used as search model for molecular replacement. The Patterson search was done with program PHASER (34), implemented in the CCP4 interface (33), and a clear solution with two cohesins in the asymmetric unit was found in space group P21.
Initial building of the structures into the electron density as well as building of the dockerin modules was performed using the software ARP/wARP (19) and any remaining residues were built interactively using program COOT (20). Model refinement and electron density map calculations were done with program REFMAC5 (21) from the CCP4 suite (33). The final round of refinement was performed using the TLS/restrained refinement procedure using each module as a single group. The root mean square deviation of bond lengths, bond angles, torsion angles, and other indicators were continuously monitored using validation tools in COOT and MOLPROBITY (35). The final model has Rwork = 16.3% and Rfree = 22.5% and includes 299 water molecules and four calcium ions. Due to disorder, residues Met1, Ala2, and the 6 C-terminal histidine residues of chain A (cohesin), stretches Gly32-Asn37 and Ala66-Phe71 of chain B (dockerin), Met1, Ala2, Leu174, Glu175, and the 6 C-terminal histidine residues of chain C (cohesin) and Ala30-Asn44 and Ser65-Phe71 of chain D (dockerin) could not be built. The structure is available in the Protein Data Bank under the accession code 2y3n.
Structural alignments
Structural alignments were performed with the Matchmaker tool of UCSF Chimera using the Needleman-Wunsch algorithm and BLOSUM62 matrix (36).
Isothermal titration calorimetry (ITC)
All ITC experiments were carried out at 308 K. The purified wild-type BcDocCel48 and mutant variants and BcCohScaA111 were diluted to the required concentrations and filtered using a 0.45-μm syringe filter (PALL). During titrations, the Doc constructs were stirred at 307 revolutions/min in the reaction cell and titrated with 28 successive 10 μl injections of BcCohScaA111 at 220-s intervals. Integrated heat effects, after correction for heats of dilution, were analyzed by nonlinear regression using a single-site model (Microcal ORIGIN version 7.0, Microcal Software). The fitted data yielded the association constant (KA) and the enthalpy of binding (ΔH). Other thermodynamic parameters were calculated using the standard thermodynamic equation: ΔRTlnKA = ΔG = ΔH − TΔS.
Nondenaturing gel electrophoresis
For the NGE experiments each of the BcDocCel48 and BcDocCel48 variants, at a concentration of 30 μM, was incubated in the presence and absence of 30 μM BcCohScaA111 for 30 min at room temperature and separated on a 10% native polyacrylamide gel. Electrophoresis was carried out at room temperature. The gels were stained with Coomassie Blue. Complex formation was detected by the presence of an additional band displaying a distinct electrophoretic mobility from the one presented by the individual modules.
Data availability
Coordinates and structure factors have been deposited in the Protein Data Bank under accession code PDB 2Y3N [https://www.rcsb.org/structure/2Y3N]. All further data supporting the findings of this study are available from the corresponding author, upon reasonable request.
Supporting information
This article contains supporting information.
Conflict of interest
The authors declare that they have no conflicts of interest with the contents of this article.
Acknowledgments
The authors wish to thank Prof. Maria João Romão for access to the Macromolecular Crystallography Facilities in FCT-NOVA and for valuable discussions. We acknowledge the European Synchrotron Radiation Facility (ESRF) for access to the synchrotron facility through BAG-Portugal.
Author contributions
P. B. and C. M. G. A. F. have planned the experiments. P. B., A. V., A. L. C., M. D., V. D. A., and E. J. C. have interpreted the experiments. A. V. has purified and crystalized the complex. A. V. and A. L. C. have collected the X-ray diffraction data and solved the Coh-Doc complex structure. M. D. and P. B. have produced the mutants and carried the binding experiments. P. B., V. D. A., A. V., and M. D. have written the article. P. B. and V. D. A. have produced the figures. C. M. G. A. F., A. L. C., L. M. A. F., J. A. M. P., and S. N. have revised the article.
Funding and additional information
This work was supported by the Portuguese Foundation for Science and Technology (FCT-MCTES) through the Centro de Investigação Interdisciplinar em Sanidade Animal (CIISA) grant UIDB/00276/2020, the Applied Molecular Biosciences Unit (UCIBIO) grant UIDB/04378/2020, and through the National Project grant RECI/BBB-BEP/0124/2012. M. D. is supported by a PhD studentship (SFRH/BD/146965/2019) from FCT-MCTES. A. V. was supported by a PhD studentship (SFRH/BD/35992/2007) FCT-MCTES.
Edited by Gerald Hart
Contributor Information
Ana Luísa Carvalho, Email: almc@fct.unl.pt.
Pedro Bule, Email: pedrobule@fmv.ulisboa.pt.
Supporting information
References
- 1.Giuliano C., Khan A.W. Cellulase and sugar formation by Bacteroides cellulosolvens, a newly isolated cellulolytic anaerobe. Appl. Environ. Microbiol. 1984;48:446–448. doi: 10.1128/aem.48.2.446-448.1984. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2.Giuliano C., Khan A.W. Conversion of cellulose to sugars by resting cells of a mesophilic anaerobe, Bacteriodes cellulosolvens. Biotechnol. Bioeng. 1985;27:980–983. doi: 10.1002/bit.260270708. [DOI] [PubMed] [Google Scholar]
- 3.Ding S.-Y., Bayer E.A., Steiner D., Shoham Y., Lamed R. A scaffoldin of the Bacteroides cellulosolvens cellulosome that contains 11 type II cohesins. J. Bacteriol. 2000;182:4915–4925. doi: 10.1128/jb.182.17.4915-4925.2000. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.Zhivin O., Dassa B., Moraïs S., Utturkar S.M., Brown S.D., Henrissat B., Lamed R., Bayer E.A. Unique organization and unprecedented diversity of the Bacteroides (Pseudobacteroides) cellulosolvens cellulosome system. Biotechnol. Biofuels. 2017;10:211. doi: 10.1186/s13068-017-0898-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Fontes C.M.G.A., Gilbert H.J. Cellulosomes: Highly efficient nanomachines designed to deconstruct plant cell wall complex carbohydrates. Annu. Rev. Biochem. 2010;79:655–681. doi: 10.1146/annurev-biochem-091208-085603. [DOI] [PubMed] [Google Scholar]
- 6.Carvalho A.L., Dias F.M.V., Nagy T., Prates J.A.M., Proctor M.R., Smith N., Bayer E.A., Davies G.J., Ferreira L.M.A., Romão M.J., Fontes C.M.G.A., Gilbert H.J. Evidence for a dual binding mode of dockerin modules to cohesins. Proc. Natl. Acad. Sci. U. S. A. 2007;104:3089–3094. doi: 10.1073/pnas.0611173104. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Adams J.J., Webb B.A., Spencer H.L., Smith S.P. Structural characterization of type II dockerin module from the cellulosome of Clostridium thermocellum: Calcium-induced effects on conformation and target recognition †. Biochemistry. 2005;44:2173–2182. doi: 10.1021/bi048039u. [DOI] [PubMed] [Google Scholar]
- 8.Brás J.L.A., Pinheiro B.A., Cameron K., Cuskin F., Viegas A., Najmudin S., Bule P., Pires V.M.R., Romão M.J., Bayer E.A., Spencer H.L., Smith S., Gilbert H.J., Alves V.D., Carvalho A.L. Diverse specificity of cellulosome attachment to the bacterial cell surface. Sci. Rep. 2016;6:38292. doi: 10.1038/srep38292. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Bayer E.A., Belaich J.P., Shoham Y., Lamed R. The cellulosomes: Multienzyme machines for degradation of plant cell wall polysaccharides. Annu. Rev. Microbiol. 2004;58:521–554. doi: 10.1146/annurev.micro.57.030502.091022. [DOI] [PubMed] [Google Scholar]
- 10.Cameron K., Najmudin S., Alves V.D., Bayer E.A., Smith S.P., Bule P., Waller H., Ferreira L.M.A., Gilbert H.J., Fontes C.M.G.A. Cell-surface attachment of bacterial multienzyme complexes involves highly dynamic protein-protein anchors. J. Biol. Chem. 2015;290:13578–13590. doi: 10.1074/jbc.M114.633339. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Bule P., Alves V.D., Leitão A., Ferreira L.M.A., Bayer E.A., Smith S.P., Gilbert H.J., Najmudin S., Fontes C.M.G.A. Single-binding mode integration of hemicellulose degrading enzymes via adaptor scaffoldins in Ruminococcus flavefaciens cellulosome. J. Biol. Chem. 2016;291:26658–26669. doi: 10.1074/jbc.M116.761643. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Dassa B., Utturkar S., Hurt R.A., Klingeman D.M., Keller M., Xu J., Reddy Y.H.K., Borovok I., Rozman Grinberg I., Lamed R., Zhivin O., Bayer E.A., Brown S.D. Near-complete genome sequence of the cellulolytic bacterium Bacteroides (Pseudobacteroides) cellulosolvens ATCC 35603. Genome Announc. 2015;3 doi: 10.1128/genomeA.01022-15. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Zhao G., Li H., Wamalwa B., Sakka M., Kimura T., Sakka K. Different binding specificities of S-layer homology modules from Clostridium thermocellum AncA, Slp1, and Slp2. Biosci. Biotechnol. Biochem. 2006;70:1636–1641. doi: 10.1271/bbb.50699. [DOI] [PubMed] [Google Scholar]
- 14.Bule P., Cameron K., Prates J.A.M., Ferreira L.M.A., Smith S.P., Gilbert H.J., Bayer E.A., Najmudin S., Fontes C.M.G.A., Alves V.D. Structure–function analyses generate novel specificities to assemble the components of multi-enzyme bacterial cellulosome complexes. J. Biol. Chem. 2018;293:4201–4212. doi: 10.1074/jbc.RA117.001241. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Noach I., Frolow F., Jakoby H., Rosenheck S., Shimon L.W., Lamed R., Bayer E.A. Crystal structure of a type-II cohesin module from the Bacteroides cellulosolvens cellulosome reveals novel and distinctive secondary structural elements. J. Mol. Biol. 2005;348:1–12. doi: 10.1016/j.jmb.2005.02.024. [DOI] [PubMed] [Google Scholar]
- 16.Yaniv O., Shimon L.J.W., Bayer E.A., Lamed R., Frolow F. Scaffoldin-borne family 3b carbohydrate-binding module from the cellulosome of Bacteroides cellulosolvens: Structural diversity and significance of calcium for carbohydrate binding. Acta Crystallogr. D Biol. Crystallogr. 2011;67:506–515. doi: 10.1107/S0907444911011322. [DOI] [PubMed] [Google Scholar]
- 17.Adams J.J., Pal G., Jia Z., Smith S.P. Mechanism of bacterial cell-surface attachment revealed by the structure of cellulosomal type II cohesin-dockerin complex. Proc. Natl. Acad. Sci. U. S. A. 2006;103:305–310. doi: 10.1073/pnas.0507109103. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18.Haimovitz R., Barak Y., Morag E., Voronov-Goldman M., Shoham Y., Lamed R., Bayer E.A. Cohesin-dockerin microarray: Diverse specificities between two complementary families of interacting protein modules. Proteomics. 2008;8:968–979. doi: 10.1002/pmic.200700486. [DOI] [PubMed] [Google Scholar]
- 19.Langer G., Cohen S.X., Lamzin V.S., Perrakis A. Automated macromolecular model building for X-ray crystallography using ARP/wARP version 7. Nat. Protoc. 2008;3:1171–1179. doi: 10.1038/nprot.2008.91. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20.Emsley P., Lohkamp B., Scott W.G., Cowtan K. Features and development of Coot. Acta Crystallogr. D Biol. Crystallogr. 2010;66:486–501. doi: 10.1107/S0907444910007493. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Murshudov G.N., Skubák P., Lebedev A.A., Pannu N.S., Steiner R.A., Nicholls R.A., Winn M.D., Long F., Vagin A.A. REFMAC 5 for the refinement of macromolecular crystal structures. Acta Crystallogr. D Biol. Crystallogr. 2011;67:355–367. doi: 10.1107/S0907444911001314. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.Carvalho A.L., Dias F.M.V., Prates J.A.M., Nagy T., Gilbert H.J., Davies G.J., Ferreira L.M.A., Romão M.J., Fontes C.M.G.A. Cellulosome assembly revealed by the crystal structure of the cohesin-dockerin complex. Proc. Natl. Acad. Sci. U. S. A. 2003;100:13809–13814. doi: 10.1073/pnas.1936124100. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.Pinheiro B.A., Proctor M.R., Martinez-Fleites C., Prates J.A.M., Money V.A., Davies G.J., Bayer E.A., Fontesm C.M.G.A., Fierobe H.-P., Gilbert H.J. The Clostridium cellulolyticum dockerin displays a dual binding mode for its cohesin partner. J. Biol. Chem. 2008;283:18422–18430. doi: 10.1074/jbc.M801533200. [DOI] [PubMed] [Google Scholar]
- 24.Bule P., Alves V.D., Israeli-Ruimy V., Carvalho A.L., Ferreira L.M.A., Smith S.P., Gilbert H.J., Najmudin S., Bayer E.A., Fontes C.M.G.A. Assembly of Ruminococcus flavefaciens cellulosome revealed by structures of two cohesin-dockerin complexes. Sci. Rep. 2017;7:759. doi: 10.1038/s41598-017-00919-w. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.Bule P., Pires V.M.R., Alves V.D., Carvalho A.L., Prates J.A.M., Ferreira L.M.A., Smith S.P., Gilbert H.J., Noach I., Bayer E.A., Najmudin S., Fontes C.M.G.A. Higher order scaffoldin assembly in Ruminococcus flavefaciens cellulosome is coordinated by a discrete cohesin-dockerin interaction. Sci. Rep. 2018;8:6987. doi: 10.1038/s41598-018-25171-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26.Israeli-Ruimy V., Bule P., Jindou S., Dassa B., Moraïs S., Borovok I., Barak Y., Slutzki M., Hamberg Y., Cardoso V., Alves V.D., Najmudin S., White B.A., Flint H.J., Gilbert H.J. Complexity of the Ruminococcus flavefaciens FD-1 cellulosome reflects an expansion of family-related protein-protein interactions. Sci. Rep. 2017;7:42355. doi: 10.1038/srep42355. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27.Yao X., Chen C., Wang Y., Dong S., Liu Y.-J., Li Y., Cui Z., Gong W., Perrett S., Yao L., Lamed R., Bayer E.A., Cui Q., Feng Y. Discovery and mechanism of a pH-dependent dual-binding-site switch in the interaction of a pair of protein modules. Sci. Adv. 2020;6 doi: 10.1126/sciadv.abd7182. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28.Vera A.M., Galera-Prat A., Wojciechowski M., Różycki B., Laurents D.V., Carrión-Vázquez M., Cieplak M., Tinnefeld P. Cohesin-dockerin code in cellulosomal dual binding modes and its allosteric regulation by proline isomerization. Structure. 2021 doi: 10.1016/j.str.2021.01.006. [DOI] [PubMed] [Google Scholar]
- 29.Bule P., Ruimy-Israeli V., Cardoso V., Bayer E.A., Fontes C.M.G.A., Najmudin S. Overexpression, crystallization and preliminary X-ray characterization of Ruminococcus flavefaciens scaffoldin C cohesin in complex with a dockerin from an uncharacterized CBM-containing protein. Acta Crystallogr. F Struct. Biol. Commun. 2014;70:1061–1064. doi: 10.1107/S2053230X14012667. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30.Bule P., Correia A., Cameron K., Alves V.D., Cardoso V., Fontes C.M.G.A., Najmudin S. Overexpression, purification, crystallization and preliminary X-ray characterization of the fourth scaffoldin A cohesin from Acetivibrio cellulolyticus in complex with a dockerin from a family 5 glycoside hydrolase. Acta Crystallogr. F Struct. Biol. Commun. 2014;70:1065–1067. doi: 10.1107/S2053230X14013181. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31.Battye T.G.G., Kontogiannis L., Johnson O., Powell H.R., Leslie A.G.W. iMOSFLM: A new graphical interface for diffraction-image processing with MOSFLM. Acta Crystallogr. D Biol. Crystallogr. 2011;67:271–281. doi: 10.1107/S0907444910048675. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32.Evans P. Scaling and assessment of data quality. Acta Crystallogr. D Biol. Crystallogr. 2006;62:72–82. doi: 10.1107/S0907444905036693. [DOI] [PubMed] [Google Scholar]
- 33.Cowtan K., Emsley P., Wilson K.S. From crystal to structure with CCP4. Acta Crystallogr. D Biol. Crystallogr. 2011;67:233–234. doi: 10.1107/S0907444911007578. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34.McCoy A.J., Grosse-Kunstleve R.W., Adams P.D., Winn M.D., Storoni L.C., Read R.J. Phaser crystallographic software. J. Appl. Crystallogr. 2007;40:658–674. doi: 10.1107/S0021889807021206. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 35.Chen V.B., Arendall W.B., Headd J.J., Keedy D.A., Immormino R.M., Kapral G.J., Murray L.W., Richardson J.S., Richardson D.C. MolProbity: All-atom structure validation for macromolecular crystallography. Acta Crystallogr. D Biol. Crystallogr. 2010;66:12–21. doi: 10.1107/S0907444909042073. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36.Pettersen E.F., Goddard T.D., Huang C.C., Couch G.S., Greenblatt D.M., Meng E.C., Ferrin T.E. UCSF Chimera--a visualization system for exploratory research and analysis. J. Comput. Chem. 2004;25:1605–1612. doi: 10.1002/jcc.20084. [DOI] [PubMed] [Google Scholar]
- 37.Bond C.S., Schüttelkopf A.W. Aline: A WYSIWYG protein-sequence alignment editor for publication-quality alignments. Acta Crystallogr. D Biol. Crystallogr. 2009;65:510–512. doi: 10.1107/S0907444909007835. [DOI] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
Coordinates and structure factors have been deposited in the Protein Data Bank under accession code PDB 2Y3N [https://www.rcsb.org/structure/2Y3N]. All further data supporting the findings of this study are available from the corresponding author, upon reasonable request.





