Abstract
Background
Cellobiohydrolase from glycoside hydrolase family 7 is a major component of commercial enzymatic mixtures for lignocellulosic biomass degradation. For many years, Trichoderma reesei Cel7A (TrCel7A) has served as a model to understand structure–function relationships of processive cellobiohydrolases. The architecture of TrCel7A includes an N-glycosylated catalytic domain, which is connected to a carbohydrate-binding module through a flexible, O-glycosylated linker. Depending on the fungal expression host, glycosylation can vary not only in glycoforms, but also in site occupancy, leading to a complex pattern of glycans, which can affect the enzyme’s stability and kinetics.
Results
Two expression hosts, Aspergillus oryzae and Trichoderma reesei, were utilized to successfully express wild-types TrCel7A (WTAo and WTTr) and the triple N-glycosylation site deficient mutants TrCel7A N45Q, N270Q, N384Q (ΔN-glycAo and ΔN-glycTr). Also, we expressed single N-glycosylation site deficient mutants TrCel7A (N45QAo, N270QAo, N384QAo). The TrCel7A enzymes were studied by steady-state kinetics under both substrate- and enzyme-saturating conditions using different cellulosic substrates. The Michaelis constant (KM) was consistently found to be lowered for the variants with reduced N-glycosylation content, and for the triple deficient mutants, it was less than half of the WTs’ value on some substrates. The ability of the enzyme to combine productively with sites on the cellulose surface followed a similar pattern on all tested substrates. Thus, site density (number of sites per gram cellulose) was 30–60% higher for the single deficient variants compared to the WT, and about twofold larger for the triple deficient enzyme. Molecular dynamic simulation of the N-glycan mutants TrCel7A revealed higher number of contacts between CD and cellulose crystal upon removal of glycans at position N45 and N384.
Conclusions
The kinetic changes of TrCel7A imposed by removal of N-linked glycans reflected modifications of substrate accessibility. The presence of N-glycans with extended structures increased KM and decreased attack site density of TrCel7A likely due to steric hindrance effect and distance between the enzyme and the cellulose surface, preventing the enzyme from achieving optimal conformation. This knowledge could be applied to modify enzyme glycosylation to engineer enzyme with higher activity on the insoluble substrates.
Keywords: GH7 cellulase, Trichoderma reesei Cel7A, Aspergillus oryzae, N-Glycosylation, Heterogeneous interfacial enzyme kinetics, MD
Background
Cellulose-degrading enzymes were first discovered in the secretome of the filamentous ascomycete Trichoderma reesei almost 65 years ago [1]. Since then, they have proven essential in commercial enzyme cocktails used in biorefineries that produce fuels and chemicals from lignocellulosic biomass. Due to the complexity, recalcitrance and insolubility of the plant biomass [2], high enzyme titers must be used to ensure efficient biomass hydrolysis, and this challenges the economic feasibility of the process. To overcome this, extensive research has sought to either engineer catalytically more efficient enzymes or to develop more efficient expression hosts such as Trichoderma reesei [3], Saccharomyces cerevisiae [4] and Aspergillus niger [5]. The latter effort has enabled industrial production of cellulases, but usually with a range of isoforms with different apparent molecular weights [6]. This is attributed to the ability of the fungal expression host to decorate proteins with short oligosaccharides. Such post-translational modification called glycosylation can occur at either threonine (T) or serine (S) residues (O-glycosylation), or at asparagine (N) residues (N-glycosylation), which have a general consensus motif N-X-S/T (where X denotes any amino acid residue except proline) [6].
The dominant cellulase in the secretome of T. reesei is the cellobiohydrolase TrCel7A (EC 3.2.1.176) [7], classified to glycoside hydrolase (GH) family 7 in the CAZy database [8]. The architecture of TrCel7A includes a catalytic domain (CD), with three N-glycosylation sites, which is connected to a carbohydrate-binding module (CBM) through a flexible, heavily O-glycosylated linker peptide [9]. Most of the N-glycans on TrCel7A have been characterized as high mannose (Man) type, containing from Man5–9 residues linked to a chitobiose core of two N-acetylglucosamine (GlcNAc)2 units, whereas the O-glycans consist mainly of Man1–4 randomly distributed in both the linker region and CBM domain with the majority bound to the linker of TrCel7A [7, 10]. Recently, Amore et al. [3] performed extensive mass spectrometry (MS) characterization of the different N-glycoforms of TrCel7A expressed in T. reesei, and this work indicated a broader complexity of N-glycans, including the presence of fucose, galactose or additional N-acetylglucosamine residues. The glycan complexity is not only influenced by the expression hosts and their extracellular activities of glycosidases and transferases, but also the composition of the growth media [11]. The functional roles of the N-glycosylation remain elusive, although it has been shown that disruption of N-glycosylation motifs lowers TrCel7A’s thermal [3, 12] and proteolytic stability [3]. Studies on the influence of N-glycosylation on the activity and cellulose adsorption of the CD TrCel7A have shown either promoting [5, 12] or no effects [3]. Most of these studies [3, 5, 12] evaluated the effect of N-glycosylation based on prolonged experiments, usually including the addition of synergistic cellulases. While these conditions mirror technical applications, they might complicate the molecular understanding of TrCel7A N-glycosylation. In the current work, we focused on 1 hour kinetic measurements, which have proven to provide a reasonable proxy for initial rates in comparative kinetic studies of cellobiohydrolases [13]. We performed an extensive biochemical characterization of a number of TrCel7A variants with different glycosylation patterns. Specifically, we produced both the wild-type and variants with mutated N-glycosylation motifs in two different expression hosts (the native T. reesei and the heterologous A. oryzae). The wide differences in the N-glycosylation pattern allowed us to identify important functional properties of the TrCel7A glycans.
Results
Thermal stability and intact protein mass spectrometry (MS) of TrCel7A and variants
In this work, we prepared and characterized a number of TrCel7A variants (Table 1). The wild-type (WT) and listed variants of TrCel7A were successfully expressed and purified as the full-length enzymes in Aspergillus oryzae and Trichoderma reesei. All the purified enzymes showed only one band in SDS-PAGE (Fig. 1). For the TrCel7A variants, the successful disruption of the N-glycosylation motifs through N to Q site-directed mutagenesis was verified by a decreased apparent molecular weight as shown by both SDS-PAGE (Fig. 1) and changes in the mass profiles of intact protein MS (Additional file 1: Fig. S1). At the same time, all the enzymes’ apparent MW was higher than the theoretical value of 52222 Da (see “Materials and methods” section), indicating a glycosylated protein with all functional domains. Among the TrCel7A variants expressed in A. oryzae, the most pronounced shift in molecular weight compared to the WTAo sequence was observed for the N45QAo, which migrated nearly as the ΔN-glycAo (N45Q, N270Q, N384Q triple mutant).
Table 1.
TrCel7A | Mutation | Expression host | Tm (°C) | Residual activity (%) |
---|---|---|---|---|
WTAo | Aspergillus oryzae | 67.9 | 57 ± 1 | |
N45QAo | N45Q | Aspergillus oryzae | 67.4 | 38 ± 2 |
N270QAo | N270Q | Aspergillus oryzae | 66.9 | 60 ± 3 |
N384QAo | N384Q | Aspergillus oryzae | 66.1 | 57 ± 3 |
ΔN-glycAo | N45Q, N270Q, N384Q | Aspergillus oryzae | 65.0 | 41 ± 2 |
WTTr | Trichoderma reesei | 67.9 | 45 ± 2 | |
ΔN-glycTr | N45Q, N270Q, N384Q | Trichoderma reesei | 65.2 | 44 ± 1 |
The mutated positions of N-glycosylation sites are presented in Fig. 2. The thermal stability, Tm, was determined from DSC. Residual activity is expressed as the activity after pre-incubation in the standard buffer for 1 h at T = Tm− 5 °C divided by the activity of the sample incubated at 25 °C. The activity was measured on 3 mM pNP-Lac in 50 mM sodium acetate buffer pH 5.0 for 30 min at 25 °C
Thermal stability as measured by differential scanning calorimetry (DSC) was only moderately different among the glycovariants, and this supports the view that their overall fold was unaffected by the mutations (Additional file 1: Fig. S2). The transition temperatures of both the WTs and the ∆N-glyc variants were independent of the expression organism, although a small decrease in melting temperature (Tm) was observed for the variants. The ∆N-glyc mutants exhibited the largest reduction of 2–3 °C in Tm as compared to the WT (Table 1). In case of the single mutation variants, the highest decrease in Tm was observed for the N384QAo, compared to WTAo, which is in accordance with previous studies by Amore et al. [3] and Adney et al. [12]. Kinetic stability (residual activity at 25 °C following 1 hour exposure to a temperature equal to Tm− 5 °C) was decreased from 57% in the wild-type to 41% in ΔN-glycAo and N45QAo. For enzymes expressed in T. reesei, no changes were detected in the residual activity of these two forms. Finally, removal of the glycan at position N45 lowered the activity as much as the removal of all three N-glycans (Table 1). Moreover, neither the glycan at N270 nor N384 appeared to promote kinetic stability (unchanged residual activity for both N270QAo and N384QAo).
Intact protein MS revealed a distinct difference in how the two expression hosts, T. reesei and A. oryzae, glycosylate the same protein (Additional file 1: Fig. S1). The WTAo showed a high mass heterogeneity, from which it was not possible to reconstruct the mass profile (Additional file 1: Fig. S1 D). The glycosylation pattern of WTTr is less heterogeneous, showing a clear mass distribution (Additional file 1: Fig. S1 A) and a lower average mass of total glycans (Table 2) as also observed by SDS-PAGE analysis (Fig. 1). Once all the N-glycans were removed (ΔN-glyc variants), it was possible to reconstruct a mass profile in all cases (Additional file 1: Fig. S1 C, E), and the remaining mass difference was mostly due to O-glycosylation of the linker and CBM [3]. In the case of the single N-glycan variants, removal of glycans at position N45 was sufficient to reconstruct the mass profile of the TrCel7AAo (Additional file 1: Fig. S1 F) while the other single variants retained high glycan heterogeneity (Additional file 1: Fig. S1 G, H). The estimation of mass differences for each enzyme is presented in Table 2.
Table 2.
TrCel7A | Molecular weight (MW) (kDa)a | |||
---|---|---|---|---|
Theoretical MW | MW contribution of N-glycosylation | MW contribution of O-glycosylation | Calculated MW | |
WTTr | 52 | 5–7 | 3–4 | 60–63 |
WTAo | 52 | 7–10 | 3–6 | 62–68 |
aValues were estimated based on the mass spectra presented in Additional file 1: Figure S1 A–H
Steady-state kinetics on insoluble cellulose substrates
To closer investigate the effect of glycosylation, extensive kinetic characterization of TrCel7A WTs and the variants was carried out using three insoluble cellulosic substrates, Avicel, RAC and BMCC, with widely varying physical properties (see “Materials and Methods”). To evaluate how the substrates’ properties and the changes in the N-glycosylation pattern influenced the kinetics, two quasi-steady-state approaches were implemented [16, 17]. The first, was the conventional Michaelis–Menten (convMM) approach, using an excess of the substrate. As in usual (bulk) steady-state kinetics, this entails rate measurements with a constant and low enzyme concentration and gradually increasing substrate loads (Fig. 3a, Additional file 1: Fig. S3 A, D). Specific rates (νss/E0) were plotted against substrate load and non-linear regression analyses were performed accordingly to Eq. (1).
1 |
The parameters’ maximal specific rate (convVmax/E0) and Michaelis constant (convKM) are found in Table 3. All the TrCel7A variants were active on the tested substrates. In contrast to previous studies [12, 18], this supports the conclusion from the DSC measurements that the investigated N-glycan variants retained the overall fold of Cel7A.
Table 3.
TrCel7A | convMM | invMM | Kinetic substrate accessibility | Adsorption isotherms | ||||
---|---|---|---|---|---|---|---|---|
convVmax/E0 | convKM | η | invVmax/S0 | invKM | Γattack | Γmax | Kd | |
(s−1) | (g/ L) | (L/g/s) | (μmol/g/s) | (μM) | (μmol/g) | (μmol/g) | (μM) | |
Avicel | ||||||||
WTAo | 0.40 ± 0.01a | 18.1 ± 1.8a | 0.022 | 0.034 ± 0.001 | 2.45 ± 0.18 | 0.08 ± 0.00 | 0.1 ± 0.01 | 0.26 ± 0.06 |
N45QAo | 0.34 ± 0.02 | 11.6 ± 2.5 | 0.030 | 0.048 ± 0a | 2.92 ± 0.04a | 0.14 ± 0.01 | 0.23 ± 0.02a | 0.69 ± 0.21 |
N270QAo | 0.35 ± 0.02 | 11.6 ± 2.4 | 0.030 | 0.031 ± 0.001 | 2.22 ± 0.2 | 0.09 ± 0.01 | 0.13 ± 0.01 | 1 ± 0.28 |
N384QAo | 0.35 ± 0.02 | 10.6 ± 1.5 | 0.033 | 0.037 ± 0.001a | 2.54 ± 0.25a | 0.11 ± 0.01 | 0.10 ± 0.01 | 0.3 ± 0.18 |
ΔN-glycAo | 0.24 ± 0.01a | 6.2 ± 0.8 | 0.040 | 0.048 ± 0.002a | 3.54 ± 0.3a | 0.20 ± 0.01 | 0.27 ± 0.02a | 1.01 ± 0.22a |
WTTr | 0.32 ± 0.01 | 9 ± 1.2 | 0.035 | 0.02 ± 0.001a | 1.16 ± 0.09a | 0.06 ± 0.00 | 0.08 ± 0.01a | 0.27 ± 0.08 |
ΔN-glycTr | 0.32 ± 0.01 | 7.4 ± 0.9 | 0.043 | 0.032 ± 0.001 | 1.71 ± 0.12 | 0.10 ± 0.00 | 0.11 ± 0.01 | 0.4 ± 0.11 |
RAC | ||||||||
WTAo | 0.48 ± 0.02a | 1.3 ± 0.2a | 0.37 | 0.65 ± 0.01 | 0.95 ± 0.06 | 1.4 ± 0.1 | 0.92 ± 0.04a | 0.03 ± 0.01 |
N45QAo | 0.37 ± 0.02 | 0.9 ± 0.1 | 0.41 | 0.74 ± 0.02a | 1.23 ± 0.1a | 2.0 ± 0.1 | 1.79 ± 0.0 a | 0.05 ± 0a |
N270QAo | 0.38 ± 0.02 | 1.1 ± 0.2 | 0.33 | 0.61 ± 0.01 | 1.15 ± 0.08 | 1.6 ± 0.1 | 1.14 ± 0.05a | 0.03 ± 0.01 |
N384QAo | 0.34 ± 0.01a | 0.8 ± 0.1 | 0.43 | 0.62 ± 0.02 | 1.03 ± 0.08 | 1.8 ± 0.1 | 1.2 ± 0.08a | 0.02 ± 0.01 |
ΔN-glycAo | 0.30 ± 0.01a | 0.5 ± 0.0a | 0.40 | 0.68 ± 0.01a | 0.82 ± 0.05 | 2.9 ± 0.1 | 1.58 ± 0.04 | 0.11 ± 0.01a |
WTTr | 0.47 ± 0.01a | 1 ± 0.1 | 0.45 | 0.46 ± 0.01a | 0.94 ± 0.09 | 1.0 ± 0.0 | 1.83 ± 0.18a | 0.01 ± 0.01a |
ΔN-glycTr | 0.47 ± 0.01a | 0.8 ± 0.1 | 0.58 | 0.59 ± 0.02 | 0.93 ± 0.1 | 1.3 ± 0.0 | 1.59 ± 0.16 | 0.03 ± 0.01 |
BMCC | ||||||||
WTAo | 1.46 ± 0.14a | 2.1 ± 0.4a | 0.71 | 0.45 ± 0.02a | 1.59 ± 0.15a | 0.31 ± 0.03 | 1.17 ± 0.12 | 1.28 ± 0.29 |
N45QAo | 1.33 ± 0.09a | 1.7 ± 0.2a | 0.77 | 0.52 ± 0.02a | 1.23 ± 0.11 | 0.39 ± 0.03 | 1.96 ± 0.35a | 1.2 ± 0.44 |
N270QAo | 1.13 ± 0.06a | 1.1 ± 0.1 | 1.01 | 0.44 ± 0.02 | 1.49 ± 0.14a | 0.39 ± 0.03 | 0.93 ± 0.15a | 0.99 ± 0.38 |
N384QAo | 1.01 ± 0.03 | 1 ± 0.1 | 1.02 | 0.38 ± 0.02 | 1.29 ± 0.15 | 0.38 ± 0.02 | 1.06 ± 0.04 | 0.85 ± 0.08 |
ΔN-glycAo | 0.96 ± 0.09 | 1.6 ± 0.3a | 0.59 | 0.48 ± 0.01a | 1.38 ± 0.1a | 0.50 ± 0.05 | 1.2 ± 0.06 | 0.62 ± 0.09 |
WTTr | 0.63 ± 0.03a | 0.7 ± 0.1 | 0.89 | 0.29 ± 0.01a | 0.66 ± 0.06a | 0.46 ± 0.03 | 1.34 ± 0.11 | 0.48 ± 0.12 |
ΔN-glycTr | 0.76 ± 0.03 | 0.6 ± 0.1a | 1.33 | 0.4 ± 0.01 | 0.82 ± 0.06a | 0.53 ± 0.03 | 1.71 ± 0.04a | 0.56 ± 0.04 |
The parameters were derived from convMM, invMM and binding isotherm using Avicel, RAC and BMCC. The ± values correspond to the error of non-linear fit of Michaelis–Menten curves and binding isotherm curves. The parameters statistically different from the others at the 0.05 level of significance are indicated with letter ‘a’ (Additional file 1: Table S2)
The convMM analysis (Table 3) demonstrated a correlation between KM and the degree of glycosylation. Thus, a reduction in the degree of N-glycosylation was consistently associated with lowered KM. This was particularly evident on Avicel and BMCC, and it appeared both when comparing WTs and ΔN-glyc variants and with the WT expressed in different hosts (i.e. WTTr exhibited a lower KM as compared to the more extensively glycosylated WTAo, c.f. Table 2).
In the well-known convMM approach, saturation represents the situation where all enzymes are complexed with the substrate, and the saturation rate hence reflects the maximal turnover (kcat = convVmax/E0). In the current work, steady-state kinetics in the opposite limit was also investigated, where the enzyme is in excess (inverse Michaelis–Menten, invMM). In this approach, measurements were made with 1 h reaction rates at a constant, low substrate load and gradually increasing enzyme concentration. This method has been reported [19] to be specifically useful for heterogeneous systems because a steady-state condition may occur also when the enzyme is in excess. Derivation of the associated rate equation (Eq. 2) [20, 21] as well as assumptions and limitations of this approach have been discussed elsewhere [17, 19].
2 |
In the invMM approach, saturation represents the situation where all attack sites on the surface of the insoluble substrate are complexed with an enzyme, and the maximal rate hence represents the product of the maximal turnover and the density of attack sites, invVmax/S0 = kcatΓattack, where S0 is the mass load of a substrate and Γattack the number of sites per mass unit of substrate [17].
It follows that one may calculate the density of attack sites Γattack (also named productive binding capacity [22]) as the ratio of two specific maximal rates as shown in Eq. (3).
3 |
The data points obtained from the invMM measurements (Fig. 3b and Additional file 1: Fig. S3 B, E) were subjected to the best fit from non-linear regression (Eq. 2). Based on this analysis, the parameters, as well as Γ attack values (Eq. 3), were calculated (Table 3). These results also show some interesting systematic trends. In particular, Γattack is increased in variants that do not have the glycan in position N45. This is true for both the N45QAo and the ∆N-glycAo and ∆N-glycTr. Removal of the other two N-glycans (N270QAo and N384QAo) also improved the enzyme’s ability to “find” attack sites on the cellulose surface, but the effect was less pronounced.
Binding isotherms
Surface coverage (Γ, in µmol/g) was derived from measurements of free enzyme concentration in 1-h equilibrated enzyme–substrate suspensions. Cruys-Bagger et al. [23] presented that equilibrium state on bacterial cellulose was achieved after 10 min and it is assumed the same for other cellulosic substrates used here. The experimental data were fitted to a simple Langmuir isotherm (Eq. 4) (Fig. 3c and Additional file 1: Fig. S3 C, F). While the adsorption mechanism of Cel7A is complex [24, 25], simple Langmuir parameters have often been used for comparative purposes.
4 |
The maximal adsorption capacities (Γmax) on Avicel, RAC and BMCC were higher for the N45QAo and ΔN-glycAo relative to WTAo (Table 3). As the Γmax varied only moderately for the other two variants (N270QAo and N384QAo), elimination of the N-glycan at position N45 seems to promote surface coverage on all cellulosic substrates. For the TrCel7A expressed in T. reesei, the difference in Γmax between WTTr and ΔN-glycTr was much lower than for the same enzyme pair expressed in A. oryzae.
Γmax and Γattack are related parameters in as much as they both quantify a site density on the cellulose surface. However, they differ as Γmax enumerates all sites (productive and non-productive) whereas Γattack specifically describes productive sites. Despite this difference, the two parameters are quite similar and show the same tendencies with respect to glycan dependence (Table 3).
Molecular dynamics (MD) simulations of TrCel7A N-glycan variants on cellulose crystals
To further elucidate the role of N-glycosylation in TrCel7A, 0.1-µs MD simulations of the WT TrCel7A and the corresponding N-glycan enzyme variants productively bound to cellulose crystal were initiated. The N- and O- glycosylation pattern was built as described in “Materials and methods” section. From the MD data, the number of contacts and hydrogen bonds between N-glycans and cellulose crystal surface was calculated. The glycans at position N45 and N384, but not N270, interacted with the cellulose surface (Fig. 4a). The glycans at N384 exhibited a higher number of contacts and hydrogen bonding compared to N45. Interestingly, the glycans at N45 engaged in a similar number of contacts and hydrogen bonds with linker O-glycans. The numbers of hydrogen bonding and contact pairs between the series of N-glycan variants and the cellulose crystals were calculated (Fig. 4b). The removal of glycans at either N45 or N384 decreased the number of contacts between the CD and cellulose crystal. The minor increase in the number of contacts for the ΔN-glyc variant corresponded to the improved peptide–crystal interaction. Therefore, the molecular simulations indicated that N-glycans can be involved in the modulation of TrCel7A kinetics due to the possible interactions with cellulose surface.
Discussion
Cellobiohydrolases from Glycoside Hydrolase family 7 make up a major part of enzyme cocktails for the industrial deconstruction of lignocellulosic biomass. The enzyme from T. reesei, TrCel7A, is the most studied GH7, and like other fungal GH7 cellobiohydrolases, it carries several N-glycans on its CD. The functional roles of these N-glycans have been widely discussed and they have been linked to both thermal stability and protease resistance [3]. They have also been associated with catalytic properties including lower cellulose conversion of hyperglycosylated enzymes and higher activity of enzymes with single deletions of N-glycosylation sites [12, 18]. As the position, pattern and heterogeneity of fungal enzyme glycosylation depend on numerous factors [6, 11], it is not straightforward to assess the structure–function relationships based on wild-type enzymes. Therefore, comparative biochemistry of glycovariants appears necessary to elucidate interrelationships of N-glycosylation and catalytic performance. To implement this strategy, a comparative biochemical study of TrCel7A variants with different N-glycosylation patterns was conducted. Specifically, two industrially relevant different expression hosts were evaluated and enzymes with different N-glycan knockouts and substrates with widely differing physical properties were tested. The results provided a wide range of kinetic and binding data and some general trends on the functional roles of N-glycans emerged.
A consistent tendency for the Michaelis constant, convKM, to increase with the degree of glycosylation was observed, as illustrated by the kinetics on Avicel of the N-glycan variants expressed in A. oryzae (Table 3). In this series, WTAo had a convKM of 18 g/L. Disruption of one of the glycosylation motifs in either of the variants N45QAo, N270QAo or N384QAo lowered this value significantly, and for the ΔN-glycAo, it was reduced even further to 6 g/L. A similar trend was observed for RAC and BMCC (Table 3). Moreover, the propensity of convKM to scale with the degree of glycosylation also emerged when comparing the WT sequence expressed in different hosts. A lower convKM was observed for the enzymes from T. reesei which were less glycosylated than the enzymes from A. oryzae [5, 12] (Fig. 1, Additional file 1: Fig. S1). In the same convMM approach, the ∆N-glycAo variant exhibited consistently lower maximal specific rate, convVmax/E0 (kcat), in respect to the WTAo. This was observed with all three cellulosic substrates: 0.4 vs. 0.24 s−1 on Avicel; 0.48 vs. 0.30 s−1 on RAC and 1.46 vs. 0.96 s−1. Inverse Michaelis–Menten showed systematic effects linked to the degree of glycosylation. Attack site density (Γattack) was found to vary distinctively between the substrates as demonstrated earlier [26] with RAC and BMCC being more accessible with more attack sites compared to Avicel. Moreover, Γattack elucidates the enzyme’s ability to engage in productive complexes on the cellulose surface and it was found to be systematically related with the extent of glycosylation (Table 3). With Avicel as an example, the WTAo had Γattack = 80 nmol/g cellulose. In variants with disruption of one glycosylation motif, the value ranged from 90 to 140 nmol/g cellulose, and for the ∆N-glycAo it was 200 nmol/g cellulose. In other words, ∆N-glycAo recognized more than twice as many productive binding sites on Avicel as compared to the WT. The inverse relationship of glycosylation and Γattack was also evident for the two other substrates (RAC and BMCC), irrespective of the production host (Table 3). The same trend was observed in the independently measured adsorption data in Table 3. Thus, the saturation coverage, Γmax, was generally increased by disruption of N-glycosylation motifs. Interestingly, WTAo and WTTr, did not show identical Γattack on each tested substrate, and this might be attributed to the overall glycosylation differences between these two enzymes (Table 2). In particular, this could be linked to changes in O-glycosylation of TrCel7A, which has been shown to be involved in cellulose binding [27, 28].
The systematic changes in both convKM and Γattack are most likely interlinked. Thus, an enzyme that recognizes many attack sites (high Γattack) requires less substrate (in g/L) to reach a reaction rate that is half of convVmax. In other words, the apparent molar concentration of attack sites of substrates increases with Γattack and it follows that enzyme–substrate systems with the high Γattack will tend to have low convKM. One possible explanation for the improved binding affinity and heightened ability to find more attack sites on the cellulose surface is that glycan structures impose a steric hindrance, which hampers the enzyme from achieving an optimal conformation on the cellulose surface. In other words, the N-glycans may interfere with the enzyme’s association to the cellulosic substrates. The glycans at N45 and N384 are located in close distance to the entrance and exit of the catalytic tunnel, respectively (Fig. 2). Therefore, glycans in both positions could be involved in modulating enzyme–substrate association. It has been suggested by Adney et al. [12] that the glycans at N384 may act as a “spacer”, limiting access to the substrate by increasing the distance between the catalytic domain and cellulose surface. A similar observation of the negative impact of N45 glycan on Cel7A from Penicillium verruculosum was reported by Dotsenko et al. [18]. This interpretation can be further linked to the results of the MD simulation of TrCel7A WT and the corresponding N-glycan variants. MD simulations demonstrated that both glycans at N45 and N384 interacted with the cellulose crystal providing multiple contact pairs and some hydrogen bond pairs. No interactions were found for the glycans at N270, most likely due to the distal position to the cellulose crystal (Fig. 4a). Simulations including the mutations N45Q and N384Q showed a lower number of contacts between TrCel7A CD and cellulose surface compared to TrCel7A CD with intact glycans (Fig. 4b). Therefore, the higher number of contacts enabled by the presence of N-glycans may prevent catalytic peptide–cellulose interactions. This would result in an apparent lowered cellulose affinity of TrCel7A and the frequency of finding productive binding sites. Moreover, a number of contacts were found between the glycans at N45 and linker O-glycans (Fig. 4a), which supported the view of Jeoh et al. [5] and Adney et al. [12] that N-glycans could potentially interact with linker and CBM domains of TrCel7A. However, we found that removal of N-glycans from the catalytic domain of TrCel7A (without linker and CBM) by endoglycosidase H treatment, still resulted in improved binding capacity and activity within invMM conditions on Avicel (Additional file 1: Fig. S4). These results are consistent with the trend observed for ΔN-glyc TrCel7A that N-glycosylation is more important for substrate interactions compared to interactions between the enzyme’s domains.
More detailed inspection of the kinetic data in Table 3 suggested that the three N-glycan variants varied with respect to their influence on enzyme function. In particular, it appeared that the glycans at N45 exerted the most pronounced effect on the kinetic- and binding parameters. N45 is located on a loop region preceding the B1 loop (Additional file 1: Fig. S5) [29], in proximity to the entrance of the catalytic tunnel (Fig. 2). The N-glycans at this position can have various degree of polymerization, varying from a single GlcNAc to high mannose type structures [3, 11]. However, in this study, WTAo seems to only have glycans with high molecular weight and heterogeneity (Fig. 1, Additional file 1: Fig. S1). The strong correlation between the attack site density (Γattack) and maximal adsorption capacities (Γmax) (Fig. 3c, Table 3) shows that the N45 N-glycans not only hinder the ability to form productive complexes on the cellulose surface, but it also decreases the overall adsorption. The negative impact of the N45 N-glycans on Γmax and Γattack could be further explained by the fact that TrCel7A loops, shaping the catalytic tunnel, play a key role in modulating the enzyme’s ability to attack cellulose surface [30] and kinetics [31]. Thus, a bulky, heterogeneous glycan structure on the entrance loop might lead to reduced productive binding on the cellulose surface.
Moreover, the decrease in Vmax/E0 observed for the ∆N-glycAo is well correlated with the higher substrate affinity (lower KM) on all of the three cellulosic substrates. This could be translated to the fact that the enzyme has a higher capability to achieve a tighter binding to the cellulose surface which might lead to a slower dissociation from the substrate. Since the rate-limiting step for TrCel7A is governed by dissociation constant (koff) [31], the removal of N-glycans might further decrease it.
The described kinetic trends for TrCel7A upon removal of N-glycans were also observed when tested on two different cellulosic substrates, RAC and BMCC which represent model substrates with low (CrI < 0.05 [33, 34]) and high (CrI ~ 0.92 [34]) crystallinity index, respectively. Avicel has a moderate crystallinity index (CrI ~ 0.6 [2, 33]). As the general trend of lower KM and higher Γattack was maintained for all three substrates (Table 3), this suggests that the overall effect of N-glycans on the substrate interactions was independent of crystallinity. In accordance with RAC being an easily accessible cellulosic substrate with a high number of potential attack sites, Γattack was found to be the highest on RAC. This result is accordance with previous work by Nill and Jeoh [26] in which showed that swollen cellulose, prepared similarly to RAC, had the highest productive binding capacity among the tested model cellulosic substrates.
It is important to emphasize that N-glycosylation motifs are not well conserved among the Cel7A catalytic domains in family GH7 (Additional file 1: Fig. S6). This might be correlated with the habitat of the microorganisms secreting these enzymes, the likelihood of horizontal gene transfer, and environmental pressure. Depending on the benefits conferred by the glycosylation, variability in N-glycan positions could result in higher stability or lower substrate affinity in Nature but they might not be desirable for the in vitro conditions. We propose that functional assessments of glycosylation require a broad characterization based on comparative biochemistry and the current work suggests one approach to this. We found that TrCel7A kinetics and binding affinity can be modulated by changing N-glycosylation composition achieved by site-directed mutagenesis, selection of expression hosts and N-glycan specific glycosidases. Recently, Rubio et al. [35] presented a study in which GH3 β-xylosidase activity was enhanced by a new N-glycosylation design. We believe that our kinetic toolbox can be applied to other biomass-degrading enzymes and to our knowledge, this is the first study showing an in-depth kinetic characterization TrCel7A with modified N-glycosylation pattern, tested on different insoluble cellulosic substrates.
Conclusions
Detailed kinetic analysis of a high number of enzymes with modified N-glycosylation pattern revealed additional roles in modulation of activity and binding properties of TrCel7A on insoluble cellulosic substrates. The removal of N-glycans from TrCel7A lowered the conventional Michaelis constant (KM), increased the kinetic substrate accessibility (Γattack) and saturation coverage (Γmax). In particular, the absence of glycans at N45 modulated ability of the Cel7A to find a higher amount of productive attack sites. Both TrCel7A WT and the variant ΔN-glyc exhibited the same kinetic trends regardless of the glycosylation patterns conferred by the used expression host, A. oryzae or T. reesei. The interrelationships between N-glycosylation and substrate binding may reflect steric hindrance which encumbers formation of the optimal conformation for substrate attack. This was further emphasized by the molecular simulations, which illustrated the capability of the N-glycosylation to interact with cellulose surface. Our results provide functional and structural insights into N-glycosylation of GH7 TrCel7A that can be further utilized for better understanding of this industrially relevant cellulose-degrading type of enzyme. Potentially, the findings could form a basis for predicting the effect of glycosylation for other interfacial-active enzymes.
Materials and methods
Enzymes
Trichoderma reesei Cel7A (TrCel7A) enzymes were cloned and expressed in Aspergillus oryzae as described earlier [36]. The changes in N-glycosylation were introduced by point mutations of asparagine to glutamine in the N-glycosylation motifs. Four TrCel7A mutants were generated: (1) N45Q, (2) N270Q, (3) N384Q and combination of all three mutations (4) (N45Q, N270Q, N384Q). Enzyme purification was performed according to the protocol also described previously [37]. T. reesei expressed TrCel7A WT and ΔN-glyc were supplied as purified enzymes from Novozymes A/S. Protein concentration was determined by absorbance measurement at 280 nm using a theoretical molar extinction coefficient 81930 M/cm derived from the amino acid sequence without signal peptide. Theoretical molecular weight and extinction coefficient were calculated based on the amino acid sequence without signal peptide from P62694 (UniProt entry). The enzyme purity was verified by SDS-PAGE using NuPAGE 4–12% Bis–Tris gels (GE Healthcare, Fig. 1) and molecular weight was estimated by a reference to the commercial marker kit (LMW, GE Healthcare).
Intact protein mass spectrometry
All the studied enzymes were analyzed for their intact molecular weight using a MAXIS II electrospray mass spectrometer (Bruker Daltonik GmbH, Bremen, Germany). The samples were diluted to 0.1 mg/mL and applied to an AdvanceBio Desalting-RP column (Agilent Technologies). Samples were eluted from the column with an acetonitrile linear gradient from 5 to 95% (v/v) and introduced to the electrospray source with a flow of 400 mL/min by an Ultimate 3000 LC system (Thermo Fisher Scientific). Data analysis was performed with DataAnalysis version 4.3 (Bruker Daltonik GmbH, Bremen, Germany).
Substrates
Enzyme activity and binding affinity were measured on three different cellulosic substrates: regenerated amorphous cellulose (RAC), Avicel PH101 (Sigma-Aldrich, St. Louis, MO) and bacterial microcrystalline cellulose (BMCC). RAC was prepared from Avicel as described earlier [39, 40]. BMCC was prepared from bacterial cellulose (BC) as described in [34, 35]. All substrates were washed in MiliQ water and then washed and stored in 50 mM sodium acetate buffer pH 5.0 (referred to as a standard buffer) in the presence of 5 mM sodium azide.
Steady-state kinetics
Enzymes’ kinetics were measured at the two different experimental settings: enzyme saturation (conventional Michaelis–Menten; convMM) and substrate saturation (inverse Michaelis–Menten; invMM). In convMM conditions, 230-μL substrate aliquots, from a homogenous vigorously stirred cellulosic substrate stock, with various concentration were pipetted into 96-well microtiter plate (96F 26960 Thermo Scientific, Waltham, MA). The hydrolysis reaction was initiated by adding 20 μL enzyme with to a final concentration of 100 nM when tested with Avicel and 50 nM with RAC and BMCC. In the invMM conditions, 190 μL substrate with fixed concentration (12 g/L Avicel, 0.4 g/L RAC, 0.75 g/L BMCC) was pipetted into 96-well microtiter plate. The reactions were initiated by adding 60-μL enzyme aliquots of different concentration. The enzymatic reactions were performed for 1 h at either 25 or 50 °C, mixed at 1100 rpm and then quenched by centrifugation for 3 min at 2500g. From each reaction, 60 μL of supernatant was collected and mixed with 90 μL of p-hydroxybenzoic acid (PAHBAH) to quantify released reducing sugar ends [40]. The detailed experimental procedure is described elsewhere [37]. The absorption of the colored products was measured at 405 nm using a plate reader (Spectra Max 3; Molecular Devices, Sunnyvale, CA, USA). The absorbance readouts were recalculated to reducing sugar ends’ concentration using a cellobiose standard curve (0–1 mM). The obtained reaction rate curves were fitted with Eqs. (1, 2) for convMM and invMM conditions, respectively. The fitting was done in Origin Pro (version 2019, OriginLab Corporation, Northampton, MA, USA). Each reaction was performed in triplicate.
Binding isotherms
Enzyme adsorption on cellulose substrates was measured using the same experimental setup as for the invMM conditions. 60 μL of supernatant was transferred to 96-well microtiter plate (655079, Greiner Bio One) and mixed with 90 μL standard buffer. The enzyme concentration was measured by intrinsic protein fluorescence at 280 nm excitation wavelength and 345 nm emission wavelength. The free enzyme concentration was quantified using the calibration curve composed of known enzyme concentrations diluted in the standard buffer. Reactions were performed in triplicate measurements. The results were fitted to the Langmuir isotherm shown in Eq. (3). Maximal adsorption capacity (Γmax) and dissociation constant (Kd) were estimated and the binding affinity for each studied enzyme was analyzed.
Analysis of means
To compare the parameters derived from steady-state kinetic and binding isotherm experiments, the datasets were modeled in JMP Pro 15 (version 2019, SAS Institute Inc., Cary, NC, 1989–2019) according to the corresponding equations. The obtained parameters within one dataset were compared using analysis of means. Based on the analysis of means plots (Additional file 1: Figure), the parameters statistically different from the others at the 0.05 level of significance are indicated with a letter ‘a’.
Thermal stability and residual activity
The studied enzymes were analyzed by differential scanning calorimetry (DSC, MicroCal VP-Capillary DSC from Malvern Panalytical). The enzymes were buffer exchanged using desalting columns (PD MidiTrap G-25, GE Healthcare Life Sciences) to 50 mM sodium acetate buffer pH 5.0 and further diluted to a concentration of 0.5 mg/mL. Thermal stability was tested with a heating scan range from 20 to 100 °C and a scan rate of 3.3 °C/min. Buffer scans were subtracted from enzyme scans. Thermal transitions were observed only for the enzyme samples. Data were collected and analyzed with Origin 7 software (OriginLab, Northampton, MA, USA). As a result, the transition midpoints (Tm) were obtained for each of the enzymes.
Residual activity of TrCel7A enzymes was measured on para-nitrophenyl β-d-lactopyranoside (pNPL) in 50 mM sodium acetate pH 5.0. Prior to the activity assay, the enzymes at the concentration of 0.5 μM were incubated at the temperature T = Tm− 5 °C for 1 h. Then, the activity assay was initiated by adding the enzyme to pNPL, with the assay concentration of 0.2 μM and 0.5 mM, respectively. After 30 min, the reaction was quenched with 150 μL of 1 M Na2CO3. 150 μL of the quenched reaction mixture was transferred to a new microtiter plate and the absorbance at 405 nm was measured. The concentration of released para-nitrophenol (pNP) was quantified against the standard curve made with known concentrations of pNP (15–1000 μM). Each reaction was performed in triplicate measurements. The residual activity was calculated by comparing the activities of the enzyme with and without thermal pre-incubation.
Molecular simulations
The structure of the CD was taken from PDB [41] entry 4C4C [14]. In the WT structure, all three N-glycosylation sites (N45, N270, and N384) were decorated with (Man)9(GlcNAc)2 using the doGlycans program [42]. The structure of the CBM was taken from PDB entry 2CBH [43]. The linker and the CBM were decorated as discussed by Harrison et al. [10]. A cellulose fibril of 15 cellobiose units in length was constructed using the Cellulose-Builder [15]. The general placement of the enzyme was done similar to work by Payne et al. [44]. The lower half of the fibril was removed to limit the simulation box size. A cellulose chain on the edge of the fibril was partly removed and the CD was placed in a way that the bound cellononaose could be linked to the remaining part of that cellulose chain. Besides the wild-type structure, TrCel7A enzyme variants were prepared (N45Q, N270Q, N384Q, and a ΔN-glyc variants).
The structure was taken and run through the CHARMM-GUI [45] to obtain a GROMACS topology. The CHARMM36 force field was used to describe the system [48, 49]. All simulations were run in GROMACS 2018.6 [48–55]. GROMACS was used to construct a triclinic box with a minimal distance of 10 Å and solvate the system with TIP3P water [58, 59]. To neutralize the net charge of the system, random water molecules were exchanged with sodium ions. Minimization was done in a steepest-descent over 10,000 iterations. Afterward, NVT simulations with incremental temperature (100 K to 300 K in 50 K steps) were performed in succession for 20 ps each. Thereafter, NPT simulations with restraints on all solutes, with restraints on the protein backbone and the crystal, and with restraints only on the crystal were performed for 100 ps in series. The production was run in the NPT ensemble at 300 K with a time-step of 2 fs for 100 ns, while only keeping the lowest layer of cellulose chains in the crystal restraint. The long-range electrostatic treated with the Particle-mesh Ewald method with cubic interpolation and a cut-off of 12 Å [58]. Van der Waals interactions were treated in a Verlet scheme with a cut-off distance of 12 Å and a switching function for the forces starting at 10 Å [59]. Bonds were restrained using the LINCS algorithm [60]. The solutes and the solvent were coupled to heat baths at 300 K with a Berendsen thermostat [61]. Pressure coupling was done with a Parrinello–Rahman barostat [62]. Analysis of the trajectories was performed with GROMACS. The trajectories were visualized in PyMOL.
Multiple sequence alignment and phylogenetic tree
To construct the phylogenetic tree, 29 characterized GH7 Cel7A amino acid sequences were retrieved from Carbohydrate Active Enzymes (CAZy) database (http://www.cazy.org/) [8] and the multiple sequence alignment was performed using ClustalW [63]. As the reference sequence, Trichoderma reesei Cel7A was selected. The gap-opening values of the pairwise and multiple alignments were set to 3 and 30, respectively, while the other parameters were kept default. The amino acid residues corresponding to signal peptide, linker and CBM were removed from the aligned sequences and subsequently realigned. The phylogenetic analysis was done using MEGA X [64]. The phylogenetic tree was built using the Maximum Likelihood method and Whelan and Goldman model [65]. The tree with the highest log likelihood (− 12018.02) is shown. Initial tree(s) for the heuristic search was obtained automatically by applying Neighbor-Join and BioNJ algorithms to a matrix of pairwise distances estimated using a JTT model, and then selecting the topology with superior log likelihood value. All positions with less than 90% site coverage were eliminated, i.e., fewer than 10% alignment gaps, missing data, and ambiguous bases were allowed at any position.
Supplementary information
Acknowledgements
The simulations were carried out at the high-performance cluster at the Technical University of Denmark. Research Associate Clive Phipps Walter (Novozymes) and Senior Science Manager Christian Isak Jørgensen (Novozymes) are thanked for technical assistance with mass spectrometry analysis. Ryan Janke (Novozymes) is thanked for the help with the expression of TrCel7A variants in T. reesei and K.C. McFarland is thanked for the help with purification of these enzymes. Garry Paul Gippert (Novozymes) is thanked for the help with mapping of N-glycosylation motifs in characterized Cel7A sequences from GH7 and the construction of phylogenetic tree.
Abbreviations
- TrCel7A
Trichoderma reesei Cel7A
- CAZy database
Carbohydrate-active enzyme database
- CD
Catalytic domain
- CBM
Carbohydrate-binding module
- Man
Mannose
- GlcNAc
N-acetylglucosamine
- MS
Mass spectrometry
- SDS-PAGE
Sodium dodecyl sulfate–polyacrylamide gel electrophoresis
- MW
Molecular weight
- DSC
Differential scanning calorimetry
- Tm
Melting temperature
- pNP-Lac
p-nitrophenyl β-d-lactopyranoside
- convMM
Conventional Michaelis–Menten kinetics
- invMM
Inverse Michaelis–Menten kinetics
- Vmax
Maximal rate
- KM
Michaelis constant
- RAC
Regenerated amorphous cellulose
- BMCC
Bacterial microcrystalline cellulose
- Γattack
Density of attack sites
- Γmax
Maximal adsorption capacity
- MD
Molecular dynamics
- PAHBAH
p-hydroxybenzoic acid hydrazide
- PDB
Protein data bank
Authors’ contributions
BMK conducted the overall study, performed experiments and wrote the manuscript. KSS and GHJP performed MD experiments and analyzed the data. THS cloned and expressed the enzymes. KJ, KBRMK and PW conceived the study, supervised all aspects of the work and revised the manuscript. All authors read and approved the final manuscript.
Funding
This work was supported by Roskilde University, Novozymes A/S, Innovation Fund Denmark [Grant number: 5150-00020B], the Novo Nordisk Foundation [Grant number: NNF15OC0016606 and NNFSA170028392] and the Carlsberg Foundation.
Availability of data and materials
The datasets supporting the conclusions of this article are included within the article and in Additional file 1.
Ethics approval and consent to participate
Not applicable.
Consent for publication
Not applicable.
Competing interests
KJ, KBRMK and TS work at Novozymes, a world leading company in the production of industrially used enzymes.
Footnotes
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary information
Supplementary information accompanies this paper at 10.1186/s13068-020-01779-9.
References
- 1.Mandels M, Reese ET. Induction of cellulase in Trichoderma viride as influenced by carbon sources and metals. J Bacteriol. 1957;73(2):269–278. doi: 10.1128/jb.73.2.269-278.1957. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2.Zhang YHP, Lynd LR. Toward an aggregated understanding of enzymatic hydrolysis of cellulose: noncomplexed cellulase systems. Biotechnol Bioeng. 2004;88(7):797–824. doi: 10.1002/bit.20282. [DOI] [PubMed] [Google Scholar]
- 3.Amore A, et al. Distinct roles of N- and O-glycans in cellulase activity and stability. Proc Natl Acad Sci. 2017;114(52):13667–13672. doi: 10.1073/pnas.1714249114. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.Tang H, et al. N-hypermannose glycosylation disruption enhances recombinant protein production by regulating secretory pathway and cell wall integrity in Saccharomyces cerevisiae. Sci Rep. 2016;6(25654):1–13. doi: 10.1038/srep25654. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Jeoh T, Michener W, Himmel ME, Decker SR, Adney WS. Implications of cellobiohydrolase glycosylation for use in biomass conversion. Biotechnol Biofuels. 2008;1(1):1–3. doi: 10.1186/1754-6834-1-10. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Deshpande N, Wilkins MR, Packer N, Nevalainen H. Protein glycosylation pathways in filamentous fungi. Glycobiology. 2008;18(8):626–637. doi: 10.1093/glycob/cwn044. [DOI] [PubMed] [Google Scholar]
- 7.Hui JPM, et al. Characterization of cellobiohydrolase I (Cel7A) glycoforms from extracts of Trichoderma reesei using capillary isoelectric focusing and electrospray mass spectrometry. J Chromatogr B Biomed Sci Appl. 2001;752(2):349–368. doi: 10.1016/s0378-4347(00)00373-x. [DOI] [PubMed] [Google Scholar]
- 8.Lombard V, Golaconda Ramulu H, Drula E, Coutinho PM, Henrissat B. The carbohydrate-active enzymes database (CAZy) in 2013. Nucleic Acids Res. 2014;42(D1):D490–D495. doi: 10.1093/nar/gkt1178. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Payne CM, et al. Fungal cellulases. Chem Rev. 2015;115(3):1308–1448. doi: 10.1021/cr500351c. [DOI] [PubMed] [Google Scholar]
- 10.Harrison MJ, et al. Modified glycosylation of cellobiohydrolase I from a high cellulase-producing mutant strain of Trichoderma reesei. Eur J Biochem. 1998;256(1):119–127. doi: 10.1046/j.1432-1327.1998.2560119.x. [DOI] [PubMed] [Google Scholar]
- 11.Stals I, Sandra K, Geysens S, Contreras R, Van Beeumen J, Claeyssens M. Factors influencing glycosylation of Trichoderma reesei cellulases. I: postsecretorial changes of the O- and N-glycosylation pattern of Ce17A. Glycobiology. 2004;14(8):713–724. doi: 10.1093/glycob/cwh080. [DOI] [PubMed] [Google Scholar]
- 12.Adney WS, et al. Probing the role of N-linked glycans in the stability and activity of fungal cellobiohydrolases by mutational analysis. Cellulose. 2009;16(4):699–709. [Google Scholar]
- 13.Kari J, Christensen SJ, Andersen M, Baiget SS, Borch K, Westh P. A practical approach to steady-state kinetic analysis of cellulases acting on their natural insoluble substrate. Anal Biochem. 2019;586(1):113411. doi: 10.1016/j.ab.2019.113411. [DOI] [PubMed] [Google Scholar]
- 14.Knott BC, et al. The mechanism of cellulose hydrolysis by a two-step, retaining cellobiohydrolase elucidated by structural and transition path sampling studies. J Am Chem Soc. 2013;136(1):321–329. doi: 10.1021/ja410291u. [DOI] [PubMed] [Google Scholar]
- 15.Gomes TCF, Skaf MS. Cellulose-builder: a toolkit for building crystalline structures of cellulose. J Comput Chem. 2012;33(14):1338–1346. doi: 10.1002/jcc.22959. [DOI] [PubMed] [Google Scholar]
- 16.Cruys-Bagger N, Elmerdahl J, Praestgaard E, Borch K, Westh P. A steady-state theory for processive cellulases. FEBS J. 2013;280(16):3952–3961. doi: 10.1111/febs.12397. [DOI] [PubMed] [Google Scholar]
- 17.Kari J, Andersen M, Borch K, Westh P. An inverse Michaelis–Menten approach for interfacial enzyme kinetics. ACS Catal. 2017;7(7):4904–4914. [Google Scholar]
- 18.Dotsenko AS, Gusakov AV, Volkov PV, Rozhkova AM, Sinitsyn AP. N-linked glycosylation of recombinant cellobiohydrolase I (Cel7A) from Penicillium verruculosum and its effect on the enzyme activity. Biotechnol Bioeng. 2016;113(2):283–291. doi: 10.1002/bit.25812. [DOI] [PubMed] [Google Scholar]
- 19.Andersen M, Kari J, Borch K, Westh P. Michaelis–Menten equation for degradation of insoluble substrate. Math Biosci. 2018;296:93–97. doi: 10.1016/j.mbs.2017.11.011. [DOI] [PubMed] [Google Scholar]
- 20.Bailey CJ. Enzyme kinetics of cellulose hydrolysis. Biochem J. 1989;262(3):1001. doi: 10.1042/bj2621001a. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.McLaren AD, Packer L. Some aspects of enzyme reactions in heterogeneous systems. Adv Enzymol Relat Areas Mol Biol. 1970;33:245–308. doi: 10.1002/9780470122785.ch5. [DOI] [PubMed] [Google Scholar]
- 22.Karuna N, Jeoh T. The productive cellulase binding capacity of cellulosic substrates. Biotechnol Bioeng. 2017;114(3):533–542. doi: 10.1002/bit.26193. [DOI] [PubMed] [Google Scholar]
- 23.Cruys-Bagger N, Alasepp K, Andersen M, Ottesen J, Borch K, Westh P. Rate of threading a cellulose chain into the binding tunnel of a cellulase. J Phys Chem B. 2016;120(25):5591–5600. doi: 10.1021/acs.jpcb.6b01877. [DOI] [PubMed] [Google Scholar]
- 24.Jalak J, Väljamäe P. Multi-mode binding of cellobiohydrolase Cel7A from Trichoderma reesei to cellulose. PLoS ONE. 2014;9(9):e108181. doi: 10.1371/journal.pone.0108181. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.Ståhlberg J, Johansson G, Pettersson G. A new model for enzymatic hydrolysis of cellulose based on the two-domain structure of cellobiohydrolase I. Bio/Technology. 1991;9(3):286–290. [Google Scholar]
- 26.Nill J, Jeoh T. The role of evolving interfacial substrate properties on heterogeneous cellulose hydrolysis kinetics. ACS Sustain Chem Eng. 2020;8(17):6722–6733. [Google Scholar]
- 27.Payne CM, et al. Glycosylated linkers in multimodular lignocellulose-degrading enzymes dynamically bind to cellulose. Proc Natl Acad Sci USA. 2013;110(36):14646–14651. doi: 10.1073/pnas.1309106110. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28.Chung D, et al. Glycosylation is vital for industrial performance of hyperactive cellulases. ACS Sustain Chem Eng. 2019;7(5):4792–4800. [Google Scholar]
- 29.Momeni MH, et al. Structural, biochemical, and computational characterization of the glycoside hydrolase family 7 cellobiohydrolase of the tree-killing fungus heterobasidion irregulare. J Biol Chem. 2013;288(8):5861–5872. doi: 10.1074/jbc.M112.440891. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30.Schiano-di-Cola C, et al. Systematic deletions in the cellobiohydrolase (CBH) Cel7A from the fungus Trichoderma reesei reveal flexible loops critical for CBH activity. J Biol Chem. 2019;294(6):1807–1815. doi: 10.1074/jbc.RA118.006699. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31.Kari J, Olsen J, Borch K, Cruys-Bagger N, Jensen K, Westh P. Kinetics of cellobiohydrolase (Cel7A) variants with lowered substrate affinity. J Biol Chem. 2014;289(47):32459–32468. doi: 10.1074/jbc.M114.604264. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32.Pellegrini VOA, et al. Reversibility of substrate adsorption for the cellulases Cel7A, Cel6A, and Cel7B from Hypocrea jecorina. Langmuir. 2014;30(42):12602–12609. doi: 10.1021/la5024423. [DOI] [PubMed] [Google Scholar]
- 33.Cruys-Bagger N, Tatsumi H, Ren GR, Borch K, Westh P. Transient kinetics and rate-limiting steps for the processive cellobiohydrolase Cel7A: effects of substrate structure and carbohydrate binding domain. Biochemistry. 2013;52(49):8938–8948. doi: 10.1021/bi401210n. [DOI] [PubMed] [Google Scholar]
- 34.Väljamäe P, Sild V, Nutt A, Pettersson G, Johansson G. Acid hydrolysis of bacterial cellulose reveals different modes of synergistic action between cellobiohydrolase I and endoglucanase I. Eur J Biochem. 1999;266(2):327–334. doi: 10.1046/j.1432-1327.1999.00853.x. [DOI] [PubMed] [Google Scholar]
- 35.Rubio MV, et al. Redesigning N-glycosylation sites in a GH3 β-xylosidase improves the enzymatic efficiency. Biotechnol Biofuels. 2019;12(1):1–14. doi: 10.1186/s13068-019-1609-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36.Borch K et al. “Cellobiohydrolase variants and polynucleotides encoding same.” Google Patents. 2016.
- 37.Sørensen TH, Cruys-Bagger N, Windahl MS, Badino SF, Borch K, Westh P. Temperature effects on kinetic parameters and substrate affinity of Cel7A cellobiohydrolases. J Biol Chem. 2015;290(36):22193–22202. doi: 10.1074/jbc.M115.658930. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 38.Zhang Y-HP, Cui J, Lynd LR, Kuang LR. A transition from cellulose swelling to cellulose dissolution by o-phosphoric acid: evidence from enzymatic hydrolysis and supramolecular structure. Biomacromol. 2006;7(2):644–648. doi: 10.1021/bm050799c. [DOI] [PubMed] [Google Scholar]
- 39.Christensen SJ, Kari J, Badino SF, Borch K, Westh P. Rate-limiting step and substrate accessibility of cellobiohydrolase Cel6A from Trichoderma reesei. FEBS J. 2018;285(23):4482–4493. doi: 10.1111/febs.14668. [DOI] [PubMed] [Google Scholar]
- 40.Lever M. Calorimetric determination and acid fluorometric with hydrazide. Anal Biochem. 1973;281:274–281. [Google Scholar]
- 41.Berman HM. The protein data bank. Nucleic Acids Res. 2000;28(1):235–242. doi: 10.1093/nar/28.1.235. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 42.Danne R, et al. doGlycans—tools for preparing carbohydrate structures for atomistic simulations of glycoproteins, glycolipids, and carbohydrate polymers for GROMACS. J Chem Inf Model. 2017;57(10):2401–2406. doi: 10.1021/acs.jcim.7b00237. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 43.Kraulis J, et al. Determination of the three-dimensional solution structure of the C-terminal domain of cellobiohydrolase I from Trichoderma reesei. A study using nuclear magnetic resonance and hybrid distance geometry-dynamical simulated annealing. Biochemistry. 1989;28:7241–7257. doi: 10.1021/bi00444a016. [DOI] [PubMed] [Google Scholar]
- 44.Payne CM, Jiang W, Shirts MR, Himmel ME, Crowley MF, Beckham GT. Glycoside hydrolase processivity is directly related to oligosaccharide binding free energy. JACS. 2013;135:18831–18839. doi: 10.1021/ja407287f. [DOI] [PubMed] [Google Scholar]
- 45.Sunhwan J, Taehoon K, Vidyashankara IG, Wonpil I. CHARMM-GUI: a web-based graphical user interface for CHARMM. J Comput Chem. 2008;29(11):1859–1865. doi: 10.1002/jcc.20945. [DOI] [PubMed] [Google Scholar]
- 46.Best RB, et al. Optimization of the additive CHARMM all-atom protein force field targeting improved sampling of the backbone ϕ, ψ and side-chain χ 1 and χ 2 dihedral angles. J Chem Theory Comput. 2012;8:3257–3273. doi: 10.1021/ct300400x. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 47.Guvench O, et al. CHARMM additive all-atom force field for carbohydrate derivatives and its utility in polysaccharide and carbohydrate–protein modeling. J Chem Theory Comput. 2011;7(10):3162–3180. doi: 10.1021/ct200328p. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 48.Abraham MJ, et al. GROMACS: high performance molecular simulations through multi-level parallelism from laptops to supercomputers. SoftwareX. 2015;1–2(September):19–25. [Google Scholar]
- 49.Páll S, Abraham MJ, Kutzner C, Hess B, Lindahl E. Tackling exascale software challenges in molecular dynamics simulations with GROMACS. In: International conference on exascale applications and software 2014. pp. 3–27
- 50.Kutzner C, Apostolov R, Hess B, Grubmüller H. Scaling of the GROMACS 4.6 molecular dynamics code on SuperMUC. Adv Parallel Comput. 2014;25:722–727. [Google Scholar]
- 51.Pronk S, et al. GROMACS 4.5: a high-throughput and highly parallel open source molecular simulation toolkit. Bioinformatics. 2013;29(7):845–854. doi: 10.1093/bioinformatics/btt055. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 52.Hess B, Kutzner C, van der Spoel D, Lindahl E. GROMACS 4: algorithms for highly efficient, load-balanced, and scalable molecular simulation. J Chem Theory Comput. 2008;4(3):435–447. doi: 10.1021/ct700301q. [DOI] [PubMed] [Google Scholar]
- 53.Van Der Spoel D, Lindahl E, Hess B, Groenhof G, Mark AE, Berendsen HJC. GROMACS: fast, flexible, and free. J Comput Chem. 2005;26(16):1701–1718. doi: 10.1002/jcc.20291. [DOI] [PubMed] [Google Scholar]
- 54.Lindahl E, Hess B, van der Spoel D. GROMACS 3.0: a package for molecular simulation and trajectory analysis. Mol Model Annu. 2001;7(8):306–317. [Google Scholar]
- 55.Berendsen H, van der Spoel D, Drunen R. GROMACS: a message-passing parallel molecular dynamics implementatio. Comput Phys Commun. 1995;91:43–56. [Google Scholar]
- 56.Bondi A. van der Waals volumes and radii. J Phys Chem. 1963;68(3):441–451. [Google Scholar]
- 57.Jorgensen WL, Chandrasekhar J, Madura JD, Impey RW, Klein ML. Comparison of simple potential functions for simulating liquid water. J Chem Phys. 1983;79(2):926–935. [Google Scholar]
- 58.Darden T, York D, Pedersen L. Particle mesh Ewald: an N log(N) method for Ewald sums in large systems. J Chem Phys. 1993;98(12):10089–10092. [Google Scholar]
- 59.Verlet L. Computer ‘experiments’ on classical fluids. I. Thermodynamical Properties of Lennard–Jones Molecules. Phys Rev. 1967;159(1):98–103. [Google Scholar]
- 60.Hess B, Bekker H, Berendsen HJC, Fraaije JGEM. LINCS: a linear constraint solver for molecular simulations. J Comput Chem. 1997;18(12):1463–1472. [Google Scholar]
- 61.Berendsen HJC, Postma JPM, van Gunsteren WF, DiNola A, Haak JR. Molecular dynamics with coupling to an external bath. J Chem Phys. 1984;81(8):3684–3690. [Google Scholar]
- 62.Bussi G, Zykova-Timan T, Parrinello M. Isothermal-isobaric molecular dynamics using stochastic velocity rescaling. J Chem Phys. 2009;130(7):074101. doi: 10.1063/1.3073889. [DOI] [PubMed] [Google Scholar]
- 63.Thompson JD, Higgins DG, Gibson TJ. CLUSTAL W: improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position-specific gap penalties and weight matrix choice. Nucleic Acids Res. 1994;22(22):4673–4680. doi: 10.1093/nar/22.22.4673. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 64.Kumar S, Stecher G, Li M, Knyaz C, Tamura K. MEGA X: molecular evolutionary genetics analysis across computing platforms. Mol Biol Evol. 2018;35(6):1547–1549. doi: 10.1093/molbev/msy096. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 65.Whelan S, Goldman N. A general empirical model of protein evolution derived from multiple protein families using a maximum-likelihood approach. Mol Biol Evol. 2001;18(5):691–699. doi: 10.1093/oxfordjournals.molbev.a003851. [DOI] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
The datasets supporting the conclusions of this article are included within the article and in Additional file 1.