Skip to main content
International Journal of Molecular Sciences logoLink to International Journal of Molecular Sciences
. 2020 Jun 4;21(11):4035. doi: 10.3390/ijms21114035

In-Silico Characterization of Glycosyl Hydrolase Family 1 β-Glucosidase from Trichoderma asperellum UPM1

Mohamad Farhan Mohamad Sobri 1,2, Suraini Abd-Aziz 1, Farah Diba Abu Bakar 3, Norhayati Ramli 1,*
PMCID: PMC7311958  PMID: 32512945

Abstract

β-glucosidases (Bgl) are widely utilized for releasing non-reducing terminal glucosyl residues. Nevertheless, feedback inhibition by glucose end product has limited its application. A noticeable exception has been found for β-glucosidases of the glycoside hydrolase (GH) family 1, which exhibit tolerance and even stimulation by glucose. In this study, using local isolate Trichoderma asperellum UPM1, the gene encoding β-glucosidase from GH family 1, hereafter designated as TaBgl2, was isolated and characterized via in-silico analyses. A comparison of enzyme activity was subsequently made by heterologous expression in Escherichia coli BL21(DE3). The presence of N-terminal signature, cis-peptide bonds, conserved active site motifs, non-proline cis peptide bonds, substrate binding, and a lone conserved stabilizing tryptophan (W) residue confirms the identity of Trichoderma sp. GH family 1 β-glucosidase isolated. Glucose tolerance was suggested by the presence of 14 of 22 known consensus residues, along with corresponding residues L167 and P172, crucial in the retention of the active site’s narrow cavity. Retention of 40% of relative hydrolytic activity on ρ-nitrophenyl-β-D-glucopyranoside (ρNPG) in a concentration of 0.2 M glucose was comparable to that of GH family 1 β-glucosidase (Cel1A) from Trichoderma reesei. This research thus underlines the potential in the prediction of enzymatic function, and of industrial importance, glucose tolerance of family 1 β-glucosidases following relevant in-silico analyses.

Keywords: Trichoderma asperellum, β-glucosidase, glycosyl hydrolase family 1, in-silico analyses

1. Introduction

Trichoderma sp. is a genus of fungi that has been the subject of several studies, ranging from phylogeny, distribution, defense mechanism, host interaction, production and secretion of enzymes, sexual development, and also responses to changes in the environment [1]. In particular, there has been extensive interest in its capacity for cellulase production, resulting in its application in biotechnology and subsequent industrial fields, such as bioethanol production from lignocellulosic materials, textile fields, and feed production [2,3]. Efforts on this front have involved Trichoderma reesei, making it a fungal model organism, of focus for both industrial and academic research teams [4].

Cellulases function by hydrolysis of β-1,4 linkages present within cellulose. In nature, complete degradation is obtained by the synergistic action of three separate types, namely (1) endoglucanases, (2) cellobiohydrolases, and (3) β-glucosidases [5,6]. From the surface of solid substrates, primary hydrolysis by endoglucanase and cellobiohydrolase lead to the release of soluble cellodextrins. Secondary hydrolysis then proceeds for the breakdown of cellodextrins such as cellobiose into glucose by β-glucosidases [6]. In the process of cellulose saccharification, occurrences of substrate and production inhibition are not uncommon. In substrate inhibition, this occurs on a two-domain structure found on cellobiohydrolase, of which for a given fixed enzyme load, rise in substrate concentration results in increase of time/distance necessary for cellobiohydrolases to act on the chain ends resulting from endoglucanase activity [7]. The presence of liberated cellobiose and monosaccharides may instead lead to product inhibition where cellobiose affects the active site of cellobiohydrolases by steric hindrance [8] and β-glucosidases are affected by the presence of glucose [9]. Hence, to achieve increased performance of hydrolysis, efforts have included balancing the enzyme cocktail and tailoring specific enzymes [10].

Among the three types involved, β-glucosidase has been focused on owing to several characteristics. In production terms, β-glucosidase is noticeable at much lower concentrations than endoglucanases and cellobiohydrolases, even from commercial cellulase producer such as T. reesei [11]. Unfavorable end product inhibition by glucose further exacerbates the problem, making it the rate-limiting enzyme, with impaired yields being a major obstacle for commercialization of cellulose hydrolysis [9].

Classification based on sequence identity and hydrophobic cluster analysis, as available in carbohydrate-active enzymes (CAZy) database has placed β-glucosidases in glycoside hydrolase (GH) families 1 and 3 [12]. Among biotechnology industries, the utilization of β-glucosidases with glucose tolerance and stimulation can improve the efficiency of substrate degradation and result in a reduction of production costs. Thus, the utilization of glucose-tolerant β-glucosidases has increased in interest in recent years. While the majority of β-glucosidases are sensitive to glucose, tolerance coupled with a stimulatory effect of the carbohydrate have been observed exclusively among GH family 1 (GH1) β-glucosidases [13]. Comparison of relative tolerance to that of GH family 3 (GH3) β-glucosidases has been extensive, ranging from tenfold to 1000-times fold higher. Thus, it has been suggested that GH1 β-glucosidases are more suitable for plant cell-wall saccharification in biotechnological applications [14].

Collection, retrieval and analysis of biological data using computational techniques, in a science known as bioinformatics, has grown significantly with increase in computational speed and memory storage capabilities. Major impacts of the field include automation of genome sequencing, integrated genomics and proteomics databases, genome comparisons to identify the genome function, as well as the automated derivation of metabolic pathways, the development of statistical techniques, and three-dimensional (3D) modeling of biochemical structures. While, in general, sequence reveals structure, which reveals function, the reality can differ where a similarity in protein structures may result in different functions, and adversely, proteins with different structures may exhibit similar functions. Understanding and proving such relationships would be essential for the spectrum of applications bioinformatics can apply to [15].

While T. reesei has been the model organism for utilization, comparative secretome analysis to Trichoderma asperellum following solid state fermentation on the same biomass substrate have shown T. asperellum to have higher enzymatic activities and increased abundance of main and side chain hemicellulases and β-glucosidases [16]. Hence, from local T. asperellum UPM1 strain, this work has sought to isolate the gene encoding GH1 β-glucosidase (TaBgl2) and following expression in Escherichia coli as a recombinant enzyme, relate the enzyme’s specificity and sensitivity to glucose to physicochemical characteristics, ancestral relationship, and structure determination at several levels, predicted prior via in-silico analyses.

2. Results and Discussion

2.1. Nucleotide and Deduced Amino Acid Sequence of TaBgl2

By means of nucleotide sequence analyzed using Basic Local Alignment Search Tool (BLAST) (Supplementary Table S1), isolation and sequencing of cDNA obtained was found to code for β-glucosidase of GH1, hereafter designated as TaBgl2. Translation into the corresponding amino acids led to identification of an open reading frame (ORF) of 1398 bp, G-C content of 55.65%, with subsequent protein BLAST elucidating a β-glucosidase, 465 amino acids in length, weighing approximately 52 kDa. From subsequent protein sequence, BLAST analysis (Supplementary Table S2) was carried out to deduce the identity of the isolated protein. Highest homology was found to β-glucosidase isolated from T. asperellum strain CBS 433.97 (accession number: XP 024766195.1) with identities of 99%. Upon comparison to selected Trichoderma spp. β-glucosidases, TaBgl2 was shown to have comparable amino acid length (between 455 and 466 amino acids). These findings coupled with the high degree of identities, i.e., exceeding 90%, further affirms the identity of the protein.

2.2. Sequence Alignment and Conserved Motif Identification

Multiple sequence alignments (MSAs) have become a fundamental approach in molecular biology and bioinformatics research domains, including phylogenetic tree reconstruction, three-dimensional (3D) structure prediction, conserved regions identification, and elucidation of molecular function [17]. Hence, for purpose of conserved motif identifications, TaBgl2 was aligned to several β-glucosidases of known Trichoderma spp. origins (Figure 1). GH1 protein identity was suggested by the presence of two distinct motifs, namely N-terminal signature sequence with 15 amino acids in length (F-x-[FYWM]-[GSTA]-x-[GSTA]-x-[GSTA](2)-[FYNH]-[NQ]-x-E-x-[GSTA], present as FQWGFATAAYQIEGA in TaBgl2) [18], and two cis-peptide bonds between Ala-180 and Pro-181 and between Trp-416 and Ser-417 [19].

Figure 1.

Figure 1

Sequence alignment of TaBgl2 with selected Trichoderma spp. β-glucosidase protein sequences (denoted by their enzyme abbreviations and NCBI/GenBank accession numbers) (TaBgl2, T. asperellum UPM1 Bgl 2 – ARW78142.1; TaBgl, T. asperellum CBS 433.97 glycoside hydrolase family 1 – XP_024766195.1; TgBgl, T. gamsii β-glucosidase – XP_018660766.2; TatBgl, T. atroviride IMI 206040 glycoside hydrolase family 1 – XP_013939543.1; TvBgl, T. virens β-1,4-glucosidase – AJW67427.1; TguBgl, T. guizhouense GH1 β-glucosidase Bgl2 – OPB339337.1; ThBgl, T. harzianum β-glucosidase – KKP02477.1; TrBgl2, T. reesei β-glucosidase – BAA74959.1). Highlighted motifs, in red boxes, include: (A) glycosyl hydrolase family 1 N-terminal signature, with (B) & (C) as catalytic site motifs. Boxed in blue and yellow are suggested glycone and aglycone binding residues, respectively.

β-glucosidase identity was confirmed by the presence of two conserved motifs [20], TFNEP and VTENG, each containing the catalytic acid/base and nucleophile glutamate residues, E165 and E366, respectively. Identification of glutamate residues on GH1 β-glucosidase active site in TaBgl2 suggests as to the mechanism of activity, proceeding by the β-retaining mechanism [21] commonly found in other GH1 β-glucosidases. In addition, 22 residues constructing the entrance to the active site region were found along with 11 glycone binding residues, six aglycone binding residues [22], and a lone conserved tryptophan W338, stabilizing the aglycone moiety at the +1 subsite [14]. Matching of signature sequences based on models classifying proteins into families or for prediction of characteristic domains and functionally relevant sites allows for prediction of enzyme function [23].

As mentioned, a case can be made as to the possible glucose tolerance of GH1 β-glucosidases. Work by Mariano et al. [24] has highlighted 22 essential residues to confer this property of which 14 was identified with two being active site residues (E165, E366), eight glycone binding residues (H119, W120, N164, N295, W416, E423, W424, F432), and four aglycone binding residues (C168, N224, Y297, T298).

2.3. Secondary and Tertiary Structure Predictions

Secondary structure of TaBgl2 as well as selected β-glucosidase sequences of Trichoderma spp. were estimated using GOR IV tools (Table 1). The GOR method is a popular secondary structure prediction scheme, long established as the first implemented as a computer program. It functions by providing estimates of probabilities for three secondary structures at a given residue position, and has the advantage over nearest-neighbor and neural network-based methods by the identification of what is considered and neglected parameters for prediction [25]. To this, the GOR IV results suggest the dominance of random coils (54.41%–57.94%) among Trichoderma spp. β-glucosidases. T. reesei Bgl2 in complex with Tris (PDB accession number: 3AHY_A) [26] was used as a template, given a shared sequence identity of 90.06% and coverage of 1.0. Alignment of both sequences with the resultant protein structure was made (Figure 2 and Figure 3). TaBgl2 structure was observed to have the classical (α/β)8 triosephosphate isomerase (TIM) barrel fold. From this, an outer opening was created which the slot forms an active site cleft, with the catalytic motifs present opposite each other at the bottom of the active site, where a parallel β-barrel is formed from the β-sheets with α-helices located outside of the barrel [22,27,28].

Table 1.

Prediction of secondary structures within β-glucosidases of selected Trichoderma spp., using ExPASY’s GOR IV.

Description Accession Number GOR IV Analyses
α-Helix (Hh) (%) Extended Strand (Ee) (%) Random Coils (%)
β-Glucosidase 2 (Trichoderma asperellum) * ARW78142.1 26.45 18.92 54.62
Glycoside hydrolase family 1 protein (Trichoderma asperellum CBS 433.97) XP_024766195.1 25.38 18.92 55.70
β-Glucosidase (Trichoderma gamsii) XP_018660766.2 27.31 18.28 54.41
Glycoside hydrolase family 1 protein (Trichoderma atroviride IMI 206040) XP_013939543.1 25.81 19.35 54.84
β-1,4-Glucosidase (Trichoderma virens) AJW67427.1 24.84 19.34 55.82
GH1 β-glucosidase BGL2/CEL1a (Trichoderma guizhouense) OPB39337.1 24.30 17.85 57.85
β-Glucosidase (Trichoderma harzianum) KKP02477.1 24.52 17.63 57.85
β-Glucosidase (Trichoderma reesei) BAA74959.1 24.03 18.03 57.94

* This study.

Figure 2.

Figure 2

Alignment of TaBgl2 (represented as Chain A) to template Trichoderma reesei Bgl2 associated with Tris (3ahy.1). QMEAN values, displayed by bar height, function to quantify modelling errors and simultaneously estimate the expected model accuracy. α-Helix and β-sheets are colored as blue and green segments, respectively.

Figure 3.

Figure 3

(A) Three-dimensional (3D) structure of TaBgl2 on the side of the active site entrance (boxed). Glutamate residues (E165 and E366) within catalytic motifs (TFNEP and VTENG) are as seen in (B,C), respectively. α-Helix and β-sheets are colored blue and green, respectively.

Referring to Jeng et al. [26] as the source of the template structure TrBgl2, comparisons were also made to β-glucosidases from bacterium Clostridium cellulovorans and termite Neotermes koshunensis. To all proteins, active sites were found to form slot like clefts, between 15–20 Å in depth, positioned on connecting loops at the C-terminal end of β-sheets of the TIM barrel, surrounded by negatively charged residues. Specifically, for TrBgl2, the catalytic acid/base of TrBgl2 of Glu165 is located on the TFNEP motif at the end of the β-strand 4, while the catalytic nucleophile of Glu367 is located on the VTENG motif at the end of β-strand 7. Both residues are positioned as predicted for TaBgl2.

SWISS-MODEL analysis also suggested as to the presence of a non-proline cis-peptide bond, located between Trp416 and Ser417. As is common for residues involved in non-proline cis peptides bonds, the main-chain dihedral angles of both residues were found located on the β-region of a Φ/Ψ plot on a Ramachandran plot [29]. While regarded as rare, presence of the bond close to the active site supports several works on carbohydrate-binding or processing proteins such as lacZ βDG structure from Escherichia coli [30] and β-D-galactosidase from Paracoccus sp. 32d [31], suggesting presence of the bond to be a characteristic for glycosyl hydrolase proteins.

Concerning the argument proposing the tolerance of glucose to GH1 β-glucosidases, focus may be placed on the shape and the electrostatic properties of the entrance to the active site. Relative higher glucose tolerance of GH1 β-glucosidases can be correlated to the active-site accessibility, of which the cavities are deeper and narrower compared to the shallow ones present in GH3 β-glucosidases. With protein structures elucidated and knowledge of the conserved residues in TaBgl2, the importance of two additional key residues (W169 and L174 on Thermoanaerobacter brockii β-glucosidase) has further been argued to be essential for glucose tolerance [14]. These residues help in reducing enzymatic inhibition by retaining active site width, keeping a narrow cavity by means of imposing restrictions at the +2 subsite, thus limiting access of glucose to the +1 subsite.

Comparison of known glucose tolerant β-glucosidases from Humicola insolens (HiBG) and from a compost metagenomic library (GenBank accession No. HV538882) (Td2F2) where these residues are present, highlighted instead to their corresponding absence in TaBgl2, to be instead replaced as L167 and P172. Nevertheless, it must be highlighted that the 14 conserved residues and two key residues mentioned are identical to that of a previously elucidated glucose tolerant β-glucosidase from T. reesei, known as Cel1A [32]. Hence, a comparison of enzyme activity between novel TaBgl2, following heterologous expression, to the aforementioned protein, is hereby argued to provide a basis in demonstrating the feasibility of applying in-silico bioinformatics analyses for prediction of enzyme activity, specifically in the presence of glucose inhibitors.

2.4. Physicochemical Characterization

Physicochemical properties (isoelectric point (pI), number of positive and negative amino acids (R+/−), extinction coefficient (EC), instability index (II), aliphatic index (AI), grand average of hydropathicity (GRAVY) and total number of atoms (TNA)) for β-glucosidases from different Trichoderma spp. were determined (Table 2). The isoelectric point of a protein indicates the net charge of a given protein (positive or negative) under physiological conditions, which in turn acts as a good indicator for the protein’s solubility at a given pH. A suggested pI value of 5.55 suggests for TaBgl2 to be considered an acidic protein [33]. By comparison, the value is also close to the pI range of the other Trichoderma spp. β-glucosidases compared (5.10–5.53) and to β-glucosidases from T. reesei, of which the range given was between 4.4 and 8.7 [20].

Table 2.

Physicochemical parameters of β-glucosidases from selected Trichoderma spp., computed using ExPASY’s ProtParam tool.

Description Accession Number pI R+ R− EC (M−1·cm−1) II Stability AI GRAVY Formula TNA
β-Glucosidase 2 (Trichoderma asperellum) * ARW78142.1 5.55 60 52 102,135 31.25 Stable 69.27 −0.450 C2395H3581N631O700S12 7319
Glycoside hydrolase family 1 protein (Trichoderma asperellum CBS 433.97) XP_024766195.1 5.45 60 51 102,135 31.25 Stable 69.27 −0.449 C2393H3575N631O701S12 7312
β-Glucosidase (Trichoderma gamsii) XP_018660766.2 5.23 63 51 103,625 32.33 Stable 68.65 −0.485 C2397H3576N630O706S12 7321
Glycoside hydrolase family 1 protein (Trichoderma atroviride IMI 206040) XP_013939543.1 5.30 62 51 103,625 31.58 Stable 69.89 −0.464 C2401H3587N631O705S12 7336
β-1,4-Glucosidase (Trichoderma virens) AJW67427.1 5.53 57 49 102,135 31.80 Stable 68.86 −0.442 C2338H3489N619O685S12 7143
GH1 β-glucosidase BGL2/CEL1a (Trichoderma guizhouense) OPB39337.1 5.10 64 50 104,990 25.55 Stable 72.19 −0.450 C2409H3589N627O704S11 7340
β-Glucosidase (Trichoderma harzianum) KKP02477.1 5.11 64 50 104,990 26.23 Stable 71.35 −0.454 C2409H3589N627O704S12 7341
β-Glucosidase (Trichoderma reesei) BAA74959.1 5.33 60 49 102,010 27.54 Stable 70.60 −0.396 C2368H3534N628O693S11 7234

* This study; Notes: Isoelectric point (pI), number of positive and negative amino acids (R+/); extinction coefficient (EC); instability index (II); aliphatic index (AI); grand average of hydropathicity (GRAVY); total number of atoms (TNA).

The instability index functions as a protein primary structure-dependent approach for in vivo protein stability predictions. Here, the instability index value of TaBgl2 which was given as 31.25 is again comparable to the range of the other Trichoderma spp. β-glucosidases compared (25.55–32.33), all of which suggests as to their stability, given values are less than 40 [34]. The aliphatic index in turn describes the relative volume of a protein occupied by its aliphatic side chains in which the higher the value, the more thermally stable the protein is predicted to be. The aliphatic index value of TaBgl2 was found to be 69.27, comparable to that of the other Trichoderma spp. β-glucosidases mentioned (68.65–72.19) and suggests the thermal stability of the enzyme [35].

With GRAVY value for TaBgl2 being negative at −0.450, similar consistent negative GRAVY values for Trichoderma spp. β-glucosidases meanwhile collectively suggests as to the high possibility for aqueous interactions [36]. The total number of atoms for the Trichoderma spp. β-glucosidases were between 7143 and 7341, with TaBgl2 containing 7319 in total, further suggests as to its comparable value. Screening through the amino acid compositions (%) of the selected Trichoderma spp. β-glucosidases (Table 3), indicates glycine to be consistently the dominant amino acid, ranging from 7.7% to 9.0%.

Table 3.

Amino acid composition (%) of selected Trichoderma spp. β-glucosidases computed using ExPASy’s ProtParam tool.

Amino Acid ARW78142.1 * XP_024766195.1 XP_018660766.2 XP_013939543.1 AJW67427.1 OPB39337.1 KKP02477.1 BAA74959.1
Ala (A) 7.7 7.7 7.5 7.5 7.7 7.3 7.3 9.0
Arg (R) 5.8 5.8 5.8 5.8 5.9 5.6 5.6 5.8
Asn (N) 4.5 4.7 4.5 4.7 4.8 4.7 4.7 4.1
Asp (D) 7.3 7.3 7.7 7.5 7.3 8.2 8.0 7.9
Cys (C) 1.3 1.3 1.3 1.3 1.3 1.1 1.1 1.1
Gln (Q) 2.8 2.8 2.8 2.8 2.9 2.6 2.6 2.6
Glu (E) 5.6 5.6 5.8 5.8 5.3 5.6 5.8 4.9
Gly (G) 8.0 8.0 8.0 7.7 8.1 8.2 8.2 9.0
His (H) 1.5 1.5 1.5 1.5 1.5 1.5 1.5 1.7
Ile (I) 5.4 5.4 5.8 5.6 5.5 5.2 5.2 5.2
Leu (L) 6.9 6.9 6.7 6.9 6.6 8.0 7.7 6.7
Lys (K) 5.4 5.2 5.2 5.2 4.8 5.2 5.2 4.7
Met (M) 1.3 1.3 1.3 1.3 1.3 1.3 1.5 1.3
Phe (F) 6.5 6.5 6.2 6.2 6.4 6.2 6.2 6.0
Pro (P) 5.6 5.6 5.8 5.6 5.5 5.8 5.8 6.0
Ser (S) 6.0 6.0 5.8 5.6 6.2 5.2 5.2 5.8
Thr (T) 6.0 6.0 6.0 6.2 6.2 5.6 5.6 5.2
Trp (W) 2.6 2.6 2.6 2.6 2.6 2.6 2.6 2.6
Tyr (Y) 5.2 5.2 5.4 5.4 5.3 5.6 5.6 5.2
Val (V) 4.7 4.7 4.3 4.7 4.8 4.7 4.7 5.4

* This study.

ProtScale tool analysis in turn led to prediction of minimum and maximum hydrophobic positions and scores for each β-glucosidase predicted (Table 4). Hydropathy plots constructed by ProtScale tool (Supplementary Figure S1) indicated to the absence of a 19-residue segment averaging greater than +1.6 for all β-glucosidases, which in turn suggests low probability for a membrane-bound segment of the protein, given that membrane spanning sequences are distinguishable to those that pass through a protein’s center, by the former’s higher hydropathy [37].

Table 4.

Comparison of hydrophobic scores and positions of selected Trichoderma spp. β-glucosidases using ExPASY ProtScale tool.

Description Accession Number Position Score
Min Max Min Max
β-Glucosidase 2 (Trichoderma asperellum) * ARW78142.1 445 114 −2.444 1.667
Glycoside hydrolase family 1 protein (Trichoderma asperellum CBS 433.97) XP_024766195.1 445 114 −2.444 1.667
β-Glucosidase (Trichoderma gamsii) XP_018660766.2 445 114 −2.444 1.667
Glycoside hydrolase family 1 protein (Trichoderma atroviride IMI 206040) XP_013939543.1 445 114 −2.444 1.667
β-1,4-Glucosidase (Trichoderma virens) AJW67427.1 445 114 −2.444 1.667
GH1 β-glucosidase BGL2/CEL1a (Trichoderma guizhouense) OPB39337.1 445 195 −2.444 1.689
β-Glucosidase (Trichoderma harzianum) KKP02477.1 445 195 −2.444 1.689
β-Glucosidase (Trichoderma reesei) BAA74959.1 446 114 −2.489 1.667

* This study.

2.5. Protein Localization

In predicting the presence of signal peptides, SignalP 5.0 and Phobius was utilized (Figure 4A,B, respectively), with the likelihood of signal peptide at 0.0213 suggested by the former. In terms of protein localization, the relatively low probability value suggests the inability to detect for presence of any signal peptide to be non-cytoplasmic [38], a conclusion further supported by Phobius analysis. MitoProt analysis further indicated probability of export to mitochondria to be at 7.08%. Subcellular localization was predicted from PSORT II and DeepLoc 1.0 (Figure 4C). The resultant values of 0.565 and 0.6021 obtained respectively suggest as to the highest probability of the protein to be present predominantly in the cytoplasm, with the latter value suggestive of the protein being in soluble form (Supplementary Table S3).

Figure 4.

Figure 4

Figure 4

The subcellular position of predicted TaBgl2 protein presented by (A) SignalP 5.0, (B) Phobius and (C) DeepLoc-1.0 software.

Such positioning may offer insight as to the function of TaBgl2 in host T. asperellum UPM1, as implied from other similarly located β-glucosidase from Trichoderma sp. In a previous study, a GH1 β-glucosidase designated as Cel1A from T. reesei which also exhibited intracellular expression was argued to be involved in the induction of cellulase genes by lactose [39]. Guo et al. [32] further proposed that the protein was involved in the generation of sophorose by means of transglycosylation activities, again for the purpose of cellulase induction. To account for the intracellular expression, focus was placed into the N-terminal signal sequence. In a comparative study on the N-terminal amino acid sequences of extracellular Humicola grisea BGL4 and intracellular T. reesei BGLII, the positioning of aspartate residue was considered to be key. As an acidic residue, aspartate is not often found in the amino terminal of signal sequences, having instead one or two positively charged amino acids, such as arginine, lysine or histidine [40].

To the purpose of affirming this hypothesis, alignment of N-terminal amino acid sequences of H. grisea BGL4, T. reesei BGLII and TaBgl2 was done (Figure 5). In both T. reesei BGLII and H. grisea Bgl4, lysine (K) was found to be present near to the aspartate (D) residue. However, a noticeable difference can be seen in their relative positioning, of which in H. grisea BGL4, the lysine is two residues after aspartate, while in T. reesei BGLII, the lysine is located exactly prior to the aspartate, similar to that seen in TaBgl2. Thus, it has been suggested that not only is the presence of key amino acids critical, the difference in relative position of lysine to the aspartate residue could also account for the effectiveness of the signal sequence.

Figure 5.

Figure 5

Alignment of N-terminal amino acid sequences of Humicola grisea Bgl4, Trichoderma reesei BglII and Trichoderma asperellum Bgl2 (TaBgl2). Boxed in blue and yellow are aspartate and lysine residues, respectively.

2.6. Phylogenetic Analysis

Phylogeny analysis via a neighbor-joining method to seven β-glucosidases of known Trichoderma spp. origins and several known glucose tolerant β-glucosidases was carried out (Figure 6). TaBgl2 was positioned to be clustered with putative β-glucosidases from Trichoderma spp., with closest relationship to GH1 protein from T. asperellum 433.97 (XP_024766195.1). Of further note is the further distance between TaBgl2 to known glucose tolerant β-glucosidases, which is suggestive of a lower glucose tolerance for the protein isolated.

Figure 6.

Figure 6

Neighbor-joining phylogram of selected homologous glucose tolerant β-glucosidases and TaBgl2 (ARW78142.1; boxed red) generated via MEGA X. The bootstrap consensus tree is inferred from 1000 replicates, with the confidence values shown next to the branches.

2.7. Heterologous Expression of TaBgl2 in E. coli

As previously mentioned, comparable structural features and motifs of TaBgl2 to Cel1A elucidated [32] have been identified. Hence, following heterologous expression, it is hereby argued to provide a basis in demonstrating the feasibility of applying in-silico bioinformatics analyses for the prediction of enzyme activity, specifically in the presence of glucose inhibitors. As per Genscript source calculations, codon optimization on TaBgl2 gene resulted in increased codon adaptation index from 0.68 to 0.89 (Table 5). Both host and TaBgl2 transformants were then subjected to cultivation and from crude protein samples, heterologous protein production was suggested by the presence of protein bands of approximately 52 kDa in size from both periplasmic and soluble cytoplasmic fractions (Figure 7). In the absence of post-translational modification, the presence of bands corresponding to the 50 kb band of the protein ladder utilized are suggestive of the theoretical protein size of 52 kb calculated for TaBgl2 and falls within the molecular weight range (between 51.6 and 52.9 kDa) of β-glucosidases from other Trichoderma spp. (Supplementary Table S2).

Table 5.

Codon usage of Escherichia coli corresponding to each amino acid and ‘stop’ signal (abbreviated), compared to codons present in native (bgl2n) and codon optimized (bgl2co) TaBgl2 gene sequences.

Amino Acid Codon Frequency Amino Acid Codon Frequency
E. coli bgl2n bgl2co E. coli bgl2n bgl2co
ILE ATT 0.49 0.28 0.56 TRP TGG 1.00 1.00 1.00
ATC 0.39 0.72 0.44 CYS TGT 0.46 0.17 0.00
ATA 0.11 0.00 0.00 TGC 0.54 0.83 1.00
LEU CTT 0.12 0.16 0.00 ALA GCT 0.18 0.17 0.00
CTC 0.10 0.16 0.00 GCC 0.26 0.61 0.00
CTA 0.04 0.00 0.00 GCA 0.23 0.06 0.00
CTG 0.47 0.50 1.00 GCG 0.33 0.17 1.00
TTA 0.14 0.00 0.00 GLY GGT 0.35 0.14 0.59
TTG 0.13 0.19 0.00 GGC 0.37 0.57 0.41
VAL GTT 0.28 0.27 0.50 GGA 0.13 0.24 0.00
GTC 0.20 0.41 0.00 GGG 0.15 0.05 0.00
GTA 0.17 0.05 0.00 PRO CCT 0.18 0.15 0.00
GTG 0.35 0.27 0.50 CCC 0.13 0.50 0.00
TTT 0.58 0.47 0.47 CCA 0.20 0.23 0.00
TTC 0.42 0.53 0.53 CCG 0.49 0.12 1.00
MET ATG 1.00 1.00 1.00 THR ACT 0.19 0.18 0.00
STOP TAA 0.61 1.00 1.00 ACC 0.40 0.36 1.00
TGA 0.30 0.00 0.00 ACA 0.17 0.11 0.00
TAG 0.09 0.00 0.00 ACG 0.25 0.36 0.00
PHE TTT 0.58 0.47 0.47 TYR TAT 0.59 0.33 0.46
TTC 0.42 0.53 0.53 TAC 0.41 0.67 0.54
SER TCT 0.17 0.21 0.00 ARG CGT 0.36 0.04 1.00
TCC 0.15 0.21 0.00 CGC 0.36 0.52 0.00
TCA 0.14 0.07 0.00 CGA 0.07 0.22 0.00
TCG 0.14 0.21 0.00 CGG 0.11 0.04 0.00
AGT 0.16 0.04 0.00 AGA 0.07 0.15 0.00
AGC 0.25 0.25 1.00 AGG 0.04 0.04 0.00
GLN CAA 0.34 0.15 0.31 ASP GAT 0.63 0.44 0.47
CAG 0.66 0.85 0.69 GAC 0.37 0.56 0.53
ASN AAT 0.49 0.14 0.00 LYS AAA 0.74 0.24 0.40
AAC 0.51 0.86 1.00 AAG 0.26 0.76 0.60
HIS CAT 0.57 0.14 0.00 GLU GAA 0.68 0.27 0.42
CAC 0.43 0.86 1.00 GAG 0.32 0.73 0.58

Figure 7.

Figure 7

Sodium dodecyl sulfate-polyacrylamide gel electrophoresis (SDS-PAGE) profile of fraction samples (periplasm, cytoplasm and insoluble) for expressed protein visualization from crude protein extracts of host Escherichia coli and transformant TaBgl2 carrying plasmid pET-20b(+)-bgl2. Protein sizes were compared to bands separated from PageRuler™ Unstained Protein Ladder (Lane 1, in all gels) with a ~50 kDa band corresponding to the deduced recombinant TaBgl2 in the E. coli BL21(DE3) transformant (boxed yellow). H, Host; T, Transformant. Biological sample triplicates were prepared for all SDS-PAGE profiles.

Subsequent β-glucosidase activity assay on ρNPG substrate was carried out on both fractions from host and transformant (Figure 8). Enzymatic β-glucosidase activity assay was significantly 11.1-fold higher in the transformants compared to the host, supporting the successful expression of heterologous TaBgl2 from transformants as opposed to that present in host strain. Between the transformants, periplasmic fraction demonstrated the highest specific enzyme activity (8.1 × 10−3 U/mg) compared to the cytoplasm (7.0 × 10−3 U/mg). This is expected given the presence of the N-terminal pelB signal peptide, directing protein expression to the periplasm.

Figure 8.

Figure 8

Specific activity (U/mg) of crude enzyme extracts from the periplasmic and cytoplasmic fractions of Escherichia coli BL21(DE3) host and transformant TaBgl2 carrying plasmid pET-20b(+)-bgl2 with ρ-nitrophenyl-β-d-glucopyranoside (ρNPG) used as a substrate. Biological sample triplicates were prepared for analysis and the specific activities in the periplasm and cytoplasm of host and transformant were shown to be significant (p < 0.05), as per Analysis of Variance (ANOVA) Single Factor analysis.

Additionally, while transformants exhibit higher specific activity across both fractions, the basal activities of β-glucosidase observed in host samples are possibly attributable to the expression of bglX, a β-d-glucosidase located on the periplasm. The inherent gene coding for bglX is located adjacent to the did gene on the E. coli chromosome, with the 2.6 kb DNA sequence fragment revealing an ORF encoding a protein with 765 amino acids in length [41]. In the presence of glucose, relative enzyme activity of heterologous TaBgl2 was retained by as much as 40% in glucose concentration of up to 0.2 M (Figure 9). This was found to be comparable to that of Cel1A from T. reesei, which suggests the feasibility of utilizing in-silico analyses and subsequent heterologous expression for the prediction of glucose tolerance.

Figure 9.

Figure 9

Glucose tolerance of recombinant TaBgl2 against glucose concentrations up to 0.25 M, expressed in relative activity (%). Biological sample triplicates were prepared for analysis.

3. Materials and Methods

3.1. Strains and Plasmid Vector

Trichoderma asperellum UPM1 was a local isolate derived from rotten oil palm fruit bunch with species identification carried out via 18S rDNA methodology [42]. Escherichia coli BL21 (DE3) and plasmid pET-20b(+) were used for the expression of heterologous TaBgl2, targeted to the periplasm (Genscript, Piscataway, NJ, USA).

3.2. Isolation of TaBgl2 Nucleotide Sequence

Stock cultures of T. asperellum UPM1 were grown on potato dextrose agar (PDA) at 30 °C for 7 days prior to spore resuspension in 5% Tween 80. Mycelial hyphae were cultivated via submerged state fermentation in Mendel’s media (2 g/L potassium dihydrogen phosphate, 0.3 g/L magnesium sulphate heptahydrate, 0.3 g/L calcium chloride dihydrate, 1.4 g/L ammonium sulphate, 0.75 g/L peptone, 0.002% Tween 80) with trace element (0.005 g/L ferrous sulphate, 0.016 g/L manganese (II) sulphate, 0.014 g/L zinc sulphate heptahydrate, 0.002 g/L cobalt (II) chloride). Carboxymethylcellulose (CMC) at 1% was used as a carbon source and 1% β-lactose as an inducer at pH 5 with 106 conidia per flask for inoculation [43]. Harvest of mycelia was done by vacuum filtration [44] using pre-sterilized nylon filters (GE Healthcare Life Sciences, Buckinghamshire, UK).

Total RNA was prepared from 0.1 g of wet mycelium by grinding using liquid nitrogen as per protocol provided with the GeneMATRIX Universal RNA Purification Kit (EURx, Gdansk, Poland). First strand cDNA was amplified using RevertAid First Strand cDNA Synthesis Kit (Thermo Scientific, Waltham, MA, USA) with 1 μg of total RNA as a template. Partial TaBgl2 sequence amplification was carried out using degenerate primers (Table 6). PCR reactions proceeded with initial denaturation at 94 °C for 2 min, and 35 cycles of 94 °C for 30 s, 52.6 °C for 30 s, and 72 °C for 1 min, followed by final extension at 72 °C for 10 min [45]. PCR amplification consisted of 2.0 μL of 10× Taq buffer with KCl, 1.5 mM MgCl2, 0.8 mM deoxyribonucleoside triphosphate (dNTP) solution, 0.8 μM each of forward and reverse primers, and 2.5 × 10−2 U/μL of Taq DNA polymerase to a total volume of 20 μL. Polymerase, buffer, MgCl2 and dNTP were all sourced from Thermo Fisher Scientific, Waltham, MA, USA.

Table 6.

Degenerate primers from the conserved regions of β-glucosidase amino acid sequence variants from Trichoderma sp. belonging to the Bgl2/Cel1A family.

Primer Conserved Region Sequence (5′ to 3′)
DGFAMILYF YQIEGA TAYCARATHGARGGNGC
DGFAMILYR DFYGMN TTCATNCCRTARAARTC

Sequencing procedures were carried out by Apical Scientific Sdn. Bhd., Malaysia. Following sequencing, specific primers were designed (Table 7) for use with rapid amplification of cDNA ends (RACE) PCR as suggested by SMARTer® RACE 5′/3′ Kit User Manual (Clontech Laboratories Inc., Mountain View, CA, USA) until nucleotides coding for TaBgl2 gene sequence was obtained. From elucidated nucleotide sequence, TaBgl2 gene sequence was translated using ExPASy Translate Tool (https://web.expasy.org/translate/) using standard genetic code, for identification of ORF. The gene sequence was submitted to GenBank with the accession number KX772748.1 with the translated protein sequence accessible on NCBI through accession number ARW78142.1.

Table 7.

Gene specific primers designed for rapid amplification of cDNA ends (RACE) PCR. 5′-GATTACGCCAAGCTT-3′ sequence introduced and highlighted in bold, is for cloning into vector, as suggested by SMARTer ® RACE 5′/3′ Kit User Manual.

Primer Sequence (5′ to 3′)
GSPFAM1F GATTACGCCAAGCTTGACGACCTGCTGGAAGCGGGCATCACC
GSPFAM1R GATTACGCCAAGCTTATCGCTGGTGGTCTTGAACTCCTCTCG

3.3. Sequence Alignment and Conserved Motif Identification

Homologous β-glucosidase protein sequences from several Trichoderma spp. and the degrees of identities were identified via BLASTp (protein) available from National Center for Biotechnology Information (NCBI). BioEdit Sequence Alignment Editor version 7.2.6 (North Carolina State University, Raleigh, NC, USA) [46] and ExPASY-PROSITE (SIB Swiss Institute of Bioinformatics, Geneva, Switzerland) [47] were then utilized for amino acid sequence alignment and the identification of conserved motifs, respectively.

3.4. Secondary and Tertiary Structure Predictions

The construction of secondary and tertiary protein structures from primary protein sequence template was conducted using SWISS-MODEL (https://swissmodel.expasy.org/). Structure analyses were further supported with GOR IV tool [25] for the determination of α-helix, β-sheet, turns, and loops.

3.5. Physicochemical Characterization

Physicochemical properties such as pI, number of positive and negative charged residues, extinction coefficient, instability index, aliphatic index, GRAVY, amino acid composition of β-glucosidases retrieved were computed using ProtParam tool (SIB Swiss Institute of Bioinformatics, Geneva, Switzerland) [36]. The determination of pI is by the pK value of protein during protein migration under denaturation conditions [48]. The concentration of purified protein in sample is determined by extinction coefficient value [49]. Protein stability is predicted from the instability index with values smaller than 40 suggesting for stable proteins and the proteins are regarded as unstable at values above 40 [50]. The volume occupied by aliphatic amino acids side chains (alanine, valine, leucine, and isoleucine) relative to total volume occupied is indicated by the aliphatic index and determines a globular protein’s thermostability [51]. GRAVY, which indicates the ratio of sum of hydropathy values of all amino acids to total number of residues in sequence, suggests as to the hydrophilicity or hydrophobicity of the protein [49]. Hydrophobicity scores, positions, and hydropathy plots on Kyte and Doolittle scale for all sequences were predicted by ProtScale tool (SIB Swiss Institute of Bioinformatics, Geneva, Switzerland) [51].

3.6. Protein Localization

Several software tools including SignalP 5.0 (Technical University of Denmark, Kgs Lyngby, Denmark), Phobius (Karolinska Institutet, Stockholm, Sweden), MitoProt (École Normale Supérieure, Paris, France), PSORT II (The University of Tokyo, Tokyo, Japan) and DeepLoc 1.0 (Technical University of Denmark, Kgs Lyngby, Denmark) were used for the prediction of signal peptide, cleavage site locations, subcellular localization, and the cell compartment, of which TaBgl2 is expected to act [52,53,54,55,56].

3.7. Phylogenetic Analysis

Molecular Evolutionary Genetics Analysis (MEGA) X software tool version 10.0.4 (Pennsylvania State University, PA, USA) was used to determine phylogeny of TaBgl2 to selected β-glucosidase sequences. following multiple sequence comparison by log-expectation (MUSCLE) alignment, the neighbor-joining method was used for inferring evolutionary history, with assessment of ‘confidence’ of the results done through 1000 bootstrap replicates [57].

3.8. Heterologous Expression of Recombinant TaBgl2

Codon optimization, subsequent synthesis, cloning into plasmid pET-20b(+) and transformation of TaBgl2 gene into host Escherichia coli BL21(DE3) were carried out by GenScript (Hong Kong, China) Limited (Hong Kong, China). Restrictions sites BamHI and HindIII on multiple cloning site of plasmid pET-20b(+) were selected for insert cloning with the resultant plasmid annotated as pET-20b(+)-bgl2. Transformants were confirmed by selective growth on LB agar plates supplemented with 100 μg/mL ampicillin (Amresco, Solon, OH, USA), with subsequent sequencing procedures using T7 terminator primer on extracted plasmid template that were carried out by Apical Scientific Sdn. Bhd. (Seri Kembangan, Selangor, Malaysia).

E. coli strains harboring expression vector for TaBgl2 were grown in 50 mL of Terrific Broth liquid medium (24 g/L yeast extract, 12 g/L tryptone, 4 mL/L glycerol, 100 mL/L potassium phosphate solution containing 0.17 M KH2PO4 and 0.72 M K2HPO4) supplemented with 100 μg/mL ampicillin (Amresco, Solon, OH, USA) in 250-mL flasks on a rotary shaker (150 rpm) at 37 °C. Cultivation then continued for 14 h with temperature reduced to 25 °C once absorbance at 660 nm reached 0.7. Resultant cultures were later separated, with the fractions designated as ‘extracellular’, ‘periplasm’, and ‘cytoplasm’, respectively.

3.9. Preparation of Extracellular, Periplasmic and Cytoplasmic Protein Fractions

Extracellular fraction was separated by centrifugation at 4 °C, 1880× g for 12 min [58]. The resultant sediment was then re-suspended in 20 mL of 30 mM Tris-HCl, pH 8 with 20% sucrose (Sigma-Aldrich, St. Louis, MI, USA) prior to addition of 1 mM ethylenediaminetetraacetic acid (EDTA), agitated lightly for 10 min at room temperature. Centrifugation was again applied at 4 °C, 10,000× g for 10 min, with the supernatant discarded. The remaining cell sediment was resuspended in 1.5 mL of pre-chilled 5 mM MgSO4, agitated lightly for 10 min on ice prior to centrifugation at 4 °C, 10,000× g for 10 min with the resultant supernatant designated as periplasmic fraction [59].

For the release of intracellular protein fraction, sonication was applied to the sediment obtained, following resuspension in 500 μL of citrate-phosphate buffer, pH 6, on ice for 30 min. In 15 mL centrifuge tubes immersed in ice, sonication proceeded with amplitude set at 50%, pulse of 0.5 s for a duration of 5 min [60]. Centrifugation then proceeded at 4 °C, 16,128× g for 10 min with the resultant supernatant designated as cytoplasmic fraction. Finally, insoluble proteins were isolated by the addition of 1 mL of 6 M urea to the resultant sediment and left for 16 h at room temperature prior to centrifugation at 4 °C, 16,128× g for 10 min with the resultant supernatant designated as insoluble cytoplasmic fraction.

3.10. Determination of Protein Content and β-Glucosidase Activity

For determination of protein content within all fractions, 0.02 mL of each sample was mixed with 1 mL of Bradford reagent (20% (w/v)) comprised of Coomassie Brilliant Blue G-250 (Fisher Scientific, Waltham, MA, USA), dissolved in methanol, 20% (v/v) of 85% phosphoric acid, diluted with distilled water and left to stand at room temperature for 2 min. Optical density was subsequently measured at a wavelength of 595 nm, with values compared to known bovine serum albumin standards [61].

Crude samples were then examined using sodium dodecyl sulfate-polyacrylamide gel electrophoresis (SDS-PAGE) [62] with positive samples subjected to further β-glucosidase activity analyses [63] on ρNPG (Merck, Kenilworth, NJ, USA) and glucose of several concentrations. The assay proceeded by incubation of samples with 0.15 g/L of ρNPG in citrate-phosphate buffer at pH 6, in a total volume of 2.2 mL at 40 °C for 30 min. The reaction was ceased by the addition of 2 mL of 1 M Na2CO3, with the released ρ-nitrophenol content determined at 400 nm. A one-way ANOVA test was conducted for statistical analysis where a value of p < 0.05 was considered significantly different. Sample activity on glucose concentrations of 0.05 to 0.25 M was done under the same conditions as described.

4. Conclusions

We described here the isolation, characterization, and heterologous expression of novel GH1 β-glucosidase from T. asperellum UPM1. Using several in-silico analyses, such as conserved motif identification, physicochemical characterization, phylogeny tree construction, protein localization, and higher structures prediction, a correlation between enzyme sequence and structure to resultant function was deduced. Owing to expected glucose tolerance, experimental works have subsequently proven the function of TaBgl2, with relative activity comparable to previous known protein expression. This work therefore demonstrates the plausibility of comparing novel β-glucosidases to prior homologs for the preliminary assessment of industrial application.

Acknowledgments

The authors appreciate all the members of the Environmental Biotechnology Research Group, UPM for their help and support throughout conducting these experiments and preparation of this manuscript.

Abbreviations

Bgl Beta-glucosidase
ANOVA Analysis of Variance
GH Glycoside hydrolase
ρNPG ρ-nitrophenyl-β-d-glucopyranoside
CAZy Carbohydrate-Active Enzymes
GH1 Glycoside hydrolase family 1
GH3 Glycoside hydrolase family 3
BLAST Basic Local Alignment Search Tool
ORF Open reading frame
MSA Multiple sequence alignments
TIM Triosephosphate isomerase
pI Isoelectric point
GRAVY Grand average of hydropathicity
PDA Potato dextrose agar
CMC Carboxymethylcellulose
RACE Rapid amplification of cDNA ends
EDTA Ethylenediaminetetraacetic acid
SDS-PAGE Sodium dodecyl sulfate-polyacrylamide gel electrophoresis

Supplementary Materials

Supplementary materials can be found at https://www.mdpi.com/1422-0067/21/11/4035/s1.

Author Contributions

Conceptualization: N.R.; Methodology, investigation and data curation: M.F.M.S.; Writing—Original draft preparation: M.F.M.S.; Writing—Review and editing: N.R., S.A.-A., and F.D.A.B.; Supervision: N.R., S.A.-A., and F.D.A.B.; Funding acquisition: N.R. and S.A.-A. All authors have read and agreed to the final version of the manuscript.

Funding

This work was supported by the Putra Grants GP-IPM (GP-IPM/2013/9402000) and GP-IPS (GP-IPS/2014/9429400) provided by Universiti Putra Malaysia (UPM). The authors gratefully acknowledge the scholarship of Bumiputera Academic Training Scheme (SLAB) from Universiti Malaysia Perlis (UniMAP) and Ministry of Education, Malaysia.

Conflicts of Interest

The authors declare that there is no conflict of interest.

References

  • 1.Schuster A., Schmoll M. Biology and biotechnology of Trichoderma. Appl. Microbiol. Biotechnol. 2010;87:787–799. doi: 10.1007/s00253-010-2632-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 2.Strakowska J., Błaszczyk L., Chełkowski J. The significance of cellulolytic enzymes produced by Trichoderma in opportunistic lifestyle of this fungus. J. Basic Microbiol. 2014;54:S2–S13. doi: 10.1002/jobm.201300821. [DOI] [PubMed] [Google Scholar]
  • 3.Brotman Y., Kapuganti J.G., Viterbo A. Trichoderma . Curr. Biol. 2010;20:R390–R391. doi: 10.1016/j.cub.2010.02.042. [DOI] [PubMed] [Google Scholar]
  • 4.Paloheimo M., Haarmann T., Mäkinen S., Vehmaanperä J. Production of industrial enzymes in Trichoderma reesei. In: Schmoll M., Dattenböck C., editors. Gene Expression Systems in Fungi: Advancements and Applications. Springer; Cham, Switzerland: 2016. pp. 23–57. [DOI] [Google Scholar]
  • 5.Zhang X.-Z., Zhang Y.-P. Cellulases: Characteristics, sources, production, and applications. In: Yang S.-T., El-Enshasy H.A., Thongchul N., editors. Bioprocessing Technologies in Biorefinery for Sustainable Production of Fuels, Chemicals, and Polymers. Volume 1. John Wiley & Sons, Inc.; Hoboken, NJ, USA: 2013. pp. 131–146. [Google Scholar]
  • 6.Zhang Y.-H.P., Lynd L. A functionally based model for hydrolysis of cellulose by fungal cellulase. Biotechnol. Bioeng. 2006;94:888–898. doi: 10.1002/bit.20906. [DOI] [PubMed] [Google Scholar]
  • 7.Väljamäe P., Pettersson G., Johansson G. Mechanism of substrate inhibition in cellulose synergistic degradation. Eur. J. Biochem. 2001;268:4520–4526. doi: 10.1046/j.1432-1327.2001.02377.x. [DOI] [PubMed] [Google Scholar]
  • 8.Yue Z., Bin W., Baixu Y., Peiji G. Mechanism of cellobiose inhibition in cellulose hydrolysis by cellobiohydrolase. Sci. China C Life Sci. 2004;47:18–24. doi: 10.1360/02yc0163. [DOI] [PubMed] [Google Scholar]
  • 9.Hsieh C.C., Cannella D., Jørgensen H., Felby C., Thygesen L.G. Cellulase inhibition by high concentrations of monosaccharides. J. Agric. Food Chem. 2014;62:3800–3805. doi: 10.1021/jf5012962. [DOI] [PubMed] [Google Scholar]
  • 10.Sørensen A., Lübeck M., Lübeck P.S., Ahring B.K. Fungal beta-glucosidases: A bottleneck in industrial use of lignocellulosic materials. Biomolecules. 2013;3:612–631. doi: 10.3390/biom3030612. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11.Rani V., Mohanram S., Tiwari R., Nain L., Arora A. Beta-glucosidase: Key enzyme in determining efficiency of cellulase and biomass hydrolysis. J. Bioprocess Biotech. 2014;5:1–8. doi: 10.4172/2155-9821.1000197. [DOI] [Google Scholar]
  • 12.Ahmed A., Nasim F., Batool K., Bibi A. Microbial β-glucosidase: Sources, production and applications. J. Appl. Environ. Microbiol. 2017;5:31–46. doi: 10.12691/jaem-5-2-2. [DOI] [Google Scholar]
  • 13.Yang Y., Zhang X., Yin Q., Fang W., Fang Z., Wang X., Zhang X., Xiao Y. A mechanism of glucose tolerance and stimulation of GH1 β-glucosidases. Sci. Rep. 2015;5:17296–17307. doi: 10.1038/srep17296. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14.De Giuseppe P.O., Souza T.A.C.B., Souza F.H.M., Zanphorlin L.M., Machado C.B., Ward R.J., Jorge J.A., Furriel R.P.M., Murakami M.T. Structural basis for glucose tolerance in GH1 β-glucosidases. Acta Crystallogr. D Biol. Crystallogr. 2014;70:1631–1639. doi: 10.1107/S1399004714006920. [DOI] [PubMed] [Google Scholar]
  • 15.Yilmaz-Temel H., Vardar-Sukan F. An engineering approach to bioinformatics and its applications. In: Hakeem K., Malik A., Vardar-Sukan F., Ozturk M., editors. Plant Bioinformatics. Springer; Berlin, Germany: 2017. pp. 417–443. [DOI] [Google Scholar]
  • 16.Marx I.J., van Wyk N., Smit S., Jacobson D., Viljoen-bloom M., Volschenk H. Comparative secretome analysis of Trichoderma asperellum S4F8 and Trichoderma reesei Rut C30 during solid-state fermentation on sugarcane bagasse. Biotechnol. Biofuels. 2013;6 doi: 10.1186/1754-6834-6-172. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 17.Pervez M.T., Babar M.E., Nadeem A., Aslam M., Awan A.R., Aslam N., Hussain T., Naveed N., Qadri S., Waheed U., et al. Evaluating the accuracy and efficiency of multiple sequence alignment methods. Evol. Bioinform. 2014;10:205–217. doi: 10.4137/EBO.S19199. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18.Henrissat B. A classification of glycosyl hydrolases-based on amino acid sequence similarities. Biochem. J. 1991;280:309–316. doi: 10.1042/bj2800309. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 19.Seshadri S., Akiyama T., Opassiri R., Kuaprasert B., Cairns J.K. Structural and enzymatic characterization of Os3BGlu6, a rice β-glucosidase hydrolysing hydrophobic glycosides and (1→3)- and (1→2)-linked disaccharides. Plant Physiol. 2009;151:47–58. doi: 10.1104/pp.109.139436. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 20.Tiwari P., Misra B.N., Sangwan N.S. β-Glucosidases from the fungus Trichoderma: An efficient cellulase machinery in biotechnological applications. BioMed Res. Int. 2013;2013 doi: 10.1155/2013/203735. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 21.Koshland D.E. Stereochemistry and the mechanism of enzymatic reactions. Biol. Rev. 1953;28:416–436. doi: 10.1111/j.1469-185X.1953.tb01386.x. [DOI] [Google Scholar]
  • 22.Lee H.L., Chang C.K., Jeng W.Y., Wang A.H.J., Liang P.H. Mutations in the substrate entrance region of β-glucosidase from Trichoderma reesei improve enzyme activity and thermostability. Prot. Eng. Des. Sel. 2012;25:733–740. doi: 10.1093/protein/gzs073. [DOI] [PubMed] [Google Scholar]
  • 23.Beattie K.E., Ferrari L.D., Mitchell J.B.O. Why do sequence signatures predict enzyme mechanism? Homology versus chemistry. Evol. Bioinform. 2015;11:267–274. doi: 10.4137/EBO.S31482. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 24.Mariano D.C.B., Leite C., Santos L.H.S., Marins L.F., Machado K.S., Werhli A.V., Lima L.H.F., de Melo-Minardi R.C. Characterization of glucose-tolerant β-glucosidases used in biofuel production under the bioinformatics perspective: A systematic review. Genet. Mol. Res. 2017;16:1–19. doi: 10.4238/gmr16039740. [DOI] [PubMed] [Google Scholar]
  • 25.Garnier J., Gibrat J.F., Robson B. GOR method for predicting protein secondary structure from amino acid sequence. Method. Enzymol. 1996;266:540–553. doi: 10.1016/S0076-6879(96)66034-0. [DOI] [PubMed] [Google Scholar]
  • 26.Jeng W.Y., Wang N.C., Lin C.T., Liaw Y.C., Chang W.J., Liu C.I., Liang P.H., Wang A.H. Structural and functional analysis of three beta-glucosidases from bacterium Clostridium cellulovorans, fungus Trichoderma reesei and termite Neotermes koshunensis. J. Struct. Biol. 2011;173:46–56. doi: 10.1016/j.jsb.2010.07.008. [DOI] [PubMed] [Google Scholar]
  • 27.Wierenga R.K. The TIM-barrel fold: A versatile framework for efficient enzymes. FEBS Lett. 2001;492:193–198. doi: 10.1016/S0014-5793(01)02236-0. [DOI] [PubMed] [Google Scholar]
  • 28.Pang P., Cao L.-C., Liu Y.-H., Xie W., Wang Z. Structures of a glucose-tolerant β-glucosidase provide insights into its mechanism. J. Struct. Biol. 2017;198:154–162. doi: 10.1016/j.jsb.2017.02.001. [DOI] [PubMed] [Google Scholar]
  • 29.Jabs A., Weiss M., Hilgenfeld R. Non-proline cis peptide bonds in proteins. J. Mol. Biol. 1999;286:291–304. doi: 10.1006/jmbi.1998.2459. [DOI] [PubMed] [Google Scholar]
  • 30.Juers D.H., Heightman T.D., Vasella A., McCarter J.D., Mackenzie L., Withers S.G., Matthews B.W. Reversible lattice repacking illustrates the temperature dependence of macromolecular interactions. Biochemistry. 2001;40:14781–14794. doi: 10.1021/bi011727i. [DOI] [PubMed] [Google Scholar]
  • 31.Rutkiewicz-Krotewicz M., Pietrzyk-Brzezinska A.J., Sekula B., Cieśliński H., Wierzbicka-Woś A., Kur J., Bujacz A. Structural studies of a cold-adapted dimeric β-D-galactosidase from Paracoccus sp. 32d. Acta Crystallogr. Sect. D Struct. Biol. 2016;72:1049–1061. doi: 10.1107/S2059798316012535. [DOI] [PubMed] [Google Scholar]
  • 32.Guo B., Sato N., Biely P., Amano Y., Nozaki K. Comparison of catalytic properties of multiple β-glucosidases of Trichoderma reesei. Appl. Microbiol. Biotechnol. 2016;100:4959–4968. doi: 10.1007/s00253-016-7342-x. [DOI] [PubMed] [Google Scholar]
  • 33.Halligan B.D. ProMoST: A tool for calculating the pI and molecular mass of phosphorylated and modified proteins on two-dimensional gels. In: Graauw M., editor. Phospho-Proteomics, Methods and Molecular Biology. Volume 527. Humana Press; Totowa, NJ, USA: 2009. pp. 283–298. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 34.Gamage D.G., Gunaratne A., Periyannan G.R., Russell T.G. Applicability of instability index for in vitro protein stability prediction. Protein Peptide Lett. 2019;26:339–347. doi: 10.2174/0929866526666190228144219. [DOI] [PubMed] [Google Scholar]
  • 35.Panda S., Chandra G. Physicochemical characterization and functional analysis of some snake venom toxin proteins and related non-toxin proteins of other chordates. Bioinformation. 2012;8:891–896. doi: 10.6026/97320630008891. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 36.Kumar N.V., Rani M.E., Gunaseeli R., Kannan N.D., Sridhar J. Modeling and structural analysis of cellulases using Clostridium thermocellum as template. Bioinformation. 2012;8:1105–1110. doi: 10.6026/97320630081105. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 37.Kyte J., Doolittle R.F. A simple method for displaying the hydropathic character of a protein. J. Mol. Biol. 1982;157:105–132. doi: 10.1016/0022-2836(82)90515-0. [DOI] [PubMed] [Google Scholar]
  • 38.Kolahi M., Yazdi M., Goldson-Barnaby A., Tabandeh M.R. In silico prediction, phylogenetic and bioinformatic analysis of SoPCS gene, survey of its protein characterization and gene expression in response to cadmium in Saccharum officinarum. Ecotox. Environ. Safe. 2018;163:7–18. doi: 10.1016/j.ecoenv.2018.07.032. [DOI] [PubMed] [Google Scholar]
  • 39.Xu J., Zhao G., Kou Y., Zhang W., Zhou Q., Chen G., Liu W. Intracellular β-glucosidases CEL1a and CEL1b are essential for cellulase induction on lactose in Trichoderma reesei. Eukaryot. Cell. 2014;13:1001–1013. doi: 10.1128/EC.00100-14. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 40.Saloheimo M., Kuja-Panula J., Ylösmäki E., Ward M., Penttilä M. Enzymatic properties and intracellular localization of the novel Trichoderma reesei beta-glucosidase BGLII (Cel1A) Appl. Environ. Microbiol. 2002;68:4546–4553. doi: 10.1128/AEM.68.9.4546-4553.2002. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 41.Yang M., Luoh S., Goddard A., Reilly D., Henze W., Bass S. The bgIX gene located at 47.8 min on the Escherichia coli chromosome encodes a periplasmic β-glucosidase. Microbiology. 1996;142:1659–1665. doi: 10.1099/13500872-142-7-1659. [DOI] [PubMed] [Google Scholar]
  • 42.Abu Bakar N.K., Abd-Aziz S., Hassan M.A., Ghazali F.M. Isolation and selection of appropriate cellulolytic mixed microbial cultures for cellulases production from oil palm empty fruit bunch. Biotechnology. 2010;9:73–78. doi: 10.3923/biotech.2010.73.78. [DOI] [Google Scholar]
  • 43.Sarah R., Motallebi M., Zamani M.R. Cloning and characterization of β-1, 4 glucosidase 2 (Bgl2) gene from a high producer cellulolytic enzyme Trichoderma harzianum (T7) World Appl. Sci. J. 2007;2:315–322. [Google Scholar]
  • 44.Schumann U., Smith N.A., Wang M.B. A fast and efficient method for preparation of high-quality RNA from fungal mycelia. BMC Res. Notes. 2013;6 doi: 10.1186/1756-0500-6-71. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 45.Liu D., Zhang R., Yang X., Xu Y., Tang Z., Tian W., Shen Q. Expression, purification and characterization of two thermostable endoglucanases cloned from a lignocellulosic decomposing fungi Aspergillus fumigatus Z5 isolated from compost. Protein Expres. Purif. 2011;79:176–186. doi: 10.1016/j.pep.2011.06.008. [DOI] [PubMed] [Google Scholar]
  • 46.Hall T.A. Nucleic Acids Symposium Series. Volume 41. Information Retrieval Ltd.; London, UK: 1999. BioEdit: A user-friendly biological sequence alignment editor and analysis program for Windows 95/98/NT; pp. 95–98. [Google Scholar]
  • 47.Sigrist C.J.A., Cerutti L., Hulo N., Gattiker A., Falquet L., Pagni M., Bairoch A., Bucher P. PROSITE: A documented database using patterns and profiles as motif descriptors. Briefings Bioinform. 2002;3:265–274. doi: 10.1093/bib/3.3.265. [DOI] [PubMed] [Google Scholar]
  • 48.Bjellqvust B., Hughes G.J., Pasquali C., Paquet N., Ravier F., Sanchez J.C., Frutiger S., Hochstrasser D. The focusing positions of polypeptides in immobilized pH gradients can be predicted from their amino acid sequences. Electrophoresis. 1993;14:1023–1031. doi: 10.1002/elps.11501401163. [DOI] [PubMed] [Google Scholar]
  • 49.Umang M., Astha J., Aastha C., Neha A., Vibha R. Computational structural and functional characterization of protein family: Key for the hidden mystery. J. Pharm. Res. 2012;5:3643–3649. [Google Scholar]
  • 50.Guruprasad K., Reddy B., Pandit M.W. Correlation between stability of a protein and its dipeptide composition: A novel approach for predicting in vivo stability of a protein from its primary sequence. Protein Eng. Des. Sel. 1990;4:155–161. doi: 10.1093/protein/4.2.155. [DOI] [PubMed] [Google Scholar]
  • 51.Gasteiger E., Hoogland C., Gattiker A., Duvaud S., Wilkins M.R., Appel R.D., Bairoch A. Protein identification and analysis tools on the ExPASy server. In: Walker J.M., editor. The Proteomics Protocols Handbook. Humana Press; Totowa, NJ, USA: 2005. pp. 571–607. [DOI] [Google Scholar]
  • 52.Armenteros J.J.A., Tsirigos K.D., Sønderby C.K., Petersen T.N., Winther O., Brunak S., von Heijne G., Nielsen H. SignalP 5.0 improves signal peptide predictions using deep neural networks. Nat. Biotechnol. 2019;37:420–423. doi: 10.1038/s41587-019-0036-z. [DOI] [PubMed] [Google Scholar]
  • 53.Armenteros J.J.A., Sønderby C.K., Sønderby S.K., Nielsen H., Winther O. DeepLoc: Prediction of protein subcellular localization using deep learning. Bioinformatics. 2017;33:3387–3395. doi: 10.1093/bioinformatics/btx431. [DOI] [PubMed] [Google Scholar]
  • 54.Käll L., Krogh A., Sonnhammer E.L.L. Advantages of combined transmembrane topology and signal peptide prediction—The Phobius web server. Nucleic Acids Res. 2007;35:W429–W432. doi: 10.1093/nar/gkm256. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 55.Claros M.G. MitoProt, a Macintosh application for studying mitochondrial proteins. Bioinformatics. 1995;11:441–447. doi: 10.1093/bioinformatics/11.4.441. [DOI] [PubMed] [Google Scholar]
  • 56.Horton P., Park K.J., Obayashi T., Fujita N., Harada H., Adams-Collier C.J., Nakai K. WoLF PSORT: Protein localization predictor. Nucleic Acids Res. 2007;35:W585–W587. doi: 10.1093/nar/gkm259. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 57.Kumar S., Stecher G., Li M., Knyaz C., Tamura K. MEGA X: Molecular evolutionary genetics analysis across computing platforms. Mol. Biol. Evol. 2018;35:1547–1549. doi: 10.1093/molbev/msy096. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 58.Gupta V.J., Kubicek C.P., Berrin J.-G., Wilson D.W., Couturier M., Berlin A., Filho E.X.F., Ezeji T. Fungal enzymes for bio-products from sustainable and waste biomass. Trends Biochem. Sci. 2016;41:633–645. doi: 10.1016/j.tibs.2016.04.006. [DOI] [PubMed] [Google Scholar]
  • 59.Ausubel F.M., Brent R., Kingston R.E., Moore D.D., Seidman J.G., Smith J.A., Struhl K. Current Protocols in Molecular Biology. John Wiley & Sons; New York, NY, USA: 1989. [Google Scholar]
  • 60.Wieteska L., Ionov M., Szemraj J., Feller C., Kolinski A., Gront D. Improving thermal stability of thermophilic l-threonine aldolase from Thermotoga maritima. J. Biotechnol. 2015;199:69–76. doi: 10.1016/j.jbiotec.2015.02.013. [DOI] [PubMed] [Google Scholar]
  • 61.Smyth T.J.P., Perfumo A., McClean S., Marchant R., Banat I.M. Isolation and analysis of lipopeptides and high molecular weight biosurfactants. In: Timmis K.N., McGenity T.J., van der Meer J.R., de Lorenzo M., editors. Handbook of Hydrocarbon and Lipid Microbiology. Springer; Heidelberg, Germany: 2010. pp. 3589–3703. [DOI] [Google Scholar]
  • 62.Laemmli U.K. Cleavage of structural proteins during the assembly of the head of bacteriophage T4. Nature. 1970;227:680–685. doi: 10.1038/227680a0. [DOI] [PubMed] [Google Scholar]
  • 63.Ariffin H., Abdullah N., Kalsom M.S.U., Shirai Y., Hassan M.A. Production and characterization of cellulase by Bacillus pumilus EB3. Int. J. Eng. Technol. 2006;3:47–53. [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials


Articles from International Journal of Molecular Sciences are provided here courtesy of Multidisciplinary Digital Publishing Institute (MDPI)

RESOURCES