Skip to main content
The Journal of Biological Chemistry logoLink to The Journal of Biological Chemistry
. 2017 Jan 27;292(12):4996–5006. doi: 10.1074/jbc.M116.766535

The Structure of an Archaeal β-Glucosaminidase Provides Insight into Glycoside Hydrolase Evolution*

Shouhei Mine ‡,1,2, Masahiro Watanabe §,1,3, Saori Kamachi §, Yoshito Abe , Tadashi Ueda
PMCID: PMC5377812  PMID: 28130448

Abstract

The archaeal exo-β-d-glucosaminidase (GlmA) is a dimeric enzyme that hydrolyzes chitosan oligosaccharides into monomer glucosamines. GlmA is a member of the glycosidase hydrolase (GH)-A superfamily-subfamily 35 and is a novel enzyme in terms of its primary structure. Here, we present the crystal structure of GlmA in complex with glucosamine at 1.27 Å resolution. The structure reveals that a monomeric form of GlmA shares structural homology with GH42 β-galactosidases, whereas most of the spatial positions of the active site residues are identical to those of GH35 β-galactosidases. We found that upon dimerization, the active site of GlmA changes shape, enhancing its ability to hydrolyze the smaller substrate in a manner similar to that of homotrimeric GH42 β-galactosidase. However, GlmA can differentiate glucosamine from galactose based on one charged residue while using the “evolutionary heritage residue” it shares with GH35 β-galactosidase. Our study suggests that GH35 and GH42 β-galactosidases evolved by exploiting the structural features of GlmA.

Keywords: archaea, evolution, galactose, glycoside hydrolase, X-ray crystallography, glucosamine

Introduction

Chitin is a polysaccharide consisting of β-1,4-linked N-acetylglucosamine (GlcNAc). It is a major constituent of fungal cell walls, the exoskeletons of insects, and the shells of crustaceans. Glucosamine (GlcN),4 which is derived from the hydrolysis of deacetylated chitin (chitosan), has a variety of biological functions and, thus, has been used as a food additive and in medicines. Exo-β-d-glucosaminidase (EC 3.2.1.165) catalyzes the hydrolysis of the β(1–4) linkage of chitosan oligosaccharides to remove a GlcN residue from the non-reducing termini, and retaining enzymes of this type are classified into glycosidase hydrolase (GH) subfamilies 2 and 35 (according to the Carbohydrate Active Enzymes (CAZy) database (1)). This enzyme is found in bacteria and archaea and has been thoroughly investigated because of its ability to produce monomeric GlcN.

The role of exo-β-d-glucosaminidase in the chitin catabolic pathway of hyperthermophilic archaea has been defined. The degradation of chitin into diacetylchitobiose (GlcNAc)2 is initiated by chitinase (ChiA) (EC 3.2.1.14), and this product is then deacetylated at its nonreducing GlcNAc residue by deacetylase (Dac) (EC 3.5.1.-) (2). The resulting product, GlcN-GlcNAc, is subsequently hydrolyzed into GlcN and GlcNAc by exo-β-d-glucosaminidase, and the remaining GlcNAc is further deacetylated to GlcN by Dac (2, 3). To understand these enzymes' catalysis and adaptation to extreme high temperature, we had previously determined the structures of ChiA (46) and Dac (7); however, the structure of exo-β-d-glucosaminidase remained unknown. To date, two exo-β-d-glucosaminidases from hyperthermophilic archaea, which are called GlmA, have been described: GlmATk from Thermococcus kodakaraensis KOD1 (3) and GlmAPh from Pyrococcus horikoshii (8). The sequence identity between GlmATk and GlmAPh is 63%, and both enzymes show the same substrate specificities and exist as dimers in solution, suggesting that their tertiary structures and catalytic mechanisms are probably identical.

GlmA belongs to the GH35 subfamily of the GH-A superfamily, which is the largest GH superfamily and contains 19 subfamilies. All members of this superfamily include a TIM-barrel fold as a catalytic domain that contains two carboxylic acids that function as an acid/base catalyst (9, 10). Most characterized GH35 enzymes are β-galactosidases (EC 3.2.1.23), which hydrolyze the β(1–3) and β(1–4) galactosyl bonds in oligosaccharides. Interestingly, the sequence of GlmA has homology with parts of GH35 and GH42 β-galactosidases, although GlmA does not exhibit β-galactosidase activity (3). The highly conserved motifs around the catalytic residues of these β-galactosidases are not conserved in GlmA (3). Furthermore, GlmATk was found to exhibit weak β-glucosidase activity in addition to its major β-glucosaminidase activity (3). The only determined structure of an exo-β-d-glucosaminidase among GH-As is that from the bacteria Amycolatopsis orientalis (CsxA), a member of the GH2 subclass (11). However, GlmA is distinct from CsxA in its substrate specificity and oligomerization state (3, 11), and it shows low sequence similarity. These results suggest that GlmAs might have a unique active site structure that does not resemble that of CsxA.

To further clarify the existing knowledge regarding these enzymes, we determined the structures of GlmAPh and GlmATk, which are the first reported archaeal exo-β-d-glucosaminidase structures. The high-resolution structure of the product complex also reflects the unique structural features of GlmA that link the molecular evolution of GH35 and GH42 β-galactosidases in GH-A.

Results

Overall Structure

The structure of GlmAPh was solved using the single-wavelength anomalous dispersion of selenomethionine atoms and refined at 2.6 Å resolution (Table 1). The structure of GlmATk in complex with its reaction product, GlcN, was determined at 1.27 Å resolution using molecular replacement with the structure of the GlmAPh monomer as the search model (Table 1). GlmAPh and GlmATk showed almost identical tertiary structures, with a root mean square deviation (RMSD) of 0.90 Å for 775 Cα atoms (Fig. 1A). Therefore, we describe the highest-resolution structure of GlmATk throughout this report, unless otherwise noted. In the structure of GlmATk, the asymmetric unit contains two polypeptides (chains A and B) corresponding to a non-crystallographic 2-fold axis (Fig. 1B). This dimer assembly is consistent with the data obtained from gel filtration chromatography, suggesting that the crystal structure of GlmATk corresponds to the biologically relevant form of the protein. The monomer structure of GlmATk contains three distinct domains (Fig. 1B). The N-terminal domain is a (β/α)8 barrel, or TIM-barrel, domain (residues 1–435). It contains the catalytic machinery and is a common structure among glycoside hydrolases. The second domain is an α/β fold domain with central β-sheets and α-helices (residues 436–648). The C-terminal domain is a β-fold domain with antiparallel β-sheets (residues 649–786). These three domains of chains A and B form extensive interactions with all of the domains of the other chain. Because the fraction of buried atoms and their interactions (salt bridges, ion networks, and hydrogen bonds) represent features that might be important for thermostability (12), we explored the dimer interface of GlmATk using PISA (Protein Interfaces, Surfaces, Assemblies) (13) software. Our analysis showed that a large surface area of 5530 Å2 is buried in the structure and that 29 hydrogen bonds and 16 salt bridges are created upon dimer formation. These findings indicate that both monomers are intimately associated, which might contribute to the high thermostability of GlmATk.

TABLE 1.

Data collection and refinement statistics

Values in parentheses are for the highest-resolution shell. SeMet, selenomethionine.

graphic file with name zbc014176319t001.jpg

a Outer shell (1.42−1.40 Å).

b Outer shell (1.29−1.27 Å).

c Rmerge = ΣhklΣi|Ii(hkl) − 〈I(hkl)〉 |/ΣhklΣi Ii (hkl), where Ii (hkl) is the scaled intensity of the ith observation of reflection hkl. 〈I(hkl)〉 is the mean value and summation over all measurements.

d Rwork = ΣhΣi||Fo|−|Fc||/Σ|Fo|.

e Rfree is Rwork for approximately 5% of the reflections that were excluded from refinement.

FIGURE 1.

FIGURE 1.

The overall structure of GlmA. A, the structural superposition of GlmAPh (cyan) and GlmATk (orange) is shown as a ribbon diagram. B, the dimer structure of GlmATk is presented in two views. GlmATk consists of a homodimer (chains A and B) and comprises three distinct domains (TIM-barrel (red), α/β (blue), and β (green)). The bound GlcN is shown as a sphere. Arrow, N-terminal β-sheet (residues 4–13).

Structural Comparison with Other Glycoside Hydrolases

Structural similarity searches using the Dali server with the whole protein revealed that the dimer structure of GlmATk does not resemble those of any others. However, surprisingly, the monomer structure of GlmATk shares substantial similarity with GH42 β-galactosidase, whereas the TIM-barrel domain is found to be more similar to the architecture of GH35 β-galactosidases rather than to that of GH42; this observation will be explained below.

First, despite low sequence similarities (15–17%), the three domain structures of monomer GlmATk could be readily superimposed on four GH42 β-galactosidases with Z scores >25 and RMSD values of 2.6–3.0 Å for equivalent Cα atoms, except for 80 residues of the C-terminal region of GlmATk (Fig. 2A). This result indicates that these β-galactosidases are spatially homologous to GlmATk at the level of individual domain folds and domain orientation, although their quaternary structures differ from that of GlmATk (Fig. 3A) and belong to different GH families within the GH-A family. In contrast, the residues in the TIM-barrel domain of GlmATk (residues 1–435) have the highest similarities with four GH35 β-galactosidases with Z scores >36, RMSD values of 1.8–2.3 Å (Fig. 2B), and sequence identities of 22–32%; however, the overall structures of these β-galactosidases are distinct from that of GlmATk (Fig. 3B). For GH42 β-galactosidases, when only the TIM-barrel domain was compared, the RMSD values and the sequence identities were 2.3–2.9 Å and 13–18%, respectively. Based on these values, the TIM-barrel domain of GlmATk is more similar to that of GH35 than to that of GH42. As described below, the active site architecture of GlmATk bears a close resemblance to GH35 β-galactosidase. The unique structural features of GlmATk are similar to those reported previously; the amino acid sequence of the N-terminal and the central region of GlmATk showed homology to GH35 and GH42 β-galactosidases, respectively (3). From a structural perspective, these results also suggest that GlmATk is a common ancestor.

FIGURE 2.

FIGURE 2.

Structural comparisons of GlmATk and other glycoside hydrolases. A, superimposed models of GlmATk and GH42 β-galactosidases. A DALI (39) search for structural similarity for the monomer structure (chain A) of GlmATk identified four GH42 β-galactosidases: Thermus thermophilus A4-β-gal (Z = 25.5; RMSD = 3.0 Å; PDB code 1KWK (40)), Bacillus circulans sp. Bca-β-gal (Z = 27.8; RMSD = 3.0 Å; PDB code 3TTY (41)), Geobacillus stearothermophilus Gan42B (Z = 26.2; RMSD = 2.7 Å; PDB code 4OIF (24)), and Bifidobacterium animalis BlGal42A (Z = 29.4; RMSD = 2.6 Å; PDB 4UNI (42)). The structures are colored as follows: GlmATk (red), A4-β-gal (light yellow), Bca-β-gal (cyan), Gan42B (light blue), and BlGal42A (light green). The figure is drawn from the same orientation as the right panel of Fig. 1B. The right panel shows the sequence alignment around the catalytic residues (red) of the GH42 β-galactosidases by ClustalW (43). B, superimposed models of the TIM-barrel domain (chain A) of GlmATk and GH35 β-galactosidases. A DALI search was conducted on the residues in the TIM-barrel domain of GlmATk (residues 1–435) only, which resulted in a top hit consisting of four GH35 β-galactosidases: Homo sapiens Hs-β-gal (Z = 38.3; RMSD = 1.8 Å; PDB code 3THC (19)), T. reesei Tri-β-gal (Z = 36.8; RMSD = 2.2 Å; PDB code 3OGR (14)), Penicillium sp. Psp-β-gal (Z = 36.7; RMSD = 2.3 Å; PDB code 1XC6 (20)), and Streptococcus pneumoniae BgaC (Z = 36.2; RMSD = 2.0 Å; PDB code 4E8C (21)). The TIM-barrel structures are colored as follows: GlmATk (red), Hs-β-gal (light yellow), Tri-β-gal (light blue), Psp-β-gal (cyan), and BgaC (light green). The right panel shows the sequence alignment around the catalytic residues (red) of the GH35 β-galactosidases by ClustalW (43). Chain A and chain B are quite similar, with an overall RMSD of 0.43 Å over 786 Cα atoms; thus, superposition of chain B against GH42 β-galactosidases and the TIM-barrel domains of GH35 β-galactosidases showed almost the same RMSD values as those of chain A. Accordingly, for simplicity, only the results of chain A are shown in the figure.

FIGURE 3.

FIGURE 3.

The overall structure of GH42 and GH35 β-galactosidases. A, superposition of the trimer forms of GH42 β-galactosidases, A4-β-gal (light yellow), Bca-β-gal (cyan), Gan42B (light blue), and BlGal42A (light green). Comparing the structure of Gan42B with the other three structures revealed high structural similarities, with RMSD values of 1.2–2.0 Å (24). The right panel shows the dimer forms of GlmATk. The structures of the GH42 β-galactosidases are shown in the same orientation as the TIM-barrel domain of GlmATk. B, the superposition of the GH35 β-galactosidases, Tri-β-gal (light blue), Psp-β-gal (cyan), Hs-β-gal (light yellow), and BgaC (light green). Psp-β-gal and Hs-β-gal possess similar overall structures of three domains (TIM-barrel domain with β3- and β4-domains), whereas Tr-β-gal and Psp-β-gal contain two additional domains (β1- and β2-domains). The superposition of Tri-β-gal against Psp-β-gal, Hs-β-gal, and BgaC yielded RMSD values of 0.9 Å (for 947 Cα atoms), 2.4 Å (for 479 Cα atoms), and 2.3 Å (for 480 Cα atoms), respectively, indicating that these proteins possess similar overall structures, except for the two β-domains (β1- and β2-domain) of Tr-β-gal and Psp-β-gal. The right panel shows the monomer forms of GlmATk. The structures of the GH35 β-galactosidases are shown in the same orientation as the TIM-barrel domain of GlmATk.

The Active Site Similarities between GlmATk and GH35 β-Galactosidase

The 1.27 Å resolution structure of the product complex reveals an unambiguous electron density for a single molecule of GlcN, which was observed in the deep pocket within the TIM-barrel domain of each monomer (Fig. 4A). To provide insight into the substrate recognition of GlmATk, we compared the TIM-barrel structure of GlmATk with that of the GH35 β-galactosidase from Trichoderma reesei (Tri-β-gal) in complex with galactose (PDB entry 3OGR (14)) because its structure has the highest resolution (1.5 Å) among all GH35 enzymes with known structures whose functional residues in the active site are highly conserved and those with some conservative substitutions in the family (data not shown).

FIGURE 4.

FIGURE 4.

Active site comparisons of GlmATk and Tri-β-gal. A, the active site of GlmATk site in complex with GlcN. The hydrogen bonds between the side chains of GlmATk (cyan stick) and GlcN (yellow stick with hydroxyl or amine group numbers) are shown in dashed lines. Oxygen, nitrogen, and sulfur atoms are depicted in red, blue, and gold, respectively. The FoFc electron density (2∂ level) is contoured in green around GlcN. B, the superposition of the monomer structure of GlmATk (TIM-barrel (red), α/β (blue), and β (green)) and Tri-β-gal (gray). The figure is drawn in the same orientation as in Fig. 2B. The right panel shows a close-up of the active site. Shown are the amino acids of GlmATk (cyan stick with red labels) and Tri-β-gal (gray stick with black labels) in complex with GlcN (yellow stick) and galactose (green stick), respectively. C and D, schematic diagram showing the interaction of GlmATk with GlcN (C) and Tri-β-gal with galactose (D). Dashed lines, hydrogen bonds with the interatomic distance (in Å). The unique residues of GlmATk and Tri-β-gal are shown in red and blue letters, respectively. E–H, close-up views of the interaction between the protein and ligand shown in the right panel of B. E, catalytic residues and products. F, Asp178 (GlmATk), the N2 atom of GlcN, Asn199 (Tri-β-gal), and the O2 of galactose. G, Tyr379 (GlmATk), the equatorial O4 of GlcN, Asn140 (Tri-β-gal), and the axial O4 atom of galactose. H, Glu306 (GlmATk) and the O1 atom of GlcN.

Interestingly, the superimposition of the TIM-barrel domain of GlmATk with Tri-β-gal revealed a high degree of structural similarity between the −1 subsites of these proteins (Fig. 4B), although their substrates are different. Both the GlcN and galactose molecules adopt a chair conformation with their C1 hydroxyl group (O1) in the β-anomer configuration. They sit in almost the same position and form direct hydrogen bonds with eight residues. Four of the eight residues involved in direct substrate binding in GlmATk (Tyr53 (interacting with O3), Glu103 (O4, O6), Glu179 (O1), and Glu347 (N2)) could be superimposed onto Tyr96 (O3), Glu142 (O4, O6), Glu200 (O1), and Glu298 (O2) of Tri-β-gal, respectively, resulting in almost identical protein-carbohydrate interactions with no substantial differences in the interatomic distances (Fig. 4, C and D). Gly102 of GlmATk forms a hydrogen bond with O3 (2.9 Å) of GlcN via its main-chain amide, whereas the structurally equivalent residue of Tri-β-gal is Ala141, which fulfills the same function through a hydrogen bond to O3 (2.9 Å) of galactose (Fig. 4, C and D). Therefore, this replacement is a conservative substitution. In addition, Trp308 of GlmATk, which is involved in a hydrophobic stacking interaction with the planar face of the GlcN moiety, overlaps well with Tyr260 of Tri-β-gal (Fig. 4B). The component important for the recognition of the GlmATk substrate was not conserved in GH42 β-galactosidase (data not shown), despite the structural similarity of their monomers.

Among the substrate-binding residues described above, Glu179 and Glu347 of GlmATk are supposed to be catalytic residues, and the steric counterparts of Tr-β-gal are the acid/base Glu200 and the nucleophile Glu298, respectively (Fig. 4E). Glu179 forms a hydrogen bond with O1 (2.7 Å) and is oriented toward the glycosidic oxygen, whereas Glu347 forms a hydrogen bond with N2 (2.8 Å) and is positioned to serve as a catalytic nucleophile (Fig. 4, C and E). Consistent with the predicted roles of these residues, the mutation E347Q virtually inactivated the enzyme, whereas the mutation E179Q retained less than ∼3% of residual hydrolysis activity (Fig. 5, A and B) (Table 2). In the measurement performed here, acetate was produced as one of the reaction products (Fig. 5A) and could act as a nucleophile (1517). Therefore, the retained activity of E179Q might be ascribed to the chemical rescue of acetate. In addition, Glu179 and Glu347 are located in the β4- and β7-strands of the TIM-barrel domain, respectively, and the average distance between the oxygen atoms of these residues is 4.8 Å. This is consistent with the common structural features of the retaining enzyme in GH-A (9). Thus, these results strongly suggest that GlmATk hydrolyzes its substrate in a double displacement retaining mechanism using the acid/base residue Glu179 and nucleophilic residue Glu347 similarly to the GH35 enzymes characterized thus far. The catalytic center, which is almost entirely conserved, indicates the close evolutionary relationship between these enzymes.

FIGURE 5.

FIGURE 5.

The activities of wild-type and mutant GlmAs. A, mechanism of action of GlmA hydrolase using Dac deacetylase. First, (GlcNAc)2 is deacetylated at its nonreducing GlcNAc residue by Dac. The resulting product, GlcN-GlcNAc, is then hydrolyzed into GlcN and GlcNAc by GlmA, and the remaining GlcNAc is further deacetylated to GlcN by Dac. Peaks I and II are those shown in B and C. B and C, representative NMR spectra of the methyl proton regions. The left spectra in both B and C correspond to the reaction mixtures of Dac with (GlcNAc)2. The other spectra represent the subsequent reaction with GlmATk (wild type, D178N, E179Q, E306Q, and E347Q) (B) and GlmAPh (wild type, D180N, E181Q, E308Q, and E349Q) (C) as their counterparts in GlmATk, shown in the right panel of Fig. 2, A and B, in the presence of Dac. Peaks I and II are assigned to the N-acetyl group of GlcN-GlcNAc and the methyl group of the resulting acetic acid, respectively, here and in the reaction scheme shown in A.

TABLE 2.

Catalytic activity of GlmA and mutants

Enzyme Relative activitya
%
GlmATk
    Wild type 100
    D178N 0
    E179Q 3
    E306Q 40
    E347Q 0

GlmAPh
    Wild type 100
    D180N 0
    E181Q 10
    E308Q 15
    E349Q 0

a The activities of the mutants are calculated from the relative peak I area against that of the wild type.

Based on their sequence alignment (Fig. 2, A and B), the acid/base residue Glu179 of GlmATk aligns with those of GH35 and GH42 β-galactosidases. The nucleophile Glu347 of GlmATk also aligns with those of GH42 β-galactosidases. However, it could not be aligned with those of GH35 β-galactosidases. Together with the fact that the highly conserved motifs around the catalytic residues of GH35 and GH42 β-galactosidases are not conserved in GlmATk (3), these results indicate that the locations of the catalytic residues predicted based on the sequence comparisons are uncertain and unreliable. Our structure determination of GlmATk combined with structure-guided mutagenesis studies facilitated identifying the catalytic residues accurately.

The Discrimination of GlcN from Galactose by GlmATk

Despite the high structural similarities at the active site, radical differences were observed in the rest of the substrate-binding residues of GlmATk: Asp178, Tyr379, and Glu306. The former two residues correlated with the chemical structure of GlcN. GlcN and galactose differ in terms of the substituent at C2 and the chirality of C4 and C6. The major difference is the substituent at C2, which is an amine group (N2) in GlcN and a hydroxyl group (O2) in galactose. Asn199 of Tri-β-gal, which precedes the acid/base residue Glu200, forms a hydrogen bond (2.9 Å) with O2 (Fig. 4, D and F), and this Asn-Glu motif is highly conserved in GH35 and GH42 β-galactosidases (Fig. 2, A and B). The equivalent motif in GlmATk is Asp178-Glu179 (the acid/base), and Asp178 forms a hydrogen bond (2.7 Å) with N2 (Fig. 4, C and F). The pKa of N2 in GlcN is reported to be 7.4 (18); therefore, at pH 6.0, at which the activity of GlmATk is maximized (3), the N2 of GlcN will be in its protonated NH3+ form. Additionally, the side chain of Asp178 will be negatively charged based on its average pKa of ∼3.7. To confirm the importance of the acidic character of the carboxyl group of Asp178, we mutated Asp178 to asparagine. As a result, the D178N mutant dramatically lost its catalytic activity (Fig. 5B) (Table 2), implying that charge-charge complementarity is indispensable for the interaction between Asp178 and the N2 of GlcN. The necessity of this interaction is also supported by a previous report indicating that GlmATk has very weak β-glucosidase activity (3). Chemically, glucose differs from GlcN only at the C2 of the pyranose ring, which contains a hydroxyl group (O2), indicating that the absence of a charged interaction between Asp178 and O2 of glucose should cause a profound loss of β-glucosidase activity. Additionally, GlmATk could not hydrolyze (GlcNAc)2 at all (3). GlcNAc also differs from GlcN only at the C2 substituent, which is replaced by a bulky acetoamido group; thus, the presence of GlcNAc at the −1 subsite cannot be tolerated because of its steric clash with Asp178.

An additional difference between GlcN and galactose is the chirality of O4, which is equatorial in GlcN and axial in galactose. Tyr379 of GlmATk forms a hydrogen bond (2.8 Å) with the equatorial O4 of GlcN (Fig. 4, C and G) and also serves as the lateral face of the hydrophobic pocket to accommodate GlcN. Surprisingly, Tyr379 could be superimposed onto Tyr342 of Tri-β-gal (Figs. 4, B and G), which is a strictly conserved residue in GH35 β-galactosidases. Tyr342 packs against the C4 atom of galactose in a similar manner as Tyr379 of GlmATk; however, its position is too distant (∼4.6 Å) to form a hydrogen bond with the axial O4 of galactose. Instead of Tyr342, Asn140 of Tri-β-gal is in a suitable position to form a hydrogen bond (2.8 Å) with the axial O4 (Fig. 4, D and G). Likewise, Asn140 could be structurally superimposed onto Cys101 of GlmATk (Fig. 4, B and G), and other GH35 β-galactosidases, such as BgaC and Hs-β-gal, also have Cys residues at the same position. In BgaC, the counterpart Cys96 forms a hydrogen bond with the axial O4 of galactose via its thiol group (19). Therefore, Cys101 of GlmATk is supposed to be a conservative substitution. However, it is located 4.8 Å away from the equatorial O4 of GlcN, preventing hydrogen bond formation. Briefly, GlmATk and GH35 β-galactosidases possess residues with the potential to form hydrogen bonds with the axial and equatorial forms of O4 in the glycosidic substrate, respectively, thereby contributing to the recognition of GlcN or galactose. Therefore, we defined these Cys (Asn) and Tyr residues as “evolutionary heritage residues.” To our knowledge, this is the first time such a heritage was seen in different functional glycoside hydrolases.

The other unique substrate-binding residue of GlmATk is Glu306, which forms a hydrogen bond (2.9 Å) with O1 (Fig. 4, C and H) and is important for maximal catalytic activity; this was confirmed by determining that the mutation E306Q decreased the enzymatic activity ∼2.5-fold (Fig. 5B) (Table 2). In Tri-β-gal, Asp258 is located at this position (Fig. 4, B and H); however, it cannot form a hydrogen bond with the O1 in galactose because of its side chain orientation (at a distance of ∼4.6 Å). Because the configuration of O1 of GlcN and galactose is a β-anomer, Glu306 may contribute to transition state stabilization rather than substrate recognition. In contrast, as described above, the O6 hydroxyl group of GlcN and galactose form hydrogen bonds with Glu103 of GlmATk and Glu142 of Tri-β-gal, respectively, despite the different chirality of the O6 hydroxyl group (Fig. 4, C and D). However, Tyr364 of Tri-β-gal forms an additional hydrogen bond with the equatorial O6 of galactose (Fig. 4D), whereas the corresponding residue is absent in GlmATk. Tyr364 is well conserved in other GH35 β-galactosidases (14, 1921), indicating that it contributes to the recognition of galactose, but not in a major way.

These data indeed suggest that Asp178, Glu306, and Tyr379 of GlmATk play an important role in the recognition or stabilization of the GlcN molecule. However, as explained above, Asp178 is supposed to be the most important residue responsible for the recognition of GlcN.

Additionally, the sequence alignment between GlmATk and GlmAPh illustrated that the catalytic residues, together with the other key residues of GlmATk discussed above, were strictly conserved in GlmAPh (data not shown) and are consisted with the mutagenesis analyses of GlmAPh (Fig. 5C)(Table 2). These findings suggest that their catalytic mechanisms and substrate profiles are probably identical.

The Dimer Structure Influences Substrate Specificity

In the TIM-barrel domain, one of the ends of the β-barrel is closed by the N-terminal β-sheet (residues 4–13) (Fig. 1B), whereas the other end, termed the “catalytic face” (22), is buried within a deep and narrow pocket upon dimer formation. The most notable feature of the dimer interface is that the 310-helix of the α/β-domain protrudes toward the catalytic face of the adjacent monomer, interacting via some hydrogen bonds and a salt bridge. These interactions involve Arg563, Asn565, and Arg567 of the α/β-domain and Asp132, Tyr134, Tyr135, and Gln188 of the TIM-barrel domain (Fig. 6A). This interaction decreased the size of the active site entrance, and the α/β-domain and the TIM-barrel domain of chains A and B create one large cavity, which abuts the active site pockets of both monomers (Fig. 6A). As previously stated, the opposite side of the catalytic face is closed; thus, this cavity is the only means of entry or egress for the substrate or product. The depth of the active site pocket from the center of the cavity is ∼20 Å (Fig. 6B), which could restrict the access of lengthy substrates. Consistently, GlmATk showed higher activity against GlcN2 (∼12 Å in length) than against longer molecules (N>2) (3). These results suggest that dimer formation is essential for GlmATk to exhibit enzymatic activity and to form an active site with an appropriate shape. Conversely, CsxA, which is the only other exo-β-d-glucosaminidase with a known structure, can hydrolyze oligomeric substrates ranging from GlcN2 to GlcN6 with similar efficiencies (11). CsxA is a member of the GH2 family of the GH-A group and functions as a monomeric enzyme with its active site easily accessible to the solvent. However, its domain organization differs substantially from that of GlmATk (data not shown). This structural feature of CsxA is suitable for longer molecules.

FIGURE 6.

FIGURE 6.

The dimerization interface and the deep active site pocket of GlmATk. A, surface representation of the GlmATk dimer. Two monomers are colored gray and green. An arrow indicates the cavity created by dimerization. The inset is an expanded view of the part of the dimer interface composed of Asp132, Tyr134, Tyr135, and Gln188 (chain A) and Arg563, Asn565, and Arg567 (chain B) with hydrogen bonds (black dashed lines). The figure is drawn from the same orientation as the left panel in Fig. 1B. B, a section drawing of the GlmATk dimer in the plane containing the GlcN molecules (red sticks) in the active site. The figure is drawn in the same orientation as in A.

Discussion

The structure of GlmATk provides new insights into the structural composition and substrate recognition mechanisms of different enzymes and, thus, their molecular evolution. Briefly, a monomeric form of GlmATk shares substantial structural similarity with GH42 β-galactosidases, whereas a high number of conserved active site residues are shared with GH35 β-galactosidase, allowing GlmATk to discriminate glucosamine from galactose based on a subtle difference in the structure of β-galactosidase bound to galactose. Indeed, Asp178 of GlmATk plays an essential role in the discrimination of GlcN from galactose, whereas the equivalent in GH35 β-galactosidase is an Asn residue. To the best of our knowledge, this is the first observation of such a high degree of conservation within the entire catalytic centers of different enzymes. In addition, the evolutionary heritage residues, which have the potential to form hydrogen bonds with the axial and equatorial forms of O4 in the glycosidic substrate, respectively, are an interesting finding that emphasizes the high evolutionary conservation of these enzymes. These structural features strongly suggest that GlmA is a common ancestor of these β-galactosidases, as discussed below.

The active sites of glycoside hydrolases are classified into three types: cleft type, tunnel type, and pocket type (23). Both GlmA and GH42 β-galactosidases have a cleft-type active site in their monomeric forms; however, the shape of the active site changes to a pocket type upon oligomerization, which can better accommodate smaller substrates (24) (Fig. 6). Thus, oligomerization is a key factor for size-based substrate specificity and the high stability of these proteins. β-Galactosidase may have evolved from a prototypical single TIM-barrel domain with a cleft- or tunnel-type active site, and then, during the subsequent process of modifying the active site to prefer a smaller substrate, extra domains were added to change the active site from a cleft to a pocket type (25). As described above, the monomer structure and a part of the sequence of GlmA show similarity to those of GH42 β-galactosidase, suggesting that GH42 β-galactosidase might have emerged from the evolutionary branch that originated from GlmA in the oldest organisms, archaea, and then differentiated into other members of the glycoside hydrolase family. Additionally, the frameworks of their monomer structure (i.e. the domain organization) might be suitable or necessary for oligomerization. However, the substrate-binding residues of GH42 enzymes are not conserved in GlmA (data not shown), excluding GlmA from being classified into the GH42 family, and the underlying evolutionary selection pressures that led to this diversity in the active site remain unknown. In contrast, the residues within the active site pocket are well conserved between GlmA and GH35 β-galactosidase, suggesting that GH35 β-galactosidase evolved from archaeal exo-β-d-glucosaminidase through gene duplication. Enzyme substrate ambiguity is probably the starting point for the evolution of divergent enzymes through gene duplication (26, 27). Consistently, GlmATk exhibits broad substrate specificity, showing weak hydrolytic activities toward various β-disaccharides (such as cellobiose and laminaribiose) in addition to its major β-glucosaminidase activity (3). Thus, the promiscuous activities of GlmATk might have developed through mutations that affected the subsequent functional adaptation of the newly emergent β-galactosidases (which favored β-galactoside) while retaining the original substrate-binding residues and the catalytic machinery. However, the highly conserved active site residues of both GlmA and GH35 β-galactosidase indicate that there has been relatively weak evolutionary pressure on the catalytic center to convert the enzyme to perform different functions. As described before, GlmA and GH35 and GH42 β-galactosidases belong to the same GH-A “superfamily.” A superfamily is a group that shows significant similarities in the tertiary structure together with conservation of the catalytic residues and mechanism, and its members are therefore considered to have a common ancestry (28). Accordingly, our finding that GlmA shares structural and mechanistic features with both the GH35 and GH42 β-galactosidases strongly suggested that GlmA is a common ancestor of these β-galactosidases.

Taken together, our results suggest that GH35 and GH42 β-galactosidases have evolved by taking advantage of the structural features of GlmA. The structural information reported here for GlmA could be used to design a new enzyme, such as a thermostable β-galactosidase or β-glucosidase, by subtly changing the active site residues in GlmA.

Experimental Procedures

Protein Expression and Purification

The genes encoding GlmAPh (residues 1–778, GenBankTM accession number PH_RS02375) and GlmATk (residues 1–786, GenBankTM accession number AB100422) were codon-optimized for expression in E. coli and synthesized (Eurofins NMG Operon). The constructs for GlmAPh and GlmATk were cloned into a pCold-II vector (Takara Bio) and a pET-32b vector (Merck Millipore), respectively, with an N-terminal PreScission protease cleavage site followed by a hexahistidine tag using the NdeI and EcoRI restriction sites. The resulting vectors were transformed into E. coli Rosetta (DE3)pLysS (Merck Millipore). For GlmAPh expression, the cells were grown at 37 °C in lysogeny broth (LB) medium until an A600 nm of 0.5 was reached, and then the cultivation was continued for 24 h at 15 °C. The overexpression of GlmATk was induced with 1.0 mm isopropyl-β-d-galactosidase after an A600 nm of 0.5 was reached, and the culture was incubated for 4 h at 37 °C. The cells were harvested by centrifugation at 5,000 × g for 15 min and stored at −20 °C. For protein purification, the cells were disrupted by sonication in buffer A (50 mm Tris-HCl, 0.5 m NaCl, pH 8.0). The cell debris and insoluble proteins were removed by centrifugation at 15,000 × g for 30 min after heat treatment at 80 °C for 30 min. The supernatant was loaded onto a nickel-nitrilotriacetic acid (GE Healthcare) column equilibrated with buffer A, and the bound protein was eluted with buffer A containing 0.5 m imidazole. Subsequently, the eluted protein was dialyzed against 50 mm Tris-HCl (pH 8.0) in the presence of PreScission protease at 4 °C overnight. The cleaved tag was removed with a second nickel-nitrilotriacetic acid purification, and the flow-through was further purified using a HiTrap Q HP column (GE Healthcare) with a linear gradient of 0–1.0 m NaCl in 50 mm Tris-HCl (pH 8.0). The further purified protein was then subjected to a HiLoad 26/600 Superdex 200 preparation grade column (GE Healthcare) equilibrated with buffer (20 mm Tris-HCl, 150 mm NaCl, pH 8.0), concentrated to 10 mg/ml, and stored at −80 °C. The point mutants of GlmAPh and GlmATk were generated using site-directed mutagenesis. All mutant proteins were expressed and purified in the same manner as the wild-type proteins. The expression levels and isolated yields of these mutants were comparable with those of the wild-type enzyme.

Crystallography

After many crystallization trials of GlmAPh and GlmATk, only GlmAPh produced crystals. The crystals of GlmAPh were grown at 20 °C using the sitting drop vapor diffusion method in 100 mm Tris-HCl (pH 8.5) and 20% (w/v) polyethylene glycol (PEG) 1000 with a protein/reservoir volume ratio of 1:1. Crystals appeared within a week. Therefore, we prepared selenomethionine-substituted GlmAPh according to the procedure described under “Protein Expression and Purification,” except the cells were grown in LeMaster broth (29). The selenomethionine-incorporated protein was crystallized using the same conditions as for the native protein. Next, the co-crystallization of GlmAPh and GlmATk with GlcN (10, 50, and 100 mm) was performed using commercial crystallization screens. However, we obtained crystals for only the GlmATk-GlcN complex, not the GlmAPh-GlcN complex. Among the several sets of conditions attempted for GlmATk-GlcN complex crystallization, well diffracting crystals of the GlmATk-GlcN complex were obtained via sitting drop vapor diffusion at 20 °C in 100 mm sodium propionate-sodium cacodylate-Bistris propane buffer (pH 7.0), 25% (w/v) PEG 1500, and 50 mm GlcN. Crystals of GlmAPh (SeMet and native) and GlmATk-GlcN were cryoprotected with 15% (v/v) glycerol and 25% (w/v) PEG 400, respectively, and subsequently flash-frozen in liquid nitrogen. All data sets were collected on a BL44XU instrument at SPring-8 (Harima, Japan) with a MX300HE detector (Raynonix) under a cryostream at 90 K, and the data were processed and scaled using the HKL-2000 program suite (30). Data sets for native and SeMet GlmAPh were collected at wavelengths of 0.9 and 0.97898 Å, respectively, with a single-wavelength anomalous dispersion of selenium atoms (peak). The initial phases of the GlmAPh structure were determined at 3.5 Å resolution from the data of SeMet GlmAPh using Phenix (31), and then the native data were phase-extended to 2.6 Å. The GlmAPh model was built using Phenix. Further model building was performed with Coot (32). The structure was refined by CNS (Crystallography and Nuclear Magnetic Resonance (NMR) System) (33) and REFMAC (Refinement of Macromolecular Structures) (34) with rigid body refinement. Data sets of the GlmATk-GlcN complex were also collected at a wavelength of 0.9 Å. The initial phases of the GlmATk-GlcN complex model were solved using Phaser (35) with the structure of GlmAPh as a search model, and then the model building and refinement and the addition of water molecules were performed using ARP/wARP (36). Further model building and refinements were performed using Coot and REFMAC with individual anisotropic B-factor value refinement, respectively. The final models of the GlmAPh and GlmATk-GlcN complexes were validated by MolProbity (37).

Enzymatic Activities of GlmATk and GlmAPh Mutants

The hydrolase activities of the GlmAs were detected using 1H NMR spectroscopy. NMR experiments were conducted at 35 °C on a Varian Inova (600 MHz) (Palo Alto, CA) equipped with a z-gradient, triple-resonance TR probe. The chemical shifts were referenced to an internal standard: 4,4-dimethyl-4-silapentane-1-sulfonic acid. The substrate GlcN-GlcNAc for the GlmAs was prepared as follows. 1.6 mm (GlcNAc)2 was incubated with 5 μm Dac for 10 min in 50 mm potassium phosphate buffer (pH 6.0) containing 10% (w/w) D2O. After the first reaction was completed, the subsequent reaction was initiated by adding wild-type GlmAs or mutants to a final concentration of 5 μm in the presence of Dac, and the spectra were collected after 150 min. One of the resulting products, GlcNAc, was immediately deacetylated to GlcN by Dac; thus, the NMR spectrum of the N-acetyl group of GlcN-GlcNAc was expected to disappear when the GlmA reaction was complete (Fig. 5A). The water signal was suppressed using the Watergate pulse sequence (38). The one-dimensional 1H signals consisted of 8,192 sampling points covering a spectral width of 15 ppm. The relaxation delay was set at 1 s, and 64 scans were accumulated for each spectrum. This acquisition was repeated every 1.45 s. The chemical shifts were assigned as described previously (7).

Author Contributions

S. M. designed the research; S. M., M. W., and Y. A. performed the research; S. M., M. W., S. K., Y. A., and T. U. analyzed the data; S. M. and M. W. wrote the paper.

Acknowledgments

We thank K. Uechi and H. Akita for experimental support. We thank the beamline staff at BL44XU of SPring-8 for technical assistance during data collection. Synchrotron experiments were performed at SPring-8 under the approval of the Japan Synchrotron Radiation Research Institute under proposal numbers 2014B6953, 2014B6903, 2015A6546, 2015A6559, 2015B6559, 2015B6546, 2016A6645, and 2016B6657.

*

This work was supported in part by Grant-in-Aid for Scientific Research 25450143 from the Japan Society for the Promotion of Science (to S. M.). The authors declare that they have no conflicts of interest with the contents of this article.

The atomic coordinates and structure factors (codes 5GSL and 5GSM) have been deposited in the Protein Data Bank (http://wwpdb.org/).

4
The abbreviations used are:
GlcN
glucosamine
GH
glycosidase hydrolase
GlmATk
archaeal exo-β-d-glucosaminidases from T. kodakaraensis KOD1
GlmAPh
archaeal exo-β-d-glucosaminidase from P. horikoshii
RMSD
root mean square deviation
SeMet
selenomethionine
Bistris propane
1,3-bis[tris(hydroxymethyl)methylamino]propane
PDB
Protein Data Bank.

References

  • 1. Lombard V., Golaconda Ramulu H., Drula E., Coutinho P. M., and Henrissat B. (2014) The carbohydrate-active enzymes database (CAZy) in 2013. Nucleic Acids Res. 42, D490–D495 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 2. Tanaka T., Fukui T., Fujiwara S., Atomi H., and Imanaka T. (2004) Concerted action of diacetylchitobiose deacetylase and exo-β-d-glucosaminidase in a novel chitinolytic pathway in the hyperthermophilic archaeon Thermococcus kodakaraensis KOD1. J. Biol. Chem. 279, 30021–30027 [DOI] [PubMed] [Google Scholar]
  • 3. Tanaka T., Fukui T., Atomi H., and Imanaka T. (2003) Characterization of an exo-β-d-glucosaminidase involved in a novel chitinolytic pathway from the hyperthermophilic archaeon Thermococcus kodakaraensis KOD1. J. Bacteriol. 185, 5175–5181 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4. Nakamura T., Mine S., Hagihara Y., Ishikawa K., and Uegaki K. (2007) Structure of the catalytic domain of the hyperthermophilic chitinase from Pyrococcus furiosus. Acta Crystallogr. Sect. F Struct. Biol. Cryst. Commun. 63, 7–11 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 5. Nakamura T., Mine S., Hagihara Y., Ishikawa K., Ikegami T., and Uegaki K. (2008) Tertiary structure and carbohydrate recognition by the chitin-binding domain of a hyperthermophilic chitinase from Pyrococcus furiosus. J. Mol. Biol. 381, 670–680 [DOI] [PubMed] [Google Scholar]
  • 6. Mine S., Nakamura T., Sato T., Ikegami T., and Uegaki K. (2014) Solution structure of the chitin-binding domain 1 (ChBD1) of a hyperthermophilic chitinase from Pyrococcus furiosus. J. Biochem. 155, 115–122 [DOI] [PubMed] [Google Scholar]
  • 7. Mine S., Niiyama M., Hashimoto W., Ikegami T., Koma D., Ohmoto T., Fukuda Y., Inoue T., Abe Y., Ueda T., Morita J., Uegaki K., and Nakamura T. (2014) Expression from engineered Escherichia coli chromosome and crystallographic study of archaeal N,N′-diacetylchitobiose deacetylase. FEBS J. 281, 2584–2596 [DOI] [PubMed] [Google Scholar]
  • 8. Liu B., Li Z., Hong Y., Ni J., Sheng D., and Shen Y. (2006) Cloning, expression and characterization of a thermostable exo-β-d-glucosaminidase from the hyperthermophilic archaeon Pyrococcus horikoshii. Biotechnol. Lett. 28, 1655–1660 [DOI] [PubMed] [Google Scholar]
  • 9. Henrissat B., Callebaut I., Fabrega S., Lehn P., Mornon J. P., and Davies G. (1995) Conserved catalytic machinery and the prediction of a common fold for several families of glycosyl hydrolases. Proc. Natl. Acad. Sci. U.S.A. 92, 7090–7094 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10. Henrissat B., and Davies G. (1997) Structural and sequence-based classification of glycoside hydrolases. Curr. Opin. Struct. Biol. 7, 637–644 [DOI] [PubMed] [Google Scholar]
  • 11. van Bueren A. L., Ghinet M. G., Gregg K., Fleury A., Brzezinski R., and Boraston A. B. (2009) The structural basis of substrate recognition in an exo-β-d-glucosaminidase involved in chitosan hydrolysis. J. Mol. Biol. 385, 131–139 [DOI] [PubMed] [Google Scholar]
  • 12. Zheng B., Yang W., Zhao X., Wang Y., Lou Z., Rao Z., and Feng Y. (2012) Crystal structure of hyperthermophilic endo-β-1,4-glucanase: implications for catalytic mechanism and thermostability. J. Biol. Chem. 287, 8336–8346 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13. Krissinel E., and Henrick K. (2007) Inference of macromolecular assemblies from crystalline state. J. Mol. Biol. 372, 774–797 [DOI] [PubMed] [Google Scholar]
  • 14. Maksimainen M., Hakulinen N., Kallio J. M., Timoharju T., Turunen O., and Rouvinen J. (2011) Crystal structures of Trichoderma reesei β-galactosidase reveal conformational changes in the active site. J. Struct. Biol. 174, 156–163 [DOI] [PubMed] [Google Scholar]
  • 15. Wang Q., Trimbur D., Graham R., Warren R. A., and Withers S. G. (1995) Identification of the acid/base catalyst in Agrobacterium faecalis β-glucosidase by kinetic analysis of mutants. Biochemistry 34, 14554–14562 [DOI] [PubMed] [Google Scholar]
  • 16. Richard J. P., Huber R. E., Heo C., Amyes T. L., and Lin S. (1996) Structure-reactivity relationships for β-galactosidase (Escherichia coli, lac Z). 4. Mechanism for reaction of nucleophiles with the galactosyl-enzyme intermediates of E461G and E461Q β-galactosidases. Biochemistry 35, 12387–12401 [DOI] [PubMed] [Google Scholar]
  • 17. Schwarz A., Brecker L., and Nidetzky B. (2007) Acid-base catalysis in Leuconostoc mesenteroides sucrose phosphorylase probed by site-directed mutagenesis and detailed kinetic comparison of wild-type and Glu237 → Gln mutant enzymes. Biochem. J. 403, 441–449 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18. Beecher C. N., and Larive C. K. (2015) 1H and 15N NMR characterization of the amine groups of heparan sulfate related glucosamine monosaccharides in aqueous solution. Anal. Chem. 87, 6842–6848 [DOI] [PubMed] [Google Scholar]
  • 19. Ohto U., Usui K., Ochi T., Yuki K., Satow Y., and Shimizu T. (2012) Crystal structure of human β-galactosidase: structural basis of Gm1 gangliosidosis and morquio B diseases. J. Biol. Chem. 287, 1801–1812 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 20. Rojas A. L., Nagem R. A., Neustroev K. N., Arand M., Adamska M., Eneyskaya E. V., Kulminskaya A. A., Garratt R. C., Golubev A. M., and Polikarpov I. (2004) Crystal structures of β-galactosidase from Penicillium sp. and its complex with galactose. J. Mol. Biol. 343, 1281–1292 [DOI] [PubMed] [Google Scholar]
  • 21. Cheng W., Wang L., Jiang Y. L., Bai X. H., Chu J., Li Q., Yu G., Liang Q. L., Zhou C. Z., and Chen Y. (2012) Structural insights into the substrate specificity of Streptococcus pneumoniae β(1,3)-galactosidase BgaC. J. Biol. Chem. 287, 22910–22918 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 22. Sterner R., and Höcker B. (2005) Catalytic versatility, stability, and evolution of the (βα)8-barrel enzyme fold. Chem. Rev. 105, 4038–4055 [DOI] [PubMed] [Google Scholar]
  • 23. Davies G., and Henrissat B. (1995) Structures and mechanisms of glycosyl hydrolases. Structure 3, 853–859 [DOI] [PubMed] [Google Scholar]
  • 24. Solomon H. V., Tabachnikov O., Lansky S., Salama R., Feinberg H., Shoham Y., and Shoham G. (2015) Structure-function relationships in Gan42B, an intracellular GH42 β-galactosidase from Geobacillus stearothermophilus. Acta Crystallogr. D Biol. Crystallogr. 71, 2433–2448 [DOI] [PubMed] [Google Scholar]
  • 25. Juers D. H., Huber R. E., and Matthews B. W. (1999) Structural comparisons of TIM barrel proteins suggest functional and evolutionary relationships between β-galactosidase and other glycohydrolases. Protein Sci. 8, 122–136 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 26. Khersonsky O., and Tawfik D. S. (2010) Enzyme promiscuity: a mechanistic and evolutionary perspective. Annu. Rev. Biochem. 79, 471–505 [DOI] [PubMed] [Google Scholar]
  • 27. Pandya C., Farelli J. D., Dunaway-Mariano D., and Allen K. N. (2014) Enzyme promiscuity: engine of evolutionary innovation. J. Biol. Chem. 289, 30229–30236 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 28. Henrissat B., and Bairoch A. (1996) Updating the sequence-based classification of glycosyl hydrolases. Biochem. J. 316, 695–696 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 29. Hendrickson W. A., Horton J. R., and LeMaster D. M. (1990) Selenomethionyl proteins produced for analysis by multiwavelength anomalous diffraction (MAD): a vehicle for direct determination of three-dimensional structure. EMBO J. 9, 1665–1672 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 30. Terwilliger T. C., and Berendzen J. (1999) Automated MAD and MIR structure solution. Acta Crystallogr. D Biol. Crystallogr. 55, 849–861 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 31. Adams P. D., Afonine P. V., Bunkóczi G., Chen V. B., Davis I. W., Echols N., Headd J. J., Hung L. W., Kapral G. J., Grosse-Kunstleve R. W., McCoy A. J., Moriarty N. W., Oeffner R., Read R. J., Richardson D. C., et al. (2010) PHENIX: a comprehensive Python-based system for macromolecular structure solution. Acta Crystallogr. D Biol. Crystallogr. 66, 213–221 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 32. Emsley P., and Cowtan K. (2004) Coot: model-building tools for molecular graphics. Acta Crystallogr. D Biol. Crystallogr. 60, 2126–2132 [DOI] [PubMed] [Google Scholar]
  • 33. Brünger A. T., Adams P. D., Clore G. M., DeLano W. L., Gros P., Grosse-Kunstleve R. W., Jiang J. S., Kuszewski J., Nilges M., Pannu N. S., Read R. J., Rice L. M., Simonson T., and Warren G. L. (1998) Crystallography & NMR system: A new software suite for macromolecular structure determination. Acta Crystallogr. D Biol. Crystallogr. 54, 905–921 [DOI] [PubMed] [Google Scholar]
  • 34. Murshudov G. N., Vagin A. A., and Dodson E. J. (1997) Refinement of macromolecular structures by the maximum-likelihood method. Acta Crystallogr. D Biol. Crystallogr. 53, 240–255 [DOI] [PubMed] [Google Scholar]
  • 35. McCoy A. J., Grosse-Kunstleve R. W., Adams P. D., Winn M. D., Storoni L. C., and Read R. J. (2007) Phaser crystallographic software. J. Appl. Crystallogr. 40, 658–674 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 36. Langer G., Cohen S. X., Lamzin V. S., and Perrakis A. (2008) Automated macromolecular model building for X-ray crystallography using ARP/wARP version 7. Nat. Protoc. 3, 1171–1179 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 37. Word J. M., Lovell S. C., LaBean T. H., Taylor H. C., Zalis M. E., Presley B. K., Richardson J. S., and Richardson D. C. (1999) Visualizing and quantifying molecular goodness-of-fit: small-probe contact dots with explicit hydrogen atoms. J. Mol. Biol. 285, 1711–1733 [DOI] [PubMed] [Google Scholar]
  • 38. Piotto M., Saudek V., and Sklenár V. (1992) Gradient-tailored excitation for single-quantum NMR spectroscopy of aqueous solutions. J. Biomol. NMR 2, 661–665 [DOI] [PubMed] [Google Scholar]
  • 39. Holm L., and Sander C. (1993) Protein structure comparison by alignment of distance matrices. J. Mol. Biol. 233, 123–138 [DOI] [PubMed] [Google Scholar]
  • 40. Hidaka M., Fushinobu S., Ohtsu N., Motoshima H., Matsuzawa H., Shoun H., and Wakagi T. (2002) Trimeric crystal structure of the glycoside hydrolase family 42 β-galactosidase from Thermus thermophilus A4 and the structure of its complex with galactose. J. Mol. Biol. 322, 79–91 [DOI] [PubMed] [Google Scholar]
  • 41. Maksimainen M., Paavilainen S., Hakulinen N., and Rouvinen J. (2012) Structural analysis, enzymatic characterization, and catalytic mechanisms of β-galactosidase from Bacillus circulans sp. alkalophilus. FEBS J. 279, 1788–1798 [DOI] [PubMed] [Google Scholar]
  • 42. Viborg A. H., Fredslund F., Katayama T., Nielsen S. K., Svensson B., Kitaoka M., Lo Leggio L., and Abou Hachem M. (2014) A β1–6/β1–3 galactosidase from Bifidobacterium animalis subsp. lactis Bl-04 gives insight into sub-specificities of β-galactoside catabolism within Bifidobacterium. Mol. Microbiol. 94, 1024–1040 [DOI] [PubMed] [Google Scholar]
  • 43. Larkin M. A., Blackshields G., Brown N. P., Chenna R., McGettigan P. A., McWilliam H., Valentin F., Wallace I. M., Wilm A., Lopez R., Thompson J. D., Gibson T. J., and Higgins D. G. (2007) Clustal W and Clustal X version 2.0. Bioinformatics 23, 2947–2948 [DOI] [PubMed] [Google Scholar]

Articles from The Journal of Biological Chemistry are provided here courtesy of American Society for Biochemistry and Molecular Biology

RESOURCES