Skip to main content
Scientific Reports logoLink to Scientific Reports
. 2018 Jan 16;8:865. doi: 10.1038/s41598-018-19192-6

The crystal structure of D-xylonate dehydratase reveals functional features of enzymes from the Ilv/ED dehydratase family

Mohammad Mubinur Rahman 1, Martina Andberg 2, Anu Koivula 2, Juha Rouvinen 1, Nina Hakulinen 1,
PMCID: PMC5770437  PMID: 29339766

Abstract

The Ilv/ED dehydratase protein family includes dihydroxy acid-, gluconate-, 6-phosphogluconate- and pentonate dehydratases. The members of this family are involved in various biosynthetic and carbohydrate metabolic pathways. Here, we describe the first crystal structure of D-xylonate dehydratase from Caulobacter crescentus (CcXyDHT) at 2.7 Å resolution and compare it with other available enzyme structures from the IlvD/EDD protein family. The quaternary structure of CcXyDHT is a tetramer, and each monomer is composed of two domains in which the N-terminal domain forms a binding site for a [2Fe-2S] cluster and a Mg2+ ion. The active site is located at the monomer-monomer interface and contains residues from both the N-terminal recognition helix and the C-terminus of the dimeric counterpart. The active site also contains a conserved Ser490, which probably acts as a base in catalysis. Importantly, the cysteines that participate in the binding and formation of the [2Fe-2S] cluster are not all conserved within the Ilv/ED dehydratase family, which suggests that some members of the IlvD/EDD family may bind different types of [Fe-S] clusters.

Introduction

There is increasing interest in developing economical and sustainable bioprocesses to convert biomass into valuable organic compounds. Metabolic engineering and synthetic biology have enabled the development of optimized microbial strains for the production of target compounds by taking advantage of known pathways and enzymes. To fully exploit the possibilities offered by nature, a more detailed molecular-level understanding of key metabolic enzymes is required. One important group of these enzymes belongs to the IlvD/EDD protein family, which is involved in the pathways used to produce biofuels such as ethanol, butanol, isobutanol, and 3-methyl-1-butanol as well as platform chemicals such as isobutyric acid, 1,2,4-butanetriol and D-pantothenic acid (vitamin B5)16.

Enzymes belonging to the IlvD/EDD protein family are widely distributed among bacteria, archaea, fungi, and algae and in higher plant species712. The family includes dihydroxy acid dehydratases (IlvD, DHADHT, DHAD; EC 4.2.1.9), which are involved in short-chain amino acid biosynthesis and catalyse the dehydration of 2,3-dihydroxy-3-methylbutanoate13, and aldonic acid dehydratases, which are involved in the metabolism of hexose and pentose sugars via the Entner-Doudoroff (ED) pathway or a modified ED pathway1418. Pentonate dehydratases from the IlvD/EDD family participate in the non-phosphorylative oxidation pathways of pentose sugars in which D-xylonate dehydratase (XyDHT) and L-arabinonate dehydratase (ArDHT) catalyse the dehydration of D-xylonate and L-arabinonate, respectively1721. D-gluconate dehydratases (GDHT, GnaD) and 6-phosphogluconate dehydratases (6PGDHT) are involved in the metabolism of glucose via the ED pathway. These two enzymes catalyse the dehydration of gluconate and 6-phosphogluconate, respectively16,22. These enzymes all function similarly, by catalysing an elimination reaction in which a hydroxyl group at the C3 position and a proton at the C2 position are cleaved, resulting in the loss of a water molecule (Fig. 1).

Figure 1.

Figure 1

The reactions catalysed by IlvD/EDD enzymes. The identical reactive regions in different substrates are highlighted in red.

Enzymes from the IlvD/EDD family are suggested to contain a [Fe-S] cluster as a co-factor in the active site19. Some enzymes from this family have been characterized by using electron paramagnetic resonance (EPR) and UV-Vis spectroscopic methods to explore the structural features of the [Fe-S] cluster and its role in enzyme activity19,20,2325. Based on these studies, two different catalytically active forms of [Fe-S] clusters have been found. A catalytically active [4Fe-4S] cluster that is unstable in aerobic environments has been reported in 6PGDHT from Zymomonas mobilis26 and in DHADHT from Escherichia coli23. In contrast, a functional and stable [2Fe-2S] cluster in DHADHTs from several organisms, including bacteria, archaea, fungi and plants, has been observed24,25.

A detailed structural and functional characterization of these enzymes is important for understanding the catalytic mechanism and the substrate specificity, which could help improve the synthetic metabolic pathways in biorefinery industries. However, to date, only limited three-dimensional structural information is available for the IlvD/EDD enzymes. The coordinates of Shewanella oneidensis 6PGDHT in the apo form have been submitted to the Protein Data Bank (PDB code: 2GP4) but are not yet published. Moreover, this 6PGDHT structure is incompleted and partially incorrect since the original 2GP4 model contains a mixed β-sheet at N-terminal domain, but a corrected model is composed of parallel β-sheet. The model also lacks the cofactor and several surrounding loops in the active site. Recently, we solved the first representative of the IlvD/EDD family in its holo form, an L-arabinonate dehydratase from Rhizobium leguminosarum bv trifolii (RlArDHT, PDB code: 5J84). This structure revealed the presence of a [2Fe-2S] cluster and a Mg2+ ion in the active site27.

In this paper, we describe the first crystal structure of a D-xylonate dehydratase at 2.7 Å resolution and compare it with the two other known crystal structures (PDB codes: 2GP4 and 5J84) belonging to the IlvD/EDD family. D-xylonate dehydratase from Caulobacter crescentus (CcXyDHT) shares 42% sequence identity with RlArDHT, and the crystal structure of CcXyDHT shows that this enzyme has a [2Fe-2S] and a Mg2+ ion in its active site, similar to RlArDHT. The three different enzyme structures available from the IlvD/EDD enzyme family now allow the analysis of the structure and functional features within this protein family.

Results and Discussion

The re-refined structure of So6PGDHT

The crystal structure of 6-phosphogluconate dehydratase from Shewanella oneidensis (So6PGDHT) at 2.5 Å resolution (Rwork = 0.23 and Rfree = 0.29) is available in the Protein Data Bank (PDB code: 2GP4), but the structure lacks a [Fe-S] cluster and a divalent metal ion. When inspecting the electron density map of So6PGDHT, we found several obscured regions in the crystal structure. Therefore, the 2GP4 model was re-refined, which resulted in improved R-values (Rwork = 0.17 and Rfree = 0.23). However, the improved model of 6PGDHT still had missing regions, Val35-Leu63, Lys182-Ile185 and Thr215-Thr226, and it was lacking density for the [Fe-S] cluster or Mg2+ ion in the active site. The re-refinement resulted in new locations for the active site residues Cys112 and Asp113. In the re-refined model, the putative iron-sulphur cluster binding residues Cys112 and Cys154 are sufficiently close to ligate a [Fe-S] cluster. The location of the third putative cluster-binding residue Cys220 is undefined because, it is located in the missing loop, the structure of which is unknown due to the absence of electron density.

Overall structure of CcXyDHT

The crystal structure of holo-CcXyDHT was determined at 2.7 Å resolution by molecular replacement, using L-arabinonate dehydratase from Rhizobium leguminosarum bv trifolii (5J84) as a template. The final structure had good R-factor and R-free values of 17.8% and 22.3%, respectively. The structure contained a tetramer in the asymmetric unit in the space group C2. Unambiguous electron density was seen for all of the amino acid residues of CcXyDHT, except for the disordered region at the N-terminus, which includes Asp1-Ser2 and the N-terminal Strep-tag II. The refinement resulted in a final model with good refinement statistics (Table 1).

Table 1.

Data collection and refinement statistics. The structure was determined from a single crystal. The value in parentheses are for the highest-resolution shell.

Data collection CcXyDHT
Beam line I04, DLS, England
Wavelength (Å) 0.979497
Resolution (Å) 59.12 − 2.66 (2.76 − 2.66)
Space group C 1 2 1
Cell dimension
a, b, c (Å) 270.41, 236.14, 65.17
α, β, γ (°) 90, 97.38, 90
Total reflections 393801 (60419)
Unique reflections 115354 (11560)
Multiplicity 3.4 (3.4)
Completeness (%) 99.54 (99.80)
Mean I/σ (I) 17.2 (2.1)
Wilson B-factor (Å2) 59.91
R-meas (%) 7.0 (82.1)
CC 1/2 99.9 (95.8)
Refinement statistics
Reflections used in refinement 110218 (10961)
Reflections used for R-free 5509 (546)
R-work 0.1769 (0.3295)
R-free 0.2174 (0.3814)
Number of non-hydrogen atoms 18336
Protein 17992
Solvent 324
Fe/S (cluster) 8/8
Mg 4
Protein residues 2354
RMS deviations
Bond length (Å) 0.007
Bond angles (°) 0.95
Ramachandran analysis (%)
Favoured 95.6
Allowed 3.9
Outliers 0.5
Rotamer outliers 0.0
Clashscore 8.33
Average B-factor (Å 2 ) 67
Protein 68
Solvent 58
Fe/S (cluster) 79/71
Mg 83
PDB ID 5OYN

The enzyme is an α/β protein that consists of two domains: a N-terminal domain (residues Asn3-Leu358) and a C-terminal domain (residues Leu383-His591) (Fig. 2a,b). The two domains are connected by a long loop from Gln359 to Phe382. The N-terminal domain is composed of a β-sheet with four parallel β-strands surrounded by four α-helices. In addition, the N-terminal domain contains an extension at the N-terminus (residues Arg10 - Ser43) with three α-helices and an insertion with β-hairpin (residues Gly157 – Val164) and a helix-loop-helix (residues Thr168 – Ser194). The C-terminal domain is composed of a β-sheet consisting of six parallel and two anti-parallel β-strands that are arranged like a β-barrel. Secondary structure analysis showed that the complete polypeptide chain consists of 27 α-helices and 16 β-strands. Each monomer contains one [2Fe-2S] cluster and a Mg2+ ion. The substrate binding site is located in the cavity between the domains, but the [Fe-S] cluster and the Mg2+ binding site are located entirely in the N-terminal domain (Fig. 2c).

Figure 2.

Figure 2

Structure of CcXyDHT and comparison of the dimers; the N- and C-terminal domains are shown in red and in blue, respectively. A long loop connecting the domains is shown in green. The [2Fe-2S] cluster is shown with yellow and orange spheres, and Mg2+ is shown as a green sphere. (a) N-terminal domain, (b) C-terminal domain, (c) monomer, (d) tetramer, (e) the dimer of CcXyDHT in which the monomeric unit is shown in colours and the dimeric counterpart is in grey, and (f) the dimer of So6PGHDT. The key features on the monomer-monomer interface are marked with arrows.

Quaternary structure

The quaternary structure of CcXyDHT is a homotetramer (Fig. 2d), similar to the homotetrameric structure of RlArDHT. D-gluconate dehydratase from Achromobacter xylosoxidans has also been reported to form a tetramer in solution22. A packing analysis of the crystal structure using the PISA server28 suggested that the quaternary structure contains a stable dimer with an extensive monomer-monomer interface (4640 Å2). This interface is predominantly formed by residues from the larger-sized N-terminal domain. An α-helix (residues 99–115) from this domain forms the central part of the interface. The tight dimer then packs against another similar dimer in such a way that one monomer has an interface with both polypeptide chains from the second dimer (887 and 607 Å2). The total monomer-monomer interface area in the tetramer is 12268 Å2.

In contrast, SoPGDHT forms a dimer in the crystal structure that is similar to the tight dimer of CcXyDHT (Fig. 2e,f). However, unlike CcXyDHT, SoPGDHT does not form a tetrameric structure. Many IlvD/EDD enzymes such as L-arabonate dehydratase from Azospirillum brasiliense20, dihydroxy acid dehydratase from Spinacia oleracea24, and 6-phosphogluconate dehydratase from Zymomonas mobilis have been reported to form dimers in solution29.

The formation of the extensive dimer interface in CcXyDHT (and in RlArDHT) is essential for the formation of the active site. Although the residues that participate in the binding of Mg2+ and the [2Fe-2S] cofactor are located in the N-terminal domain of the first monomer, the polypeptide chain from the second monomer covers the active site by extending its N-terminal helix (residues 21–33) into the first monomer. Because the amino acid residues of this helix, which point towards the active site, are different in CcXyDHT and RlArDHT, this helix may be called a substrate recognition helix. In addition, the C-terminus of the dimeric counterpart protrudes into the active site of the first monomer by placing His591, which forms a salt bridge with Glu463, in the active site (Fig. 2e). The crystal structure of So6PGDHT shows similar dimer formation to that observed in CcXyDHT but does not show any evidence that the polypeptide chain from the dimeric counterpart participates in the formation of the active site. In contrast, the So6PGDHT structure represents the apo form, and some of the loop regions are not visible in the electron density map due to disorder in the absence of cofactor (Fig. 2f).

Structural comparison of three IlvD/EDD enzymes

The sequence of CcXyDHT is 42% identical to RlArDHT and 27% identical to So6PGDHT. A sequence alignment of these three enzymes and of representative members of the IlvD/EDD family is shown in Fig. 3. The higher sequence identity between CcXyDHT and RlArDHT, compared to So6PGDHT, is also reflected in a low, 0.70 Å RMS deviation between the Cα atoms of monomer proteins. The overall structures are very similar except for a small insertion of a β-hairpin at Leu409-Val417 in CcXyDHT. The [2Fe-2S] cluster and Mg2+ ion coordinating residues are well conserved between these enzymes. The improved model of So6PGDHT can be superimposed with CcXyDHT with a RMSD of 1.95 Å. The N-terminal core domain consisting of a four-stranded parallel β-sheet surrounded by α-helices and the C-terminal core containing a mixed eight-stranded β-sheet are similar. However, the large RMSD value between So6PGDHT and CcXyDHT is due to the following major differences between the structures: 1) the N-terminal regions, 2) the helix-loop-helix motifs and 3) the β-hairpin structures (Fig. 4).

Figure 3.

Figure 3

Multiple sequence alignment of IlvD/EDD enzymes. CcXyDHT is a D-xylonate dehydratase from Caulobacter crescentus (GeneBank: ANS60449), RlArDHT is an L-arabinonate dehydratase from Rhizobium leguminosarum bv. trifolii (GeneBank: ANS60454), SmDHADHT is a dihydroxy acid dehydratase from Streptococcus mutans (GeneBank: KZM62800), SoDHADHT is a dihydroxy acid dehydratase from Spinacia oleracea (GeneBank: KNA20834), EcDHADHT is a dihydroxy acid dehydratase from Escherichia coli (GeneBank: CDZ22543), CgDHADHT is a dihydroxy acid dehydratase from Corynebacterium glutamicum (GeneBank: BAV23085), CaDHADHT is a dihydroxy acid dehydratase from Clostridium acetobutylicum (GeneBank: WP_010966867), and Zm6PGDHT and So6PGDHT are 6-phosphogluconate dehydratases from Zymomonas mobilis (GeneBank: WP_013934127) and Shewanella oneidensis (GeneBank: WP_011072452). The N- and C-terminal domain are shown in red and blue, connecting loop in green. The [2Fe-2S] cluster coordinating residues are shown in yellow, Mg2+ ion coordinating residues are in magenta and serine that play role as Lewis base is shown in orange, respectively. Cys-Asp-Gly motif is shown in cyan. Black stars represent the fully conserved residues.

Figure 4.

Figure 4

The monomeric structures of IlvD/EDD enzymes: (a) CcXyDHT, (b) RlArDHT, and (c) So6PGDHT. The N-terminal domain is in red, the C-terminal domain in blue, and the connecting region is shown in green. The undefined regions in the structure of So6PGDHT are shown as dashed lines. The arrows indicate the major differences between the structures: 1) the N-terminal region, 2) the flipped helix-loop-helix structure, and 3) the β-hairpin insertion of So6PGDHT in the connecting region. (d) The location of sequence motifs 1 and 2 in the CcXyDHT monomer. The conserved residues in both motifs are in purple, and other residues are in grey.

The N-termini of CcXyDHT and So6PGDHT are very different. In CcXyDHT, this region (residues 1–45) forms a bundle of three short helices, but in So6PGDHT, this region is longer (residues 1–65) and contains two long α-helices, which have a very different orientation and pack against the core protein in an extended fashion. A part of the N-terminal region (35–63) is undefined in the structure of So6PGDHT. Perhaps the most remarkable structural difference between the proteins is the location of the Pro186-Ser214 region that contains a helix-loop-helix motif structure at the N-terminal domain of So6PGDHT. The helix-loop-helix in SoPGDHT has been flipped and moves by approximately 30 Å from the position observed in CcXyDHT (and in RlArDHT), where it partially covers the active site. Interestingly, in an open form of an RlArDHT mutant co-crystallized with L-arabinonate, this helix-loop-helix structure is slightly displaced by 7 Å, which suggests that it has a functional role in the catalytic cycle27.

In So6PGDHT, the helix-loop-helix region is twisted away from the active site. Unfortunately, the preceding loop (residues 182–185) and the following loop (residues 215–226) regions are disordered in the So6PGDHT structure, so how accessible the active site of So6PGDHT is compared to those of CcXyDHT and RlArDHT remains an open question. The third difference between So6PGDHT and CcXyDHT is an additional β-hairpin structure (residues 383–396) in the connecting loop of So6PGDHT. This packs against the abovementioned helix-loop-helix structure (Fig. 4a–c).

Sequence motifs to identify the IlvD/EDD family

We have analysed the multiple amino acid sequence alignment of some representative members of the IlvD/EDD family. The sequence alignment is presented in Fig. 3, including the sequences of the three enzymes (CcXyDHT, RlArDHT, So6PGDHT) for which three-dimensional structure information is available. The description of the IlvD/EDD protein family in the PROSITE database30 reports that this family has two common sequence motifs. The first sequence motif consists of eleven residues with a peptide pattern Cys-Asp-Lys-x(2)-Pro-[Gly/Ala]-x(3)-[Gly/Ala], where Cys-Asp-Lys and Pro are conserved residues. The second sequence motif consists of twelve residues with a peptide pattern [Ser/Gly/Ala/Leu/Cys]-[Leu/Ile/Met/Phe]-[Leu/Ile/Val/Met/Phe]-Thr-Asp-[Gly/Ala]-Arg-[Leu/Ile/Val/Met/Phe/Tyr]-Ser-[Gly/Ala]-[Gly/Ala/Val]-[Ser/Thr], where Thr-Asp, Arg and Ser (without brackets) are conserved residues.

These two motifs can be found in the CcXyDHT structure (Figs 3, 4d). The first motif is Cys-Asp-Lys-Thr-Thr-Pro-Ala-Gly-Ile-Met-Ala at the N-terminal domain (residues 128–138). These eleven residues are located in the helix α6 and the loop connecting the strand β3 and helix α6. The conserved Cys128 participates in the binding of the [2Fe-2S] cluster, Asp129 and Lys130 (in carbamylated form) in the binding of Mg2+.

The second motif is Pro-Thr-Leu-Gly-Asp-Gly-Arg-Gln-Ser-Gly-Thr-Ala (residues 482–493) in CcXyDHT. These twelve residues are located at the C-terminal domain consisting of strand β13 and the loop follows. Interestingly, the first two and last two residues of this motif in CcXyDHT do not match the predicted sequence motif. The proposed conserved threonine is not conserved in CcXyDHT and is replaced by Gly485. The conserved Asp486 forms hydrogen bonds with the main chain nitrogens of Gly452 and Ala453 in the loop structure. The conserved Arg488 is located rather close (4.3 Å) to the Mg2+ but probably does not directly participate in the binding of magnesium; consequently, its role may be electrostatic. Ser490, which is proposed to act as a base in enzyme catalysis, as supported by experimental results from the corresponding Ser480Ala mutant of RlArDHT, is fully conserved among the IlvD/EDD protein family27.

The active site of CcXyDHT

The subsequent analysis of the active site is based solely on the crystal structures of the holo forms of CcXyDHT and RlArDHT and not on the structure of So6PGDHT because the latter structure represents an incomplete structure in the apo form. The electron density map of the CcXyDHT crystal structure shows that the binding of the Mg2+ and the [2Fe-2S] cluster to the active site is similar to that observed in RlArDHT27.

The hexacoordination of the Mg2+ ion resembles a square bipyramid in which Glu92, Asp129, Glu463 and a water (or hydroxide ion) are approximately in the plane and another water molecule (or a hydroxide ion) and carbamylated Lys130 are in the axial positions. One water molecule in the square plane is hydrogen-bonded (2.7 Å) to Thr206. Due to the obtained resolution, we cannot fully exclude the possibility that Thr206 may also directly interact with the Mg2+ ion. However, the observed distance from Thr206 to the Mg is approximately 3.8 Å. Identical residues can be found in the structure of RlArDHT and all four Mg2+ binding residues (and Thr206) are fully conserved within the protein family (Fig. 3), suggesting that the binding of magnesium is a universal feature of the IlvD/EDD enzyme family. The crystal structure of CcXyDHT unambiguously shows that Lys130 is carbamylated (Fig. 5c,d) as is also found in the crystal structure of RlArDHT27. The residues Asp129 and Lys130 (or more correctly KCX130) belong to the N-terminal sequence motif. The complete structures of CcXyDHT and RlArDHT support an interpretation that carbamylation of this lysine is a common feature among IlvD/EDD enzymes.

Figure 5.

Figure 5

The active site of CcXyDHT. (a) The mFo-DFc omit maps (green) contoured at 3σ level for the [2Fe-2S] cluster; and (b) The 2mFo-DFc electron density maps (blue) contoured at 1σ level for the Mg2+ ion and the binding residues; (c) the active site residues, which participate in the binding of the cofactors and the putative substrate binding residues are shown. The protein backbone cartoon colour is green for the first monomer and cyan for the second monomer; (d) The superimposition of CcXyDHT (in green and cyan) and RlArDHT (grey). The most likely binding site for the substrate is circled with a red ellipse.

The crystal structure of CcXyDHT suggests that the enzyme has a [2Fe-2S] cluster, similar to that observed in RlArDHT. In the cluster, the Fe1 atom is coordinated by two sulfide ions and two cysteines (Cys60 and Cys128). Cys128 belongs to the Cys-Asp-Lys motif, which is fully conserved among IlvD/EDD enzymes. Interestingly, Cys60 is conserved only among the subfamily of IlvD/EDD enzymes including pentonate dehydratases and some dihydroxy acid dehydratases19. In 6PGDHT, the putative corresponding cysteine belongs to a Cys-Asp-Gly motif (Fig. 1), which is not observed in pentonate dehydratases. This may reflect the organization of a different cluster type, for example, the binding of [2Fe-2S] versus [4Fe-4S] in the active site. In CcXyDHT, Fe2 atom of the cluster is coordinated by two sulfide ions and one cysteine residue (Cys200). This cysteine residue is fully conserved within the IlvD/EDD protein family.

Residues Ser490 and Thr492 from the C-terminal sequence motif participate in the formation of the substrate binding site (Fig. 5c,d). Based on the crystal structure and mutagenesis studies of RlArDHT, we have previously suggested that serine acts as a Lewis base in catalysis27. In CcXyDHT, Ser490 is approximately 6 Å away from the [2Fe-2S] cluster and 4 Å away from the Mg2+ ion. As in RlArDHT, OG1 of Thr492 forms a short (and strong) hydrogen bond (2.6 Å) with OG of Ser490 in CcXyDHT. In addition, a hydrogen bond (3.2 Å) exists between the main chain nitrogen of Thr492 and OG1 of Ser490. Furthermore, NE1 of Trp171 is hydrogen bonded (2.9 Å) to OG1 of Thr492. This hydrogen bond network may increase the nucleophilicity of Ser490. Thr492 in CcXyDHT and the corresponding Thr482 in RlArDHT are not conserved within the IlvD/EDD protein family (replaced by Ala/Gly) as seen in Fig. 3. However, the next amino acid after Ala/Gly in the sequence within the members of this protein family is Ser/Thr, which could play a similar role to Thr492 in the CcXyDHT structure. The proposed elimination reaction is shown in Fig. 6, where [2Fe-2S] cluster act as a Lewis acid and the alkoxide ion form of Ser490 is a base. In the end, the enol product is tautomerized to the final C2 oxo product.

Figure 6.

Figure 6

Proposed reaction mechanism for D-xylonate dehydratase.

Molecular determinants for the substrate specificity

To understand the substrate specificity between CcXyDHT and RlArDHT, the active sites were superimposed. The preferred substrates for CcXyDHT and RlArDHT are stereoisomers of pentonates, having different configurations at the C4 position. The Km values for D-xylonate and L-arabinonate are in the mM range (2–10 mM) for both enzymes, but the kcat values differ more significantly. The catalytic efficiency kcat/Km is 120 times higher in CcXyDHT for D-xylonate compared to L-arabinonate, and 10 times higher in RlArDHT for L-arabinonate compared to D-xylonate. In addition to pentonate sugar acids, various hexonate sugar acids can be dehydrated, as long as the configurations at C2 and C3 are maintained19.

The active site superimposition of CcXyDHT and RlArDHT shows that the binding of the Mg2+ and the [2Fe-2S] cluster is very similar, even identical in both enzymes. However, the differences in the active site cleft are significant in the N-terminal helix (residues 21–33) from the dimeric counterpart. The N-terminal helix can be therefore identified as a substrate recognition helix, where Trp31 in RlArDHT is replaced by Arg30 in CcXyDHT. This Arg residue forms a salt-bridge with Glu29 in CcXyDHT (the corresponding residue in RlArDHT is Gly30). Tyr27 in RlArDHT is replaced by Leu26 in CcXyDHT, and consequently, the conformation of Asn92 is altered. In addition, both enzymes have a C-terminal histidine residue from the dimeric counterpart (His591 in CcXyDHT and His579 in RlArDHT) in the active site (Fig. 5d). The carboxylate group of the C-terminus is salt-bridged to Arg22 and Lys422 in RlArDHT and to Arg174 in CcXyDHT. The differences between the catalytic efficiency thus probably originate from the recognition helix and C-terminus, which highlights the important role of the dimerization.

Materials and Methods

Protein purification and crystallization

D-xylonate dehydratase from Caulobacter crescentus was heterologously expressed in E. coli, purified, and crystallized, and data were collected at 2.66 Å resolution using synchrotron radiation as described previously19,31. The data were processed and scaled using XDS and XSCALE32.

Structure determination and refinement

A crystal structure of CcXyDHT at 2.7 Å was determined by molecular replacement using PHASER33 with the coordinates of L-arabinonate dehydratase (PDB code 5J84) as a template. The model building was performed in COOT34. The refinement of the model was performed using PHENIX refine35. The validation of the refinement was done using MolProbity36. The re-refinement of So6PGDHT (PDB code 2GP4) was also done using the PHENIX software35.

UV-Vis spectroscopy

The presence of a [Fe-S] cluster in CcXyDHT was analysed using UV-Vis spectroscopy. A purified protein sample was diluted with a sample storage buffer consisting of 50 mM Tris-HCl and 5 mM MgCl2 at pH 7.5. The sample was loaded into a quartz cuvette and data were collected using a UV-Vis/NIR/900 spectrophotometer (PerkinElmer, USA) at wavelengths of 260–800 nm against sample storage buffer. The spectrum is shown in Figure S1.

Enzyme activity test

The CcXyDHT activity was assayed as described previously19. The purified enzyme (6 µg in a 600 µl reaction) was incubated at 30 °C in 50 mM Tris-Cl, 10 mM MgCl2, at pH 8.5 using 20 mM D-xylonate as substrate. At 2-min intervals, a 100-μl sample was transferred to a microcentrifuge tube and the reaction was stopped by adding 10 μl of 12% trichloroacetic acid (TCA). The product formation was measured using the thiobarbituric acid (TBA) assay37. Aliquots of 50 μl from all samples were transferred into fresh microcentrifuge tubes, after which 125 μl of 25 mM periodic acid (dissolved in 0.2 M H2SO4) was added and incubated at room temperature for 20 min. To the reaction, 250 μl of 2% sodium arsenate in 0.5 M HCl was added and mixed, followed by 1 ml of 0.3% TBA. The samples were then incubated at 100 °C for 10 min. Prior to reading the absorbance at 549 nm, the samples were mixed with an equal volume of DMSO.

Sequence alignment and structural comparisons

The amino acid sequences of the enzymes were collected from the NCBI GeneBank38. The sequence alignment was carried out using the online multiple sequence alignment tools Clustal Omega39 and MacVector40. The secondary structure analysis was carried out using the online tool STRIDE41. The alignment figure was created using ALINE42. The superimposition of the crystal structures and the figures of the three-dimensional enzyme structures were made using PyMOL43.

Data availability

The atomic coordinates and structure factors have been deposited in the Protein Data Bank under the accession code 5OYN.

Electronic supplementary material

Supplementary Information (287.7KB, docx)

Acknowledgements

We thank Dr. Steffi Goffart and Dr. Jaakko Pohjoismäki, Department of Biology, UEF for their assistance during the protein expression. We thank the Diamond Light Source (DLS) for providing synchrotron access. At UEF, this work received financial support from the Academy of Finland (Decisions 256937 and 292705). The work was part of the Finnish Centre of Excellence in the White Biotechnology-Green Chemistry programme (Academy of Finland Decision 118573) and the IV4SP project (Academy of Finland Decision 272598) at VTT. The work also received funding from the European Community’s Seventh Framework Programme (FP7/2007–2013) under BioStruct-X (Grant Agreement N° 283570).

Author Contributions

M.M.R. and M.A. expressed and purified the enzyme. M.M.R. and N.H. performed the crystallization, data collection, structure determination and refinement. All authors discussed the results and contributed to the final manuscript.

Competing Interests

The authors declare that they have no competing interests.

Footnotes

Electronic supplementary material

Supplementary information accompanies this paper at 10.1038/s41598-018-19192-6.

Publisher's note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

References

  • 1.Lang, K., Zierow, J., Buehler, K. & Schmid, A. Metabolic engineering of Pseudomonas sp. strain VLB120 as platform biocatalyst for the production of isobutyric acid and other secondary metabolites. Microb. Cell. Fact. 13, 2 (2014). [DOI] [PMC free article] [PubMed]
  • 2.Blombach B, Eikmanns BJ. Current knowledge on isobutanol production with Escherichia coli, Bacillus subtilis and Corynebacterium glutamicum. Bioeng. Bugs. 2011;2:346–350. doi: 10.4161/bbug.2.6.17845. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3.Li S, Wen J, Jia X. Engineering Bacillus subtilis for isobutanol production by heterologous Ehrlich pathway construction and the biosynthetic 2-ketoisovalerate precursor pathway overexpression. Appl. Microbiol. Biotechnol. 2011;91:577–589. doi: 10.1007/s00253-011-3280-9. [DOI] [PubMed] [Google Scholar]
  • 4.Connor MR, Liao JC. Engineering of an Escherichia coli strain for the production of 3-methyl-1-butanol. Appl. Environ. Microbiol. 2008;74:5769–5775. doi: 10.1128/AEM.00468-08. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 5.Niu W, Molefe MN, Frost JW. Microbial synthesis of the energetic material precursor 1,2,4-butanetriol. J. Am. Chem. Soc. 2003;125:12998–12999. doi: 10.1021/ja036391+. [DOI] [PubMed] [Google Scholar]
  • 6.Guterl JK, et al. Cell-free metabolic engineering: production of chemicals by minimized reaction cascades. ChemSusChem. 2012;5:2165–2172. doi: 10.1002/cssc.201200365. [DOI] [PubMed] [Google Scholar]
  • 7.Egan SE, et al. Molecular characterization of the Entner-Doudoroff pathway in Escherichia coli: sequence analysis and localization of promoters for the edd-eda operon. J. Bacteriol. 1992;174:4638–4646. doi: 10.1128/jb.174.14.4638-4646.1992. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8.Carsten JM, Schmidt A, Sieber V. Characterization of recombinantly expressed dihydroxy-acid dehydratase from Sulfobus solfataricus-A key enzyme for the conversion of carbohydrates into chemicals. J. Biotechnol. 2015;211:31–41. doi: 10.1016/j.jbiotec.2015.06.384. [DOI] [PubMed] [Google Scholar]
  • 9.Velasco JA, et al. Cloning of the dihydroxyacid dehydratase-encoding gene (ILV3) from Saccharomyces cerevisiae. Gene. 1993;137:179–185. doi: 10.1016/0378-1119(93)90004-M. [DOI] [PubMed] [Google Scholar]
  • 10.Bromke MA. Amino Acid Biosynthesis Pathways in Diatoms. Metabolites. 2013;3:294–311. doi: 10.3390/metabo3020294. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11.Zhang C, et al. Dihydroxyacid dehydratase is important for gametophyte development and disruption causes increased susceptibility to salinity stress in Arabidopsis. J. Exp. Bot. 2015;66:879–888. doi: 10.1093/jxb/eru449. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12.Kanamori M, Wixom RL. Studies in valine biosynthesis. V. Characteristics of the purified dihydroxyacid dehydratase from spinach leaves. J. Biol. Chem. 1963;238:998–1005. [PubMed] [Google Scholar]
  • 13.Myers JW. Dihydroxy acid dehydrase: an enzyme involved in the biosynthesis of isoleucine and valine. J. Biol. Chem. 1961;236:1414–1418. [PubMed] [Google Scholar]
  • 14.Conway T. The Entner-Doudoroff pathway: history, physiology and molecular biology. FEMS Microbiol. Rev. 1992;9:1–27. doi: 10.1111/j.1574-6968.1992.tb05822.x. [DOI] [PubMed] [Google Scholar]
  • 15.Gottschalk G, Bender R. D-Gluconate dehydratase from Clostridium pasteurianum. Methods Enzymol. 1982;90(Pt E):283–287. doi: 10.1016/S0076-6879(82)90141-0. [DOI] [PubMed] [Google Scholar]
  • 16.Meloche HP, Wood WA. The Mechanism of 6-Phosphogluconic Dehydrase. J. Biol. Chem. 1964;239:3505–3510. [PubMed] [Google Scholar]
  • 17.Stephens C, et al. Genetic analysis of a novel pathway for D-xylose metabolism in Caulobacter crescentus. J. Bacteriol. 2007;189:2181–2185. doi: 10.1128/JB.01438-06. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18.Weimberg R. Pentose oxidation by Pseudomonas fragi. J. Biol. Chem. 1961;236:629–635. [PubMed] [Google Scholar]
  • 19.Andberg M, et al. Characterization and mutagenesis of two novel iron-sulphur cluster pentonate dehydratases. Appl. Microbiol. Biotechnol. 2016;100:7549–7563. doi: 10.1007/s00253-016-7530-8. [DOI] [PubMed] [Google Scholar]
  • 20.Watanabe S, Shimada N, Tajima K, Kodaki T, Makino K. Identification and characterization of L-arabonate dehydratase, L-2-keto-3-deoxyarabonate dehydratase, and L-arabinolactonase involved in an alternative pathway of L-arabinose metabolism. Novel evolutionary insight into sugar metabolism. J. Biol. Chem. 2006;281:33521–33536. doi: 10.1074/jbc.M606727200. [DOI] [PubMed] [Google Scholar]
  • 21.Jiang Y, et al. Characterization of D-xylonate dehydratase YjhG from Escherichia coli. Bioengineered. 2015;6:227–232. doi: 10.1080/21655979.2015.1040208. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 22.Kim S, Lee SB. Identification and characterization of the bacterial d-gluconate dehydratase in Achromobacter xylosoxidans. Biotechnol Bioproc E. 2008;13:436–444. doi: 10.1007/s12257-008-0152-y. [DOI] [Google Scholar]
  • 23.Flint DH, Emptage MH, Finnegan MG, Fu W, Johnson MK. The role and properties of the iron-sulfur cluster in Escherichia coli dihydroxy-acid dehydratase. J. Biol. Chem. 1993;268:14732–14742. [PubMed] [Google Scholar]
  • 24.Flint DH, Emptage MH. Dihydroxy acid dehydratase from spinach contains a [2Fe-2S] cluster. J. Biol. Chem. 1988;263:3558–3564. [PubMed] [Google Scholar]
  • 25.Flint, D. H., Rothman, S. C., Suh, W. & Tomb, J. F. Identification and use of bacterial [2Fe-2S] dihydroxy-acid dehydratases, Patent WO2010037112 (A1) (2010).
  • 26.Rodriguez M, Wedd AG, Scopes RK. 6-phosphogluconate dehydratase from Zymomonas mobilis: an iron-sulfur-manganese enzyme. Biochem. Mol. Biol. Int. 1996;38:783–789. [PubMed] [Google Scholar]
  • 27.Rahman MM, et al. The crystal structure of a bacterial L-arabinonate dehydratase contains a [2Fe-2S] cluster. ACS Chem. Biol. 2017;12:1919–1927. doi: 10.1021/acschembio.7b00304. [DOI] [PubMed] [Google Scholar]
  • 28.Krissinel E, Henrick K. Inference of macromolecular assemblies from crystalline state. J. Mol. Biol. 2007;372:774–797. doi: 10.1016/j.jmb.2007.05.022. [DOI] [PubMed] [Google Scholar]
  • 29.Scopes RK, Griffiths-Smith K. Use of differential dye-ligand chromatography with affinity elution for enzyme purification: 6-phosphogluconate dehydratase from Zymomonas mobilis. Anal. Biochem. 1984;136:530–534. doi: 10.1016/0003-2697(84)90257-4. [DOI] [PubMed] [Google Scholar]
  • 30.Hulo N, et al. The PROSITE database. Nucleic Acids Res. 2006;34:D227–30. doi: 10.1093/nar/gkj063. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 31.Rahman MM, Andberg M, Koivula A, Rouvinen J, Hakulinen N. Crystallization and X-ray diffraction analysis of an L-arabinonate dehydratase from Rhizobium leguminosarum bv. trifolii and a D-xylonate dehydratase from Caulobacter crescentus. Acta Crystallogr. F. Struct. Biol. Commun. 2016;72:604–608. doi: 10.1107/S2053230X16010311. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 32.Kabsch W. Integration, scaling, space-group assignment and post-refinement. Acta Crystallogr. D Biol. Crystallogr. 2010;66:133–144. doi: 10.1107/S0907444909047374. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 33.McCoy AJ, et al. Phaser crystallographic software. J. Appl. Crystallogr. 2007;40:658–674. doi: 10.1107/S0021889807021206. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 34.Emsley P, Lohkamp B, Scott WG, Cowtan K. Features and development of Coot. Acta Crystallogr. D Biol. Crystallogr. 2010;66:486–501. doi: 10.1107/S0907444910007493. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 35.Afonine PV, et al. Towards automated crystallographic structure refinement with phenix.refine. Acta Crystallogr. D Biol. Crystallogr. 2012;68:352–367. doi: 10.1107/S0907444912001308. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 36.Chen VB, et al. MolProbity: all-atom structure validation for macromolecular crystallography. Acta Crystallogr. D Biol. Crystallogr. 2010;66:12–21. doi: 10.1107/S0907444909042073. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 37.Buchanan CL, Connaris H, Danson MJ, Reeve CD, Hough DW. An extremely thermostable aldolase from Sulfolobus solfataricus with specificity for non-phosphorylated substrates. Biochem. J. 1999;343(Pt 3):563–570. doi: 10.1042/bj3430563. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 38.Clark K, Karsch-Mizrachi I, Lipman DJ, Ostell J, Sayers EW. GenBank. Nucleic Acids Res. 2016;44:D67–72. doi: 10.1093/nar/gkv1276. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 39.Sievers F, et al. Fast, scalable generation of high-quality protein multiple sequence alignments using Clustal Omega. Mol. Syst. Biol. 2011;7:539. doi: 10.1038/msb.2011.75. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 40.Olson SA. MacVector: an integrated sequence analysis program for the Macintosh. Methods Mol. Biol. 1994;25:195–201. doi: 10.1385/0-89603-276-0:195. [DOI] [PubMed] [Google Scholar]
  • 41.Heinig M, Frishman D. STRIDE: a web server for secondary structure assignment from known atomic coordinates of proteins. Nucleic Acids Res. 2004;32:W500–2. doi: 10.1093/nar/gkh429. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 42.Bond, C. S. & Schuttelkopf, A. W. ALINE: aWYSIWYG protein-sequence alignment editor for publication-quality alignments. Acta Crystallogr. D Biol. Crystallogr.65, 510–512 (2009). [DOI] [PubMed]
  • 43.The PyMOL Molecular Graphics System, Version 1. 8 Schrödinger, LLC, Portland, OR (2015).

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Supplementary Information (287.7KB, docx)

Data Availability Statement

The atomic coordinates and structure factors have been deposited in the Protein Data Bank under the accession code 5OYN.


Articles from Scientific Reports are provided here courtesy of Nature Publishing Group

RESOURCES