Abstract
The mechanistically diverse enolase superfamily is a paradigm for elucidating Nature’s strategies for divergent evolution of enzyme function. Each of the different reactions catalyzed by members of the superfamily is initiated by abstraction of the α-proton of a carboxylate substrate that is coordinated to an essential Mg2+. The muconate lactonizing enzyme (MLE) from Pseudomonas putida, a member of a family that catalyzes the syn-cycloisomerization of cis,cis-muconate to (4S)-muconolactone in the β-ketoadipate pathway, has provided critical insights into the structural bases for evolution of function within the superfamily. A second, divergent family of homologues MLEs that catalyzes anti-cycloisomerization has been identified. Structures of members of both families liganded with the common (4S)-muconolactone product (syn, Pseudomonas fluorescens, GI:70731221; anti, Mycobacterium smegmatis, GI:118470554) document that the conserved Lys at the end of the second β-strand in the (β/α)7β-barrel domain serves as the acid catalyst in both reactions. The different stereochemical courses (syn and anti) result from different structural strategies for determining substrate specificity: although the distal carboxylate group of the cis,cis-muconate substrate attacks the same face of the proximal double bond, opposite faces of the resulting enolate anion intermediate are presented to the conserved Lys acid catalyst. The discovery of two families of homologous, but stereochemically distinct, MLEs likely provides an example of “pseudoconvergent” evolution of the same function from different homologous progenitors within the enolase superfamily, in which different spatial arrangements of active site functional groups and substrate specificity determinants support catalysis of the same reaction.
The mechanistically diverse enolase superfamily was recognized by the discovery that the structures of cis,cis-muconate lactonizing enzyme (MLE) and mandelate racemase (MR) from Pseudomonas putida are remarkably similar, despite sharing only 26% sequence identity (1). Although MLE and MR catalyze different overall reactions, each is initiated by general-base catalyzed abstraction of the α-proton of the carboxylate substrate that is coordinated to an essential Mg2+. Without significant stabilization by the Mg2+, the resulting enediolate intermediates would be too unstable to be kinetically competent.
The polypeptide chains of members of the enolase superfamily are organized in two domains (2, 3): 1) a (β/α)7β-barrel domain [modified TIM1-barrel] that contains the ligands for the Mg2+ as well as general acid/base catalysts at the C-terminal ends of the β-strands; and 2) an α+β capping domain formed from N- and, sometimes, C-terminal segments of the polypeptide chain that closes over the mouth of the barrel domain and contains residues responsible for determining substrate specificity. This large superfamily can be partitioned into subgroups based on phylogenetic analyses, the identities of the ligands for the essential Mg2+, and the identities and positions of the general acid/base catalysts. Functionally diverse subgroups include 1) the MLE subgroup in which the acid/base catalysts are Lys residues at the ends of the second and sixth β-strands (Figure 1A); and 2) the MR subgroup in which one of the acid/base catalysts is a His Asp dyad at the ends of the seventh and sixth β-strands. Within the superfamily, the only conserved residues are ligands for the Mg2+ at the ends of the third, fourth, and fifth β-strands, thereby establishing the importance of stabilization of the enediolate intermediate.
Figure 1.
Structure, functions, and sequence relationships in the MLE subgroup. Panel A, (β/α)7β-barrel domain of the MLE subgroup showing the positions of the conserved Asp/Glu ligands for the essential Mg2+ at the ends of the third, fourth, and fifth β-strands and the Lys acid/base catalysts at the ends of the second and sixth β-strands. Panel B, reactions catalyzed by members of the MLE subgroup.
Four reactions have been identified for members of the MLE subgroup (Figure 1B): 1) syn-cycloisomerization catalyzed by MLE (4–6); 2) syn-dehydration catalyzed by o-succinylbenzoate synthase (OSBS) (7); 3) dipeptide epimerization catalyzed by l-Ala-d/l-Glu epimerase (AEE) (8) plus other recently identified epimerases with divergent substrate specificities (9); and 4) racemization catalyzed by N-succinylamino acid racemases (NSAR) (10, 11). Flexible 20s and 50s loops in the capping domain complete the active site cavity in which the substrate is coordinated to the Mg2+ and positioned proximal to the acid/base catalysts. Divergent evolution within the MLE subgroup follows the strategy that different reactions can be accomplished by changing the identity of the carboxylate anion that binds vis-à-vis the conserved acid/base catalysts by altering the identities and locations of the residues that form the active site cavity (12–14).
A phylogenetic tree that depicts the relationships between sequence and function in the MLE subgroup is shown in Figure 2. The OSBSs catalyze syn-dehydration, as dictated by the structure of the substrate (7). The sequences are divergent; several subfamilies have been identified in which the common substrate specificity is determined by different structural motifs (15). The Lys at the end of the second β-strand is positioned proximal to the abstracted α-proton and the OH leaving group.
Figure 2.
Phylogenetic tree of the MLE subgroup constructed as described in the Materials and Methods section.
Several subfamilies of dipeptide epimerases have been identified. The AEE from Bacillus subtilis is specific for l-Ala-d/l-Glu, with Arg 24 in the 20s loop determining the specificity for the Glu residue (8). Homologous dipeptide epimerases, with sequence-divergent 20s and 50s loops in the capping domain, catalyze epimerization of other dipeptides (9). The epimerases share a D×D motif at the end of the eighth β-strand, with the Asp residues hydrogen-bonded to the substrate’s ammonium group. The Lys at the end of the second β-strand is the (R)-specific acid/base; the Lys at the end of the sixth β-strand is the (S)-specific acid/base.
Two families of NSARs have been identified. The first (NSAR-1 in Figure 2) includes the NSAR from Geobacillus kaustophilus that catalyzes racemization of hydrophobic N-succinylamino acids (10) and the promiscuous NSAR/OSBS from Amycolatopsis (16, 17); the second (NSAR-2 in Figure 2) includes the NSAR from B. cereus that catalyzes racemization of N-succinyl Arg and N-succinyl Lys (11). The specificity determinants differ in the two families (11, 17), but in each the N-succinyl moiety proximal to the end of the eighth β-strand, the Lys at the end of the second β-strand is the (R)-specific acid/base catalyst, and the Lys at the end of the sixth β-strand is the (S)-specific acid/base catalyst. Despite these shared features, the families evolved from different, but homologous, progenitors, providing an example of “pseudoconvergent” evolution of function within the enolase superfamily, i.e., these proteins evolved from different intermediate ancestors derived from the common progenitor of the enolase superfamily (11). This contrasts with “true” convergent evolution of the same function from unrelated progenitors (analogues) (15, 18).
Although the MLE from P. putida is the structural paradigm for the MLE subgroup, its structure-function relationships are uncertain (6). Addition of the distal carboxylate group across the proximal double bond proceeds with a syn-stereochemistry to produce (4S)-muconolactone, with the 5-proR hydrogen derived from solvent (4, 5). A potential Lys acid catalyst is located on either face of the active site at the end of the second or sixth β-strand, but its identity is uncertain because no product-liganded structure has been available.
Herein we report the structure of the (4S)-muconolactone liganded complex of the MLE from P. fluorescens that catalyzes syn-cycloisomerization (in the same branch of the tree and 76% sequence identity with the MLE from P. putida; MLE-1 in Figure 2). We also report functional and structural characterization of the MLE from Mycobacterium smegmatis, a member of a second family of MLEs2 (MLE-2 in Figure 2), that catalyzes anti-cycloisomerization although it also produces (4S)-muconolactone. In both families the Lys at the end of the second β-strand serves as the acid catalyst; the different stereochemical courses result from distinct substrate binding modes. In the syn-family a solvent-derived proton is delivered to the 5-proR position of the product; in the anti-family a solvent-derived proton is delivered to the 5-proS position. The possible pseudoconvergent evolution to yield MLEs with stereochemically distinct mechanisms reinforces our conclusion that evolution of function within the enolase superfamily often occurs via changes in specificity determining residues and not changes in the identities/positions of the acid/base catalysts.
MATERIALS AND METHODS
Genomic Context
All MLE subgroup sequences except those belonging to the dipeptide epimerase, OSBS and NSAR families were used as queries in Microbes Online (http://www.microbesonline.org/) to identify adjacent genes that are likely to be regulated as operons. The relationships among the MLE subgroup proteins and all other proteins identified in Microbes Online were assessed using Cytoscape (19) to determine whether the MLE subgroup proteins share the same operon contexts, which would suggest that they share the same functions,
Phylogenetic Inference
All members of the MLE subgroup were aligned using Muscle v. 3.52 (20), and the alignment was manually adjusted based on a structural alignment (15). Phylogenetic reconstruction was performed using Bayesian and distance methods. Bayesian trees were constructed with MrBayes v3.1.1 (21, 22) under the WAG amino acid substitution model (23) using a gamma distribution to approximate rate variation among sites.
Cloning, Expression and Purification of the MLE 2 from M. smegmatis (GI:118470554)
The gene encoding the MLE-2 from M. smegmatis was PCR amplified from M. smegmatis str. MC2 155 genomic DNA (kindly provided by Prof. William R. Jacobs, Jr., Howard Hughes Medical Institute and Albert Einstein College of Medicine) using platinum Pfx DNA polymerase (Invitrogen). The PCR reaction (50 µL) contained 60 ng of DNA, 5 µL of 10x Pfx amplification buffer, 1mM MgSO4, 0.4 mM dNTP, 20 pmol of each primer, and 1 unit of platinum Pfx polymerase. The gene was amplified and cloned into the pET-17b vector (Novagen). The MLE was expressed in Escherichia coli strain BL21 (DE3). Transformed cells were grown at 37 °C for 48 hours in LB supplemented with 100 µg/mL ampicillin and harvested by centrifugation. The cells were resuspended in low salt buffer (20 mM Tris, pH 8.0, and 5 mM MgCl2) and lysed by sonication. The lysate was clarified by centrifugation, and the supernatant was applied to a DEAE Sepharose Fast Flow (GE Healthcare) column in low salt buffer. The protein was eluted with a linear gradient of NaCl (0.5 – 1 M) in low salt buffer. Fractions containing MLE were identified by SDS-PAGE analysis and applied to a Phenyl Sepharose 6 Fast Flow column in 0.5 M (NH4)2SO4 for further purification. The protein was eluted with a linear gradient of (NH4)2SO4 (0.5 – 0 M), and the purest fractions were pooled and dialyzed against 20 mM Tris, pH 8.0, and 5 mM MgCl2.
Cloning, Expression and Purification of the MLE 1 from Pseudomonas fluorescens (GI:70731221)
The PCR product from Pseudomonas fluorescens genomic DNA (ATCC) was cloned into a TOPO vector (Invitrogen) with a non-cleavable C-terminal His6 tag. The clone was transformed into BL21-RIL cells (CodonPlus). Overnight cultures were grown from glycerol stocks in HY media (Medicilon Inc.) and transferred to 2 L shake flasks containing 1 L of HY media with kanamycin and chloramphenicol. Cells were grown to an OD595 = 1.0 at 37 °C; IPTG was added to induce expression of the MLE. After 18 hours at 22 °C, the cells were harvested by centrifugation; the pellets stored at −80 °C. Cells were lysed by sonication in 20 mM Tris pH 8.0, 0.5 M NaCl, 25 mM imidazole, and 0.1% Tween 20. The protein was purified by sequential nickel-affinity/size-exclusion chromatographies.
Spectrophotometric Assay for MLE Activity
MLE activity was measured at 25 °C by quantitating the decrease in absorbance at 260 nm due to the consumption of cis,cis-muconate (ε260 = 16,900 M−1 cm−1) in 50 mM Tris-HCl, pH 7.5, and 0.1 mM MnCl2.
Polarimetric Assay to Detect (4S)-Muconolactone Configurations
The change of optical rotation due to muconolactone production was followed at Hg 405 nm using a JASCO P-1010 polarimeter at 25 °C until equilibrium was established. The reaction mixture contained 10 mM cis,cis-muconate and 0.1 µM MLE in 50 mM Tris-Cl, pH 7.5, and 0.1 mM MnCl2.
Stereochemical Studies
1H NMR spectra (500 MHz) were recorded in a D2O containing buffer at 37 °C. The reactions contained 10 mM cis,cis-muconate, 50 mM sodium phosphate buffer, pD 6.0, 5 mM MgCl2, and 1 µM MLE.
Crystallization and Data Collection
Five crystal forms (Table 1) were grown by the hanging drop vapor diffusion method at room temperature: 1) MLE from M. smegmatis (MLE-MS) and Mg2+, 2) MLE-MS, Mg2+, and cis,cis-muconate (CCM) in a monoclinic space group, 3) MLE-MS, Mg2+, and CCM in a triclinic space group, 4) MLE from P. fluorescens (MLE-PF) and Mg2+, and 5) MLE-PF, Mg2+, and CCM. Crystallization utilized the following conditions:
For MLE-MS and Mg2+ (MLE-MS·Mg2+), the protein (33.7 mg/mL) was in 20 mM Tris-HCl, pH 8.0, and 5 mM MgCl2; the precipitant was 10% 2-propanol, 0.1 M MES, pH 6.0, and 0.2 M Ca(OAc)2.
For crystallization of MLE-MS, Mg2+, and CCM (MLE-MS·Mg2+·CCM) in the tetragonal space group I422, the protein (33.7 mg/mL) was in 20 mM Tris-HCl, pH 8.0, 5 mM MgCl2, and 20 mM CCM; the precipitant was 1.0 M (NH4)2SO4, 0.1 M HEPES, pH 7.0, and 0.5% PEG 8000.
For crystallization of MLE-MS·Mg2+·CCM in the monoclinic space group C2, the conditions were identical to those used for the previous sample, except that the CCM concentration was 40 mM.
For MLE-PF and Mg2+ (MLE-PF·Mg2+), the protein (5.0 mg/mL) was in 20 mM HEPES, pH 7.5, 150 mM NaCl, 10 mM MgCl2, and 10% glycerol; the precipitant was 20% PEG 1000, 0.1 M cacodylate, pH 6.5, and 0.2 M MgCl2.
For MLE-PF, Mg2+, and CCM (MLE-PF·Mg2+·CCM), the protein (4.6 mg/mL) was in 20 mM HEPES, pH 7.5, 150 mM NaCl, 10 mM MgCl2, 10% glycerol, and 80 mM CCM; the precipitant was 2.0 M (NH4)2SO4, 0.1 M MES, pH 6.0, and 5% isopropanol.
Table 1.
X-ray data collection and refinement statistics for crystals of MLE’s from M. smegmatis and P. fluorescens
| MLE-MS·Mg2+ | MLE-MS·Mg2+·CCM | MLE-MS·Mg2+·CCM | MLE-PF·Mg2+ | MLE-PF·Mg2+·CCM | |
|---|---|---|---|---|---|
| Data Collection | |||||
| Space group | I422 | I422 | C2 | P4212 | I422 |
| Mol. in a.u. | 1 | 1 | 4 | 2 | 1 |
| Unit cell | |||||
| a (Å) | 123.62 | 123.84 | 171.05 | 135.93 | 133.84 |
| b (Å) | 124.26 | ||||
| c (Å) | 117.59 | 117.37 | 117.48 | 82.97 | 103.37 |
| β (°) | 133.4 | ||||
| Resolution (Å) | 25.0–1.6 | 25.0–1.6 | 25.0–2.0 | 25.0–1.8 | 25.0–1.7 |
| Unique | |||||
| Reflections | 56648 | 59180 | 116011 | 71185 | 51329 |
| Completeness (%) | 94.4 | 98.5 | 96.5 | 98.4 | 99.7 |
| Rmerge | 0.087 | 0.098 | 0.076 | 0.081 | 0.074 |
| Average I/σ | 25.9 | 39.1 | 15.2 | 14.7 | 22.7 |
| Refinement | |||||
| Resolution (Å) | 25.0–1.6 | 25.0–1.6 | 25.0–2.0 | 25.0–1.8 | 25.0–1.7 |
| Rcryst | 0.189 | 0.185 | 0.188 | 0.215 | 0.189 |
| Rfree | 0.204 | 0.201 | 0.203 | 0.222 | 0.209 |
| rmsd, bonds (Å) | 0.005 | 0.005 | 0.006 | 0.005 | 0.005 |
| rmsd, angles (°) | 1.2 | 1.3 | 1.3 | 1.2 | 1.2 |
| no. of atoms | |||||
| Protein | 2710 | 2797 | 11188 | 5408 | 2787 |
| Water | 298 | 299 | 789 | 414 | 236 |
| Mg2+ | 1 | 1 | 4 | 2 | 1 |
| Inhibitor | 10 | 40 | 10 | ||
| PDB entry | 3DG3 | 3DG6 | 3DG7 | 3CT2 | 3DGB |
The crystals were transferred to a cryoprotectant solution composed of their mother liquid and 20% glycerol and then flash-cooled in a nitrogen stream. X-Ray diffraction data were collected at the NSLS X4A beamline (Brookhaven National Laboratory) on an ADSC CCD detector. Diffraction intensities were integrated and scaled with programs DENZO and SCALEPACK (24). The data collection statistics are given in Table 1.
Structure Determination and Model Refinement
The structure of MLE-MS·Mg2+ was determined by molecular replacement with the fully automated molecular replacement pipeline BALBES (25), using only diffraction data and the amino acid sequence. Iterative cycles of manual rebuilding with TOM (26), refinement with CNS (27), and automatic rebuilding with ARP (28) were performed. Residues 17–27 in the 20s loop have no density and were excluded from the final refinement model.
For the structure of the tetragonal MLE-MS·Mg2+·CCM complex, the MLE-MS·Mg2+ structure was used as the starting point. Iterative cycles of manual rebuilding with TOM, refinement with CNS, and automatic rebuilding with ARP with subsequent inclusion of water molecules were performed. The structure of the monoclinic MLE-MS·Mg2+·CCM complex was determined and partially refined with BALBES. Subsequent iterative cycles of manual rebuilding with TOM, refinement with CNS, and automatic rebuilding with ARP were performed. Both structures of MLE-MS·Mg2+·CCM contain residues 1–366. The 20s loop, Mg2+ ion, and CCM have well-defined density in the single polypeptide chain in the tetragonal crystal form and in all four copies of the polypeptide chain comprising the asymmetric unit for the monoclinic crystal form.
The structures of MLE-PF·Mg2+ and MLE-PF·Mg2+·CCM were determined and refined by analogous methods. The final atomic model of MLE-PF·Mg2+ includes residues 4–375 with the exception of residues 22–32 in the 20s loop. The final atomic model of MLE-PF·Mg2+·CCM includes residues 4–374; the 20s loop, Mg2+ ion, and CCM are well defined in the structure.
Crystallographic refinement statistics are also provided in Table 1.
Molecular Graphics Images
The images in Figure 1, panel A, and Figure 5, panels A, C, and D, were constructed using Chimera (29).
Figure 5.
Panel A, superposition of the polypeptides of the MLEs from P. fluorescens and M. smegmatis. The backbone of the MLE from P. fluorescens is shown in grey, the α-helices are shown in red, and the β-strands are shown in blue; the backbone of the MLE from M. smegmatis is shown in tan, the α-helices are shown in magenta, and the β-strands are shown in green. The (4S)-muconolactone ligands are represented by the space-filling structures. Panel B: Electron density for the (4S)-muconolactone ligand in: left, the active site of the MLE from P. fluorescens; right, the active site of the MLE from M. smegmatis. Panel C and D: left, side and top views of the active site of the MLE from P. fluorescens; right, side and top views of the active site of the MLE from M. smegmatis. The (4S)-muconolactone ligands are shown in brown.
RESULTS AND DISCUSSION
As described in the Introduction, several subfamilies of monophyletic OSBSs and dipeptide epimerases as well as two phylogenetically distinct families of NSARs can be identified in the MLE subgroup (Figure 2) (9, 15). Also shown in Figure 2 are two families of MLEs: one (MLE-1) that includes the paradigmatic MLE from P. putida and a second (MLE-2) that includes a protein from Arthrobacter sp. BA-5-17 (GI:42627739) that was recently shown to be an MLE by enzymatic assays (30). We extended the MLE functional assignment to other members of the “new” family2 using operon context, i.e., proximal to genes encoding catechol 1,2-dioxygenase, muconolactone δ-isomerase, and/or enol-lactone hydrolase).
Kinetic and Stereochemical Characterization of a “New” MLE (MLE-2)
We isolated a “new” MLE-2 from Mycobacterium smegmatis (GI:118470554, 31% sequence identity with the MLE-1 from P. putida, GI:157838298, and occupying a different branch in the phylogenetic tree in Figure 2); the kinetic constants of the MLEs from M. smegmatis and P. putida for cycloisomerization of cis,cis-muconate are compared in Table 2.
Table 2.
Kinetic constants for MLEs.
| MLE | kcat [s−1] | Km [M] | kcat/Km [M−1 s−1] |
| Mycobacterium smegmatis | 78 ± 6 | (5.6 ± 1) × 10−4 | 1.4 × 105 |
| Pseudomonas putida | 11 ± 0.5 | (5.4 ± 0.5) × 10−4 | 2.0 × 104 |
| Pseudomonas fluorescens | 110 ± 2 | (1.6 ± 0.2) × 10−5 | 7.0 × 106 |
We determined the stereochemical course of the reaction catalyzed by the MLE-2 from M. smegmatis by 1) measuring the optical rotation of the muconolactone product; and 2) performing the reaction in D2O so that the location of the solvent-derived hydrogen in the muconolactone products could be determined by 1H NMR spectroscopy (at neutral pH, the equilibrium favors the muconolactone). Like the MLE-1 from P. putida (4, 5), the MLE-2 from M. smegmatis catalyzes formation of (+)-(4S)-muconolactone (data not shown); however, unlike the paradigmatic MLE, the solvent-derived deuterium is incorporated into the 5-proR hydrogen, not the 5-proS hydrogen (Figure 3). These observations establish that cycloisomerization catalyzed by the MLE-2 proceeds via an anti-stereochemical course, in contrast to the syn-stereochemical course for the MLE-1 from P. putida.
Figure 3.
Stereochemical course of the MLE-catalyzed reaction. The facial relationships of the carboxylate group and active site acid/base catalyst in anti- and syn-cycloisomerization reactions are shown at the top of the Figure. Panel A, 1H NMR spectrum of (4S)-muconolactone obtained in H2O. Panel B, 1H NMR spectrum of (4S)-muconolactone obtained in D2O with the MLE from P. fluorescens. Panel C, 1H NMR spectrum of (4S)-muconolactone obtained in D2O with the MLE from M. smegmatis.
Structures of Product Liganded MLEs
We obtained crystals and determined structures for the anti-MLE from M. smegmatis both in the absence and presence of cis,cis-muconate.
Because crystals of a substrate/product liganded complex of the MLE from P. putida could not be determined, we isolated other MLEs from the syn-family (MLE-1)3 and subjected these to crystallization trials in the absence and presence of cis,cis-muconate. Crystals were obtained and structures determined for the MLE-1 from P. fluorescens both in the absence and presence of cis,cis muconate (GI:70731221, 76% sequence identity with the MLE-1 from P. putida and in the same branch of the tree and 30% sequence identity with the MLE-2 from M. smegmatis). We determined the kinetic constants (Table 2) and stereochemical course (Figure 2C) for the reaction catalyzed by the MLE from P. fluorescens. This enzyme catalyzes syn-cycloisomerization (with incorporation of solvent deuterium in the 5-proS hydrogen) like the paradigmatic MLE from P. putida.
As expected on the basis of sequence similarity (Figure 4), the MLEs from M. smegmatis and P. fluorescens are structurally similar (Figure 5A; root-mean-square deviation for 273 atom pairs = 1.24 Å), with the barrel-domains containing the expected carboxylate ligands for the Mg2+ at the ends of the third (Asp), fourth (Glu), and fifth (Asp) β-strands and Lys residues at the ends of the second and sixth β-strands (Figure 5C/D). In both structures, the (4S)-muconolactone product is sequestered from solvent by the capping domain (electron densities for the (4S)-muconolactone products are shown in Figure 5B). The sequence divergent 20s loops contain the same number of residues whereas the sequence divergent 50s loops differ in length by two residues (Figure 4). Thus, the structures of the active site cavities are not conserved between the two families.
Figure 4.
Sequence alignment of the syn- and anti-MLEs from P. fluorescens and M. smegmatis, respectively. The positions of the 20s and 50s loops in the capping domain and the β-strands in the (β/α)7β-barrel domain are indicated. The conserved residues are highlighted in blue, the ligands for Mg2+ in green, the Lys residues at the end of the second and sixth β-strands in red, and the positions of the hydrogen bond donors to the lactone carbonyl of (4S)-muconolactone in magenta (the structurally homologous residues for these hydrogen bond donors also are highlighted to emphasize the differing identities/positions of the specificity determinants).
In both MLEs, the Lys at the end of the second β-strand is proximal to C5 of the (4S)-muconolactone product (Figure 5C/D). Lys 171 in the syn-MLE is spatially proximal to both the 5-proS hydrogen and the lactone oxygen. Lys 162 in the anti-MLE is spatially proximal to the 5-proR hydrogen and opposite the lactone oxygen derived from the distal carboxylate group. These relationships are those expected from the stereochemical courses and establish that the Lys at the end of the second β-strand serves as the acid/base catalyst in both families. In both structures, the Lys at the end of the sixth β-strand (Lys 275 in the syn MLE and Lys 266 in the anti MLE) is proximal to the carboxylate group of muconolactone product that is coordinated to the Mg2+ and presumably assists in stabilization of the enolate anion intermediate.
The lactone rings (derived from the distal carboxylate group of the substrate) are located in different positions in the active sites of the two MLEs. In the syn-MLE the lactone carbonyl oxygen is hydrogen-bonded to His 24 in the 20s loop that is conserved in the MLE-1 family; in the anti-MLE, the lactone carbonyl oxygen is hydrogen-bonded to Gln 294 in the loop following the seventh β-strand that is conserved in the MLE-2 family. These interactions enforce different rotamers between the Mg2+-coordinated carboxylate group and proximal double bond cis,cis-muconate substrate, although in both enzymes the distal carboxylate group attacks the re-face of the proximal double bond to produce the same (4S)-muconolactone enolate anion intermediate. The conserved Lys at the end of the second β-strand is located on opposite faces of the proximal double bond and delivers a proton to opposite faces of the conserved enolate anion intermediate, producing the observed opposite stereochemical courses for the reactions.
Divergent Evolution of the MLE Function
Despite the differing stereochemical courses, the mechanisms of the syn- and anti-cycloisomerization reactions are expected to be the same: 1) addition of the distal carboxylate to the same face of the proximal double bond to form the enolate intermediate that is stabilized by coordination to the essential Mg2+ as well as proximity to the Lys at the end of the sixth β-strand; and 2) protonation of the intermediate, albeit on opposite faces, by the conserved Lys acid catalyst at the end of the second β-strand as the result of the differing binding geometries for the cis,cis-muconate substrate.
Unlike the 1,1-proton transfer reactions catalyzed by AEEs and NSARs and the substrate structure-enforced syn-dehydration reaction catalyzed by the OSBSs, the cycloisomerization reactions catalyzed by the divergent MLEs occur via distinct stereochemical pathways that involve opposite orientations of the carboxylate nucleophile vis-à-vis the acid catalyst. This difference in catalytic strategy is of no metabolic consequence because the same (4S)-muconolactone product is produced by both families.
“Pseudoconvergent” Evolution of the MLE Function in the Enolase Superfamily?
Presumably, the progenitors for the syn- and anti-MLEs differed in the shapes and polarities of their active sites so that the evolved MLEs would enforce cis,cis-muconate to bind as the distinct rotamers that determine the stereochemical courses of the syn- and anti-cycloisomerization reactions. Phylogenetic and genomic context relationships provide support for the hypothesis that the progenitors for the syn- and anti-MLEs may have had different catalytic functions. Referring to the phylogenetic tree in Figure 2, although the MLEs share the same active site motif (metal ion ligands and Lys acidic/electrophilic catalysts), they are separated by a group of uncharacterized proteins that likely do not catalyze the MLE reaction as suggested by operon/genome context. The genomes that encode this intervening proteins lack homologues of catechol 1,2-dioxygenase and muconolactone δ-isomerase, suggesting that these organisms do not have a β-ketoadipate pathway and, therefore, do not require the MLE function. The functions of these proteins are of interest, e.g., perhaps they catalyze a promiscuous MLE reaction, and are under investigation.
If this hypothesis is correct, the stereochemically distinct families of MLEs would provide the second example of “pseudoconvergent” evolution of function within the superfamily, i.e., the syn- and anti-MLEs evolved from different ancestors within the MLE subgroup. Of course, these intermediate progenitors would have evolved divergently from the common progenitor of the MLE subgroup. We previously identified two evolutionarily distinct families of NSARs, also in the MLE subgroup, one more closely related to the OSBS family (10, 15–17) and the second more closely related to the dipeptide epimerases (11), suggesting that Nature can evolve the same overall reaction from different starting points, perhaps providing a useful lesson for in vitro enzyme engineering (31).
Support for this interpretation is provided by experiments we previously described. In vitro design followed by directed evolution was used to change enzymatic functions in the MLE subgroup by changing specificity-determining residues, i.e., we introduced the OSBS function into the AEE structural scaffold (12–14). Now, with the discovery of stereochemically distinct families of homologous MLEs, we conclude that substitutions for specificity-determining residues is a strategy that has, in fact, been used in the natural evolution of new functions.
Footnotes
This research was supported by NIH R01 GM-60595 (to P.C.B.), NIH P01 GM-71790 (to J.A.G., P.C.B., and S.C.A.), NIH U54 GM-74945 (to S.K.B. and S.C.A.), and NSF-0640476 (to P.C.B.). Molecular graphics images were produced using the UCSF Chimera package from the Resource for Biocomputing, Visualization, and Informatics at the University of California, San Francisco (supported by NIH P41 RR-01081). The X-ray coordinates and structure factors for the MLE from Mycobacterium smegmatis liganded with Mg2+, the MLE from M. smegmatis liganded with Mg2+ and (4S)-muconolactone in two space groups, the MLE from Pseudomonas fluorescens liganded with Mg2+, and the MLE from P. fluorescens liganded with Mg2+ and (4S)-muconolactone have been deposited in the Protein Data Bank (PDB accession codes, 3DG3, 3DG6, 3DG7, 3CT2, and 3DGB respectively).
Abbreviations: AEE, l-Ala-d/l-Glu epimerase; CCM, cis,cis-muconate; LB, Luria-Bertani broth; MLE, muconate lactonizing enzyme; MR, mandelate racemase; NSAR, N-succinylamino acid racemase; OSBS, o-succinylbenzoate synthase; TIM, triose phosphate isomerase.
GI:42627730, Arthrobacter sp. BA-5-17; GI:116670480, Arthrobacter sp. FB24; GI:145221949, Mycobacterium gilvum PYR-GCK; GI:118470554, Mycobacterium smegmatis str. MC2 155; GI:126434001, Mycobacterium sp. JLS; GI:108798346, Mycobacterium sp. KMS; GI:108789346, Mycobacterium sp. MCS; GI:54025491, Nocardia farcinica IFM 10152; GI:119715080, Nocardioides sp. JS614; GI:134100914, Saccharopolyspora erythraea NRRL 2338; GI:25990736, Streptomyces setonii.
GI:23308936, Corynebacterium glutamicum; GI:152970427, Klebsiella pneumoniae MGH 78758; GI:70731221, Pseudomonas fluorescens PF-5.
REFERENCES
- 1.Neidhart DJ, Kenyon GL, Gerlt JA, Petsko GA. Mandelate racemase and muconate lactonizing enzyme are mechanistically distinct and structurally homologous. Nature. 1990;347:692–694. doi: 10.1038/347692a0. [DOI] [PubMed] [Google Scholar]
- 2.Babbitt PC, Hasson MS, Wedekind JE, Palmer DR, Barrett WC, Reed GH, Rayment I, Ringe D, Kenyon GL, Gerlt JA. The enolase superfamily: a general strategy for enzyme-catalyzed abstraction of the alpha-protons of carboxylic acids. Biochemistry. 1996;35:16489–16501. doi: 10.1021/bi9616413. [DOI] [PubMed] [Google Scholar]
- 3.Gerlt JA, Babbitt PC, Rayment I. Divergent evolution in the enolase superfamily: the interplay of mechanism and specificity. Arch Biochem Biophys. 2005;433:59–70. doi: 10.1016/j.abb.2004.07.034. [DOI] [PubMed] [Google Scholar]
- 4.Avigad G, England S. Fed. Proc., Fed. Am. Soc. Exp. Biol. 1984;28:345. Abstr. 486. [Google Scholar]
- 5.Chari RVJ, Whitman CP, Kozarich JW, Ngai K-L, Ornston LN. Absolute stereochemical coure of the 3-carboxylmuconate cycloisomerases from Pseudomonas putida and Acinetobacter calcoaceticus: analysis and implications. J. Am. Chem. Soc. 1987;109:5514–5519. [Google Scholar]
- 6.Hasson MS, Schlichting I, Moulai J, Taylor K, Barrett W, Kenyon GL, Babbitt PC, Gerlt JA, Petsko GA, Ringe D. Evolution of an enzyme active site: the structure of a new crystal form of muconate lactonizing enzyme compared with mandelate racemase and enolase. Proc Natl Acad Sci U S A. 1998;95:10396–10401. doi: 10.1073/pnas.95.18.10396. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Thompson TB, Garrett JB, Taylor EA, Meganathan R, Gerlt JA, Rayment I. Evolution of enzymatic activity in the enolase superfamily: structure of o-succinylbenzoate synthase from Escherichia coli in complex with Mg2+ and o-succinylbenzoate. Biochemistry. 2000;39:10662–10676. doi: 10.1021/bi000855o. [DOI] [PubMed] [Google Scholar]
- 8.Schmidt DM, Hubbard BK, Gerlt JA. Evolution of Enzymatic Activities in the Enolase Superfamily: Functional Assignment of Unknown Proteins in Bacillus subtilis and Escherichia coli as L-Ala-D/L-Glu Epimerases. Biochemistry. 2001;40:15707–15715. doi: 10.1021/bi011640x. [DOI] [PubMed] [Google Scholar]
- 9.Kalyanaraman C, Imker HJ, Fedorov AA, Fedorov EV, Glasner ME, Babbitt PC, Almo SC, Gerlt JA, Jacobson MP. Discovery of a dipeptide epimerase enzymatic function guided by homology modeling and virtual screening. Structure. 2008;16:1668–1677. doi: 10.1016/j.str.2008.08.015. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Sakai A, Xiang DF, Xu C, Song L, Yew WS, Raushel FM, Gerlt JA. Evolution of enzymatic activities in the enolase superfamily: N-succinylamino acid racemase and a new pathway for the irreversible conversion of D- to L-amino acids. Biochemistry. 2006;45:4455–4462. doi: 10.1021/bi060230b. [DOI] [PubMed] [Google Scholar]
- 11.Song L, Kalyanaraman C, Fedorov AA, Fedorov EV, Glasner ME, Brown S, Imker HJ, Babbitt PC, Almo SC, Jacobson MP, Gerlt JA. Prediction and assignment of function for a divergent N-succinyl amino acid racemase. Nat Chem Biol. 2007;3:486–491. doi: 10.1038/nchembio.2007.11. [DOI] [PubMed] [Google Scholar]
- 12.Schmidt DM, Mundorff EC, Dojka M, Bermudez E, Ness JE, Govindarajan S, Babbitt PC, Minshull J, Gerlt JA. Evolutionary potential of (beta/alpha)8-barrels: functional promiscuity produced by single substitutions in the enolase superfamily. Biochemistry. 2003;42:8387–8393. doi: 10.1021/bi034769a. [DOI] [PubMed] [Google Scholar]
- 13.Vick JE, Schmidt DM, Gerlt JA. Evolutionary Potential of (beta/alpha)(8)-Barrels: In Vitro Enhancement of a "New" Reaction in the Enolase Superfamily. Biochemistry. 2005;44:11722–11729. doi: 10.1021/bi050963g. [DOI] [PubMed] [Google Scholar]
- 14.Vick JE, Gerlt JA. Evolutionary potential of (beta/alpha)8-barrels: stepwise evolution of a "new" reaction in the enolase superfamily. Biochemistry. 2007;46:14589–14597. doi: 10.1021/bi7019063. [DOI] [PubMed] [Google Scholar]
- 15.Glasner ME, Fayazmanesh N, Chiang RA, Sakai A, Jacobson MP, Gerlt JA, Babbitt PC. Evolution of structure and function in the o-succinylbenzoate synthase/N-acylamino acid racemase family of the enolase superfamily. J Mol Biol. 2006;360:228–250. doi: 10.1016/j.jmb.2006.04.055. [DOI] [PubMed] [Google Scholar]
- 16.Taylor Ringia EA, Garrett JB, Thoden JB, Holden HM, Rayment I, Gerlt JA. Evolution of enzymatic activity in the enolase superfamily: functional studies of the promiscuous o-succinylbenzoate synthase from Amycolatopsis. Biochemistry. 2004;43:224–229. doi: 10.1021/bi035815+. [DOI] [PubMed] [Google Scholar]
- 17.Thoden JB, Taylor Ringia EA, Garrett JB, Gerlt JA, Holden HM, Rayment I. Evolution of Enzymatic Activity in the Enolase Superfamily: Structural Studies of the Promiscuous o-Succinylbenzoate Synthase from Amycolatopsis(,) Biochemistry. 2004;43:5716–5727. doi: 10.1021/bi0497897. [DOI] [PubMed] [Google Scholar]
- 18.Galperin MY, Walker DR, Koonin EV. Analogous enzymes: independent inventions in enzyme evolution. Genome Res. 1998;8:779–790. doi: 10.1101/gr.8.8.779. [DOI] [PubMed] [Google Scholar]
- 19.Shannon P, Markiel A, Ozier O, Baliga NS, Wang JT, Ramage D, Amin N, Schwikowski B, Ideker T. Cytoscape: a software environment for integrated models of biomolecular interaction networks. Genome Res. 2003;13:2498–2504. doi: 10.1101/gr.1239303. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20.Edgar RC. MUSCLE: multiple sequence alignment with high accuracy and high throughput. Nucleic Acids Res. 2004;32:1792–1797. doi: 10.1093/nar/gkh340. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Ronquist F, Huelsenbeck JP. MrBayes 3: Bayesian phylogenetic inference under mixed models. Bioinformatics. 2003;19:1572–1574. doi: 10.1093/bioinformatics/btg180. [DOI] [PubMed] [Google Scholar]
- 22.Altekar G, Dwarkadas S, Huelsenbeck JP, Ronquist F. Parallel Metropolis coupled Markov chain Monte Carlo for Bayesian phylogenetic inference. Bioinformatics. 2004;20:407–415. doi: 10.1093/bioinformatics/btg427. [DOI] [PubMed] [Google Scholar]
- 23.Whelan S, Goldman N. A general empirical model of protein evolution derived from multiple protein families using a maximum-likelihood approach. Mol Biol Evol. 2001;18:691–699. doi: 10.1093/oxfordjournals.molbev.a003851. [DOI] [PubMed] [Google Scholar]
- 24.Otwinowski Z, Minor W. Processing of X-ray diffraction data collected in oscillation mode. In: Carter CWJ, Sweet RM, Abelson JN, Simon MI, editors. Methods in Enzymology. New York: Academic Pres; 1997. pp. 307–326. [DOI] [PubMed] [Google Scholar]
- 25.Long F, Vagin AA, Young P, Murshudov GN. BALBES: a molecular-replacement pipeline. Acta Crystallogr D Biol Crystallogr. 2008;64:125–132. doi: 10.1107/S0907444907050172. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26.Jones AT. Interactive computer graphics: FRODO. Methods Enzymol. 1985;115:157–171. doi: 10.1016/0076-6879(85)15014-7. [DOI] [PubMed] [Google Scholar]
- 27.Brunger AT, Adams PD, Clore GM, DeLAno WL, Gros P, Grosse-Kunstleve RW, Jiang JS, Kuszewski J, Nilges M, Pannu NS, Read RJ, Rice LM, Simonson T, Warren GL. Crystallography & NMR system: A new software suite for macromolecular structure determination. Acta Crystallogr. 1998;D54:905–921. doi: 10.1107/s0907444998003254. [DOI] [PubMed] [Google Scholar]
- 28.Lamzin VS, Wilson KS. Automated refinement of protein models. Acta Crystallogr D Biol Crystallogr. 1993;49:129–147. doi: 10.1107/S0907444992008886. [DOI] [PubMed] [Google Scholar]
- 29.Pettersen EF, Goddard TD, Huang CC, Couch GS, Greenblatt DM, Meng EC, Ferrin TE. UCSF Chimera--a visualization system for exploratory research and analysis. J Comput Chem. 2004;25:1605–1612. doi: 10.1002/jcc.20084. [DOI] [PubMed] [Google Scholar]
- 30.Murakami S, Kohsaka C, Okuno T, Takenaka S, Aoki K. Purification, characterization, and gene cloning of cis, cis-muconate cycloisomerase from benzamide-assimilating Arthrobacter sp. BA-5–17. FEMS Microbiol Lett. 2004;231:119–124. doi: 10.1016/S0378-1097(03)00933-9. [DOI] [PubMed] [Google Scholar]
- 31.Glasner ME, Gerlt JA, Babbitt PC. Evolution of enzyme superfamilies. Curr Opin Chem Biol. 2006;10:492–497. doi: 10.1016/j.cbpa.2006.08.012. [DOI] [PubMed] [Google Scholar]





