Abstract
The non-proteinogenic amino acid enduracididine is a critical component of the mannopeptimycins, cyclic glycopeptide antibiotics with activity against drug-resistant pathogens including methicillin-resistant Staphylococcus aureus. Enduracididine is produced in Streptomyces hygroscopicus by three enzymes, MppP, MppQ, and MppR. Based on primary sequence analysis, MppP and Q are pyridoxal-5'-phosphate-dependent aminotransferases; MppR shares low, but significant, sequence identity with acetoacetate decarboxylase. The exact reactions catalyzed by each enzyme, and the intermediates involved in the route to enduracididine are currently unknown. Herein we present biochemical and structural characterization of MppR that demonstrates a catalytic activity for this enzyme and provides clues about its role in enduracididine biosynthesis. Bioinformatic analysis shows that MppR belongs to a previously uncharacterized family within the acetoacetate decarboxylase-like superfamily (ADCSF) and suggests that MppR-like enzymes may catalyze reactions diverging from the well-characterized, prototypical ADCSF decarboxylase activity. MppR shares a high degree of structural similarity with acetoacetate decarboxylase, though the respective quaternary structures differ markedly and structural differences in the active site explain the observed loss of decarboxylase activity. The crystal structure of MppR in the presence of a mixture of pyruvate and 4-imidazolecarboxaldehyde shows that MppR catalyzes the aldol condensation of these compounds and subsequent dehydration. Surprisingly, the structure of MppR in the presence of a mixture of "4-hydroxy-2-ketoarginine" and "2-ketoenduracididine" shows only the correct 4R-enantiomer of "2-ketoenduracididine" bound to the enzyme. These data, together with bioinformatic analysis of MppR homologs, identifies a novel family within the acetoacetate decarboxylase-like superfamily with divergent active site structure and, consequently, biochemical function.
Keywords: Nonribosomal peptide antibiotics, mannopeptimycin, acetoacetate decarboxylase, pyruvate aldolase
Bacterial strains resistant to multiple classes of antibiotics, including last-line drugs like vancomycin, are a serious threat to public health (1–3). Meeting this challenge requires the development of new antibiotics, preferably with modes of action that are distinct from those of current antimicrobial compounds. The mannopeptimycins are a group of five related cyclic glycopeptide antibiotics produced by Streptomyces hygroscopicus NRRL 30439 that are active against an array of Gram-positive pathogens (4–6). Of particular importance is the activity of the mannopeptimycins against methicillin-resistant Staphylococcus aureus (MRSA) and vancomycin-resistant enterococci (VRE). Mannopeptimycins exert their antimicrobial effects by interfering with cell wall biosynthesis, though in a manner distinct from that of glycopeptide antibiotics like vancomycin (7). The activity and novel mode of action of these compounds have led to significant interest in developing this scaffold into a clinically-viable antibiotic (8–11).
Drug development efforts based on the mannopeptimycin scaffold will benefit from a ready supply of enduracididine (End). Furthermore, this unusual building block may be incorporated into semisynthetic non-ribosomal peptide derivatives to give products with altered biological activity. While a synthetic route to β-hydroxyenduracididine (βhEnd) has been developed (12), an enzymatic or chemo-enzymatic approach to End production may ultimately be more facile, economical, and environmentally friendly.
The mannopeptimycins (e.g. mannopeptimycin α, Figure 1, 1) consist of a cyclic hexapeptide core that is glycosylated with mannose residues at two positions (13). A distinguishing feature of these compounds is the presence of βhEnd (Figure 1, 2), which is derived from hydroxylation of End (Figure 1, 3) (14). However, the biosynthetic origin of End (and thus βhEnd) is currently unknown. The presence of a guanidinium group, together with evidence from feeding experiments using radio-labeled amino acids (15), suggest that End derives from arginine. Furthermore, comparison of the mannopeptimycin biosynthetic cluster with that of the unrelated lipodepsipeptide enduracidin, which also contains multiple End residues, shows that there are only three pairs of proteins with high sequence identity: MppP/EndP (80%), MppQ/EndQ (68%), and MppR/EndR (75%) (5, 16). Finally, the enzyme MppO, which has no homolog in the enduracidin biosynthetic cluster, has been demonstrated to be a non-heme iron, α-ketoglutarate-dependent oxygenase that selectively hydroxylates the β-carbon of free End (14). Thus, the gene products MppP, MppQ, and MppR are thought to act through an unknown series of steps to transform arginine to enduracididine.
Figure 1.
Chemical structures of mannopeptimycin β (1), β-hydroxy-enduracididine (2), L-enduracididine (3), L-capreomycidine (4), guanidinoacetaldehyde (5), imidazole 4-carboxaldehyde (6), 2-oxo-3-(imidazol-5-yl)pyruvate (7), and 3-[(4R)-2-iminoimidazolidin-4-yl]-2-oxopropanoic acid (8).
Analysis of the MppP, MppQ and MppR amino acid sequences reveals that MppP and MppQ are likely both pyridoxal-5'-phosphate (PLP)-dependent aminotransferases and that MppR has some similarity to members of the acetoacetate decarboxylase-like superfamily (ADCSF; sequence identity: 17–25%). The non-proteinogenic amino acid capreomycidine (Figure 1, 4), which is analogous to End in that it is a cyclized form of arginine, is produced by two enzymes: a non-heme iron oxygenase (e.g. VioC) and a PLP-dependent aminotransferase (e.g. VioD) (17). VioC hydroxylates L-Arg at the β-carbon to give (2S,3S)-β-hydroxyarginine (18). VioD uses a PLP cofactor to catalyze an intramolecular β-elimination/replacement reaction to give 2S,3R-capreomycidine (19). Given that the MppPQR system lacks an oxygenase and includes two apparent aminotransferases, it is clear that End and βhEnd must be produced by a different chemical logic than capreomycidine. If arginine is the biosynthetic precursor of End, how the oxidation takes place is an open question. As MppO does not oxidize arginine (14), this question remains open for both the mannopeptimycin and enduracidin biosynthetic clusters.
The exact nature of this chemical logic, however, is not immediately obvious. The most glaring question concerns the role of an enzyme belonging to the ADCSF. The only characterized members of the ADCSF are confirmed acetoacetate decarboxylases (ADCs)—what we term the "classical" ADCs (20). The unique feature of these enzymes is the presence in the active site of a lysine side chain with a pKa of 6.5—a shift of ~4 pH units from the solution value (20, 21). The structure of Chromobacterium violaceum ADC (CvADC) indicates that the markedly hydrophobic environment of the active site is responsible for this dramatic pKa perturbation (22). The catalytic mechanism involves the formation of a Schiff base between the nucleophilic lysine amino group and the β-carbon of acetoacetate (20). Decarboxylation is encouraged by the juxtaposition of two negatively-charged glutamate side chains and facilitated by the Schiff base (22). The imine is then hydrolyzed to release acetone and regenerate the active form of the enzyme. With no clear rationale for the involvement of a decarboxylase in End biosynthesis, it appears likely that MppR has some other catalytic function. Consistent with this, a preliminary comparative genomics analysis of the ADC fold specifically supports a functional shift from decarboxylase activity for MppR and its closest homologs, while suggesting that the ADC fold represents a scaffold with diverse substrate specificities and biochemical functions. In order to gain a better understanding of the role of MppR in End biosynthesis and its relationship to acetoacetate decarboxylase enzymes, the structures of MppR with the buffer HEPES bound, as well as the covalent complexes with pyruvate, 2-oxo-3-(imidazol-5-yl)pyruvate (Figure 1, 7), and 3-[(4R)-2-iminoimidazolidin-4-yl]-2-oxopropanoic acid (Figure 1, 8; i.e. "2-ketoenduracididine") have been determined. The structures and preliminary biochemical characterization suggest a possible function of MppR in End biosynthesis.
Materials and Methods
Cloning, Expression, and Purification of MppR
The coding sequence of MppR was optimized for expression in E. coli and synthesized by GenScript Inc (Piscataway, NJ). This synthetic gene was sub-cloned into the pE-SUMOkan expression vector (LifeSensors Inc, Malvern, PA) using primers containing Bsa I and Xba I restriction sites (forward: 5'-GGTCTCAAGGTATGGAAAACCTGTACTTTCAGG-3'; reverse: 5'-GCTCTAGATCATTAGCTACGCACACCGATTT-3'). The His6-tagged SUMO-MppR fusion protein was expressed from E. coli BL21 Star (DE3) cells (Invitrogen Inc, Carlsbad, CA) carrying the pE-SUMO-MppR plasmid. Cultures were grown in 2 L of Luria-Bertani medium (50 µg/mL kanamycin) at 37 °C to an OD600 of 0.8–1.0, at which point protein expression was induced with 0.4 mM IPTG. The temperature was reduced to 25 °C and the cultures were grown overnight. Cells were harvested by centrifugation, resuspended in 5 mL/g of buffer A (25 mM TRIS pH 8.0, 300 mM NaCl, 10 mM imidazole) supplemented with 1 mg/mL hen egg lysozyme, and stored overnight at −20 °C. The cell suspension was thawed at RT for 2 hours and then treated with 0.1 mg/mL DNAse I (Worthington Biochemical Corp., Lakewood, NJ). The lysate was clarified by centrifugation at 39,000 × g for 45 minutes and then applied to a 5 mL HisTrap column (GE Lifesciences, Piscataway, NJ) at a flow rate of 5 mL/min to isolate the His6-SUMO-MppR fusion protein. The protein was eluted by a 4-step gradient of buffer B (25 mM TRIS pH 8.0, 300 mM NaCl, 250 mM imidazole). The His6-SUMO-MppR fusion protein eluted in the third and fourth steps and was ~90 % pure. Peak fractions were pooled and dialyzed overnight against 3.5 L of 25 mM TRIS pH 8.0, 150 mM NaCl in the presence of ~2 µM SUMO protease (LifeSensors Inc). The dialysate was passed through the HisTrap column a second time to remove the cleaved His6-SUMO tag as well as the protease. The resulting MppR preparation was > 95% pure, as judged on coomassie-stained SDS-PAGE gels. Selenomethionine-labed MppR was purified using the same protocol, except that SelenoMethionine Medium Complete (Molecular Dimensions, Newmarket, Suffolk, UK) was used as the growth medium rather than LB and T7 Express Crystal cells (New England Biolabs, Ipswitch, MA) were used in place of the BL21 Star (DE3) cells.
Activity Tests with Arginine and Hydroxylated Derivatives
MppR (10 µM) was incubated with 1 mM L-Arg, (3S)-hydroxy-L-Arg, (3R)-hydroxy-L-Arg, (4R,S)-hydroxy-L-Arg, or 2-oxo-5-guanidinovaleric acid for 6 hours at 25 °C. The protein was removed by ultra-filtration using Amicon centrifugal filter devices. Prior to HPLC analysis, 50 µL aliquots of the reactions containing amino acid substrates were mixed with 50 µL of 80 mM LiCO3−, 70 µL of acetonitrile (MeCN) and 30 µL of 5 mM dansyl chloride (DNS-Cl) in MeCN. After incubating for 30 minutes at 50 °C, 40 µL of 2 % ethylamine were added to consume unreacted DNS-Cl. The samples were incubated for 10 minutes at 25 °C. The reaction containing 2-oxo-5-guanidinovaleric acid was derivatized with OPD by mixing 100 µL of the reaction mixture with 100 µL of water and 100 µL of 100 mM OPD in 2 N HCl. The samples were incubated at 80 °C for 20 minutes, then cooled on ice for 5 minutes. All derivatized samples were centrifuged at 30,000 rcf for 10 minutes prior to HPLC analysis. Aliquots of the supernatants (5–10 µL) were analyzed on an Agilent 1220 Infinity HPLC with a 4.6 × 150 mm Aquasil C18 column (Thermo Scientific). The separation was done at 50 °C using a linear gradient from 2 to 50 % MeCN in 0.1 % TFA/water over 14 minutes at a flow rate of 0.5 mL/min.
Size Exclusion Chromatography
The solution molecular weight of MppR was estimated by gel filtration chromatography using a Superdex S-200 10/300 column (GE Lifesciences) and a mixture of molecular weight standards including rabbit muscle aldolase (156.8 kDa), Saccharomyces cerevisiae alkaline phosphatase (126.0 kDa), and Streptomyces rubiginosus xylose isomerase (86.4 kDa). To define the standard curve, each protein was dissolved at 10 mg/mL in 50 mM NaPO4, 150 mM NaCl, pH 7.0. The mobile phase consisted of 50 mM NaPO4, 150 mM NaCl, pH 7.0 and the separation was performed on 250 µL of the standard mixture at a flow rate of 0.5 mL/min and 4 °C. A sample of MppR (12 mg/mL) was run separately over the same column, and under identical conditions.
Labeling with 5-Nitrosalicylaldehyde (2-Hydroxy-5-nitrobenzaldehyde)
MppR (380 µM) was incubated with 5 mM 5-nitrosalicylaldehyde (5-NSA) for 15 minutes at 25 °C. NaBH4 was added to a final concentration of 10 mM to reduce the aldimine and trap the p-nitrophenol reporter group on the enzyme. Presence of the reporter was verified by UV spectroscopy on a sample of labeled protein after thorough buffer exchange to remove unreacted 5-NSA.
Preparation of 2-oxo-acids
The 2-oxo acids used in this study were prepared by a modification of the procedure first described by Meister (23). For example, 2-oxo-5-guanidinovaleric acid was prepared by dissolving 2 g of L-Arg in 50 mL of water and the pH was adjusted to 7.2 with concentrated HCl. Next, 100 mg of Crotalus atrox L-amino acid oxidase (LAAO) and 1.5 mg of catalase were dissolved in 10 mL of water and added to the amino acid solution. The volume was adjusted to 100 mL with water and the reaction was incubated in the dark at RT with vigorous stirring. After 18 hours, the 100 mL reaction was concentrated to 7 mL at 50 °C under vacuum. The 2-oxo acid crystallized from the concentrated reaction mixture at 4°C after approximately one week. The crystalline material was harvested, dried, and stored at −20°C. The sample of 2-oxo-4R/S-hydroxy-5-guanidinovaleric acid (i.e. "4-hydroxy-2-ketoarginine") was prepared similarly, starting from 4R/S-hydroxy-L-Arg. Both compounds were characterized by NMR and found to be predominantly the desired product (see Supplementary Information).
Crystallization, Structure Determination, and Model Refinement
Initial crystallization conditions were identified by screening 14 mg/mL MppR against the Index HT screen (Hampton Research). After optimizing the most promising hits, diffraction-quality crystals were obtained by the hanging drop vapor diffusion method from 25–30 % polyethylene glycol (PEG) 3,350, 0.2 M (NH4)2SO4, and 1–10 mM HEPES pH 7.5. Drops contained 2 µl of protein solution at 12–18 mg/mL (~380–750 µM) and 1 µl of crystallization solution. Crystals appeared after 3–4 days and grew to maximum dimensions of ~300 × 300 × 100 µm. Crystals of selenomethionine (SeMet)-substituted MppR were grown using the same conditions, but were significantly smaller (~100 × 100 × 20 µm). Crystals were harvested from the hanging drops, coated with LV Cryo Oil (MiTeGen, LLC) and flashed-cooled by plunging in liquid nitrogen. Structures of MppR with compounds 7 and 8 (Figure 1) bound were obtained by transferring crystals of native MppR into 30 µL drops of soaking solution containing 36 % PEG 3,350, 0.18 M (NH4)2SO4, and either of 30 mM pyruvate/50 mM 6, or < 300 mM 8. After soaking for 2–24 h, crystals were coated with LV Cryo Oil and flash-cooled. X-Ray diffraction data for SeMet MppR and MppR·HEPES were collected at beamline 21-ID-D of the Life Science Collaborative Access Team (LS-CAT) at the Advanced Photon Source (APS). The MppR·pyruvate, MppR·7, and MppR·8 data sets were collected at LS-CAT beamline 21-ID-G. All crystals were screened for diffraction quality using the rotating anode X-ray source at Marquette University. Data were processed with HKL2000 (24) or MOSFLM (25, 26) and SCALA (27) of the CCP4 Program Suite (28).
The structure of MppR was determined by the single-wavelength anomalous diffraction (SAD) method using 2.2 Å-resolution data collected from a crystal of SeMet-substituted MppR at 0.97625Å, 41 eV above the tabulated K-edge wavelength for Se (0.97950Å). The program autoSHARP (29) was used to solve the Se substructure, which contained 12 of the 14 Se atoms in the asymmetric unit, and calculate high-quality, density-modified phases. An initial model comprising ~85 % of the asymmetric unit contents was built automatically using the PHENIX package (phenix.autobuild, (30, 31)). After iterative cycles of manual model building in COOT (32) and maximum likelihood-based refinement using the PHENIX package (phenix.refine, (33)), ordered solvent molecules were added automatically in phenix.refine and culled manually in COOT. Hydrogen atoms were added to the model using phenix.reduce (34) and were included in the later stages of refinement to improve the stereochemistry of the model. Positions of H atoms were refined using the riding model with a global B-factor. For all models, except the original SeMet MppR model, regions for translation-libration-screw (TLS) refinement were identified using the TLSMD server (35) or phenix.find_tls_groups and the TLS parameters were refined in phenix.refine. Once the refinement converged (e.g. SeMet MppR: R=0.191, Rfree=0.212) the model was validated using the tools implemented in COOT and PHENIX (36, 37). Sections of the backbone with missing or uninterpretable electron density were not included in the final model. Side chains with poor or missing electron density were modeled in favored rotameric conformations; the B-factors were allowed to refine without additional restraints and the occupancies were held to be 1.0.
The final, refined model of SeMet MppR was stripped of water molecules, H atoms, and B-factor information (all B-factors set to 20.00) and used to determine the structures of MppR with HEPES, pyruvate, 7, or 8 bound by the difference Fourier method or by molecular replacement in PHASER (38). A similar refinement protocol was used for all of the models presented here. Restraints and coordinates for the modified Lys156 residue in the liganded structures were generated with phenix.elbow (39) and added to the model in COOT. Data collection and model refinement statistics for all structures are listed in Table 1. Coordinates and structure factors have been deposited in the Protein Data Bank (www.rcsb.org) with accession codes 4JM3, 4JMC, 4JMD, and 4JME.
Table 1.
Crystallographic data collection and refinement statistics.
| SeMet MppR | MppR·HEPES | MppR·Pyr | MppR·7 | MppR·8 | |
|---|---|---|---|---|---|
| Resolution (Å) | 34.33–2.20 | 34.26–1.85 | 34.30–1.72 | 41.87 – 1.67 | 33.28 – 1.70 |
| (last shell)a | (2.28–2.20) | (1.95–1.85) | (1.81–1.72) | (1.73 – 1.67) | (1.73 – 1.70) |
| Wavelength (Å) | 0.97625 | 0.97950 | 0.97856 | 0.97856 | 0.97856 |
| No. of reflections | |||||
| Observed | 321309 (21370) | 323463 (46672) | 606665 (87494) | 645358 (43289) | 373589 (17118) |
| Unique | 31470 (3019) | 52362 (7559) | 65173 (9382) | 71344 (6515) | 67228 (3321) |
| Completeness (%)a | 99.7 (97.4) | 100.0 (100.0) | 100.0 (100.0) | 99.1 (91.8) | 99.9 (100.0) |
| Rmerge (%)a,b | 0.092 (0.401) | 0.109 (0.320) | 0.063 (0.468) | 0.062 (0.475) | 0.048 (0.420) |
| Multiplicity | 10.2 (7.1) | 6.2 (6.2) | 9.3 (9.3) | 9.0 (6.6) | 5.6 (5.2) |
| <I/σ(I)>a | 24.2 (4.7) | 9.4 (4.2) | 16.7 (4.1) | 26.3 (3.5) | 39.2 (3.8) |
| Figure-of-merit | 0.348 | ||||
| Phasing power | 1.111 | ||||
| Model Refinement Statistics | |||||
| Reflections in work set | 31436 | 51432 | 65133 | 67732 | 63304 |
| Reflections in test set | 1559 | 1958 | 3298 | 3573 | 3318 |
| Rcryst (Rfree) | 0.191 (0.212) | 0.145 (0.176) | 0.157 (0.179) | 0.156 (0.177) | 0.150 (0.173) |
| No. of non-H atoms | 4493 | 4565 | 4514 | 4768 | 4544 |
| No. of solvent atoms | 460 | 555 | 504 | 629 | 597 |
| Number of TLS groups | NA | 24 | 13 | 17 | 18 |
| Average B-factor (Å2) c | |||||
| Protein atoms | 26.7 | 22.2 | 26.0 | 25.3 | 22.9 |
| Ligand atoms | 28.3 | 16.4 | 26.9 | 29.5 | 23.9 |
| Solvent | 35.9 | 29.3 | 31.3 | 36.8 | 33.8 |
| RMS deviations | |||||
| Bond lengths (Å) | 0.022 | 0.010 | 0.009 | 0.010 | 0.010 |
| Bond angles (°) | 1.886 | 1.267 | 1.258 | 1.307 | 1.297 |
| Coordinate error (Å) | 0.30 | 0.15 | 0.16 | 0.22 | 0.14 |
Values in parentheses apply to the high-resolution shell indicated in the resolution row.
R = Σ(||Fobs|-scale*|Fcalc||) / Σ |Fobs|.
Isotropic equivalent B factors, including contribution from TLS refinement.
Results and Discussion
Biochemical Characterization
Recombinant S. hygroscopicus MppR was expressed in E. coli as a His6-SUMO fusion protein and purified using conventional chromatographic techniques. MppR expressed at high level in E. coli, yielding 20–30 mg of > 95% pure protein per liter of culture. The molecular weight was estimated to be 131.5 kDa by size exclusion chromatography, consistent with a tetramer of 32.2 kDa protomers (128.8 kDa). ADC, in contrast, is dodecameric with a solution molecular weight of ~330 kDa (22, 40).
Given the sequence identity between ADC and MppR, the first experiments undertaken were aimed at determining whether or not these two proteins share the same catalytic activity. The catalytic Lys115 of ADC is known to react with 5-nitro-salicylaldehyde (5-NSA) to give a Schiff base that can be trapped by reduction of the imine with NaBH4 to give 2-hydroxy-5-nitrobenzyl-ADC (41, 42). Treatment of MppR with 5-NSA and NaBH4 at pH 6.5 results in stable labeling of the enzyme with the p-nitrophenol reporter group as verified by UV spectroscopy (Figure S1). After removing unreacted 5-NSA by buffer exchange, the sample retained an intense yellow color and a strong UV absorbance peak at 405 nm. This result indicates that MppR, like ADC, contains a reactive lysine side chain in the active site that is deprotonated to a significant extent at pH 6.5.
MppR was assayed for acetoacetate decarboxylase activity by monitoring the disappearance of the enolate form of acetoacetate as described (20). There was no change in absorbance at 270 nm after 60 min, indicating that acetoacetate is not a substrate for MppR. Since enduracididine is hypothesized to originate from L-Arg, MppR was tested for activity against L-Arg, (3S)-hydroxy-L-Arg, (3R)-hydroxy-L-Arg, (4R,S)-hydroxy-L-Arg, and 2-oxo-5-guanidinovaleric acid (i.e. α-keto-Arg). After six hours at 25 °C, reaction mixtures were derivatized with dansyl chloride (DNS) or o-phenylenediamine (OPD) and analyzed by HPLC. Comparison of the reactions with authentic standards and controls lacking MppR indicates that none of the compounds tested are substrates for MppR.
Overall Structure of MppR
MppR crystallized from a solution containing 25–30 % polyethylene glycol 3,350, 0.2–0.3 M (NH4)2SO4, and 1–100 mM HEPES, pH 7.5 in space group P3121 (unit cell dimensions a = b = 109.8 Å, c = 87.8 Å) with two molecules in the asymmetric unit. Experimental phases were obtained to 2.2 Å using single-wavelength anomalous diffraction from crystals of the selenomethionine-substituted enzyme (Table 1). The resulting model was used to phase a higher-resolution (1.9 Å) data set by molecular replacement. The final model of MppR·HEPES contains 257 (monomer A) or 259 (monomer B) of 302 total amino acids, 589 water molecules, and 2 HEPES molecules. At the N-terminus, 31–33 residues are disordered, as well as 7 at the C-terminus and an additional 5 from a surface loop (residues 226–231). There are no Ramachandran outliers in the model, and 99.2% of the residues are in the favored regions of the Ramachandran plot. The protein chains from the SeMet MppR model were used to obtain initial phases for the structures of MppR described below.
The overall fold of MppR is similar to the β-cone (22) or double-barrel (43) fold observed in acetoacetate decarboxylases. The structure contains a high proportion of β-strand (45%) and relatively little α-helix (11%). The 13 β-strands are organized into one discontinuous, antiparallel β-sheet with a compound fold that forms a large barrel (light blue in Figure 2A) and a second, smaller barrel-like feature othogonal to the large barrel (green in Figure 2A). The large barrel is closed at one end by the longest α-helix in the structure (α3, residues 142–152). At the opposite end, residues 205 through 214 form a loop with a short, 4-residue stretch of α-helix (α4). We dub this part of the structure the "entrance loop." Given its position at the open end of the large barrel, and the predominance of coil structure in this region, it is plausible that this part of the structure may control access to the active site. While the B-factors of residues 205–214 are comparable to the average B-factor of the model, the observations that (1) the opening to the barrel is too small (ca. 6.5 × 4.5 Å) to allow HEPES (ca. 6.5 × 5.0 Å) or a molecule of similar size into the barrel and (2) the residues in the entrance loop make indirect hydrogen-bonding interactions and/or direct packing interactions with bound ligands, suggest that this loop might be mobile in the absence of ligand. This may explain, at least in part, why MppR failed to crystallize in the absence of a bound ligand (e.g. HEPES). Finally, MppR possesses two pairs of protuberances (yellow and orange in Figure 2A) that are involved in oligomerization (see below).
Figure 2.
Ribbon representation of the MppR protomer (A), shown in three orthogonal orientations to highlight the dimerization and tetramerization loops (yellow and orange, respectively) and the “double-barrel” fold. The “entrance loop” (red) may play a role in gating access to the active site, which is identified by the position of Lys156 (shown as sticks with magenta carbon atoms). In the absence of α-keto acids, the enzyme crystallizes with a molecule of HEPES buffer bound. The stereoview (B) shows the 2|Fo|-|Fc| (magenta) and 2|Fo|-|Fc| simulated annealing composite omit (green) electron density maps, both contoured at 1.0 σ. The schematic (C) shows the interactions between HEPES and MppR, and the associated distances. This figure and subsequent figures showing crystallographic structures were rendered with the POVSCRIPT+ (54) modification of MOLSCRIPT (55) and POVRAY.
The MppR Active Site
The interior of the large barrel defines a flattened, elliptical pocket with a narrow opening exposed to solvent and, at the opposite end of the barrel, an anion-binding site and the lysine residue (Lys156) that aligns with the catalytic lysine side chain of ADC (Figure 2B). The environment of the presumed catalytic Lys156 is overwhelmingly hydrophobic. The nearest protein atoms to the lysine amino group are Met154 Sδ (4.1 Å), Val140 Cγ (4.2 Å), Phe116 Cε (4.5 Å), Gly149 Cα (4.6Å), Phe145 Cβ (5.1 Å), Ala138 Cβ (5.7 Å), Trp98 Cδ (6.5 Å), Leu120 Cδ (6.5 Å), Phe58 Cε (6.8 Å), and Arg148 Nε/Nη (6.9/7.0 Å). In this environment, the only potential hydrogen-bonding partners are Glu118 Oε (2.7 Å), Pro145 O (4.1 Å), and an ordered solvent molecule (3.0Å). The relative orientations of Glu118 and the solvent molecule suggest that only one would be able to form a hydrogen bond with Lys156 at any given time. Also, formation of the hydrogen bond with the carbonyl oxygen of Pro145 would require a reorientation of the lysine side chain that would likely preclude hydrogen-bonding interactions with Glu118 or the solvent molecule. As in acetoacetate decarboxylase, the hydrophobic active site environment and dearth of potential hydrogen-bonding interactions are likely responsible for the apparent pKa perturbation observed for Lys156. Given its propensity for binding negatively-charged species (see below), Arg148 is likely to be charged. A charge on this residue could be stabilized by hydrogen bonds with multiple solvent molecules. Also, the indole ring of Trp100 is ~3.5 Å from the guanidinium group and is suitably oriented to stabilize a positive charge via a π-cation interaction (44).
Each molecule of MppR contains one molecule of the HEPES buffer from the crystallization solution bound in the active site (Figure 2B, C). The buffer molecule appears to be tightly held, with an average B-factor of 26.9 Å2, agreeing well with the average B-factor of all protein atoms (26.0 Å2). The sulfonate group of the HEPES molecule makes a bidentate interaction with the guanidinium group of Arg148 (both 2.9 Å) and hydrogen-bonding interactions with the side-chain amide of Gln152 (3.1 Å) and the ε-amino group of Lys156 (3.1 Å). The piperizine nitrogens of HEPES make additional hydrogen-bonding interactions with Glu283 (2.7 Å) and, through a bridging water, with Glu118 (Glu-H2O: 2.7 Å, H2O-HEPES: 2.9 Å). One side of the piperizine ring is in a hydrophobic pocket defined by the side chains of Phe58 and Trp98 (4.0–4.2 Å), while the other side is exposed to the side chains of Met154 (4.0 Å) and Leu215 (3.9 Å) and a pair of water molecules (3.2–3.5 Å). Finally, the hydroxyethyl moiety of the buffer forms a hydrogen bond to the carboxylate of Glu283 (2.6 Å). Several attempts to determine the structure of MppR with no buffer bound by substituting different buffers in the crystallization solution resulted in failure to obtain crystals or in structures with alternative buffers bound (e.g. TRIS, data not shown).
Quaternary Structure
The two monomers in the crystallographic asymmetric unit are arranged as a dimer (Figure 3A) mediated by the "small interface loops" (residues 100–114 and 166–176, orange in Figure 2A). The resulting interface buries ~1300 Å2 of surface area (determined using PISA (45)) and consists of predominantly van der Waal's contacts with only 6 hydrogen bonds and no ionic interactions. This dimer is related to a second dimer by the crystallographic two-fold rotation axis to form a tetramer. This interface is mediated by the "large interface loops" (residues 34–55 and 218–235, yellow in Figure 2A), which form extensive domain-swapping contacts that bury an additional ~2,400 Å2 of surface area. This interface is also dominated by van der Waal's contacts, having only 4 hydrogen bonds and no ionic interactions. The extensive buried surface area and specific intermolecular interactions strongly suggest that the tetramer observed in the crystal represents the true oligomeric state of MppR and is not a consequence of crystallization. A tetramer would also be consistent with the SEC results above. Further evidence for the physiological significance of the observed MppR tetramer comes from the structure of the putative acetoacetate decarboxylase from Methanocelleus marisnigri (PDB ID: 3CMB; Joint Center for Structural Genomics, unpublished), which exhibits an almost identical quaternary structure.
Figure 3.
The ribbon representation of the MppR “dimer” (A) is very similar to the corresponding unit of Chromobacterium violaceum acetoacetate decarboxylase (B; PDB ID: 3BGT). The two protomers of each protein shown are colored light blue and light yellow, while the respective interface loops are colored orange and dark blue. Both interfaces are comprised almost entirely of hydrophobic interactions. The full biological units of MppR (C) and CvADC (D), on the other hand, are quite different. The functional significance, if any, of the tetrameric vs. dodecameric quaternary structure is currently unclear.
Comparison to Acetoacetate Decarboxylase
In spite of the modest sequence identity with CaADC (PDB ID: 3BH2) and CvADC (PDB ID: 3BGT), the tertiary structure of MppR is extremely similar to those of the classical ADCs (Figure 3A–B). Least-squares fitting with SSM (46) results in root mean square deviations (RMSD) for Cα atoms of 2.3 Å and 1.9 Å for CaADC and CvADC, respectively. The interface mediated by the "large interface loops" of MppR is also highly conserved (Figure 3A–B). This is not unexpected, since in ADC this interface was shown to be important for maintaining the catalytic lysine in the hydrophobic active site. The lysine side chain adjacent to the catalytic lysine (Lys116, CaADC numbering) projects into the interface, acting as a physical wedge to prevent the catalytic lysine from backing out of the active site (22). Mutating Lys116 of CaADC to cysteine abolished ADC activity (20), presumably because the catalytic lysine was out of position. Chemical rescue of the Lys116Cys mutant with 2-bromoethylamine restored ADC activity. It was thought, in the absence of structural data, that the positive charge adjacent to the catalytic lysine was responsible for the pKa perturbation of that residue. However, the structural data of Ho, et al indicate that the recovery was actually due to the increased length of the resulting S-(2-aminoethyl)-cysteine side chain forcing the catalytic lysine back into position. Interestingly, the residue adjacent to the catalytic Lys156 of MppR is Gln157 and, contrary to ADC, the environment around this side chain is not crowded by the adjacent protomer. In fact, Gln157 interacts primarily with solvent, and so does not seem to be acting as a wedge analogous to Lys116 of CaADC. It may be that the role of the residue adjacent to the catalytic lysine in this fold is more complicated, or that the wedge mechanism at work in the classical ADCs is not universal.
While one interface is conserved between MppR and the classical ADCs, the higher-order quaternary structures are very different. Whereas 6 copies of the ADC dimer trimerize to give a homododecamer, two copies of the MppR dimer associate back-to-back to form a homotetramer (Figure 3C–D). The structural basis for this difference is clear: one of the “small interface loops” of MppR disrupts the packing of the "trimerization platform" observed in the classical ADCs (22). The functional significance of the higher-order quaternary structures (tetrameric vs. dodecameric) is less clear, and its elucidation will require additional structural and biochemical data from MppR-like and classical ADCSF enzymes.
As expected from the high degree of structural similarity between MppR and ADC, the active sites of both enzymes are strikingly similar (Figure 4). Both active sites, as discussed earlier, are highly hydrophobic, both contain a lysine side chain with an apparently perturbed pKa, and both contain a glutamate side chain near the catalytic lysine (MppR: Glu118, CvADC: Glu77). Perhaps more informative than the similarities are the subtle, but important, differences between the two active sites. First, Glu62 of CvADC, which is adjacent to Glu77 in that active site and is important for the decarboxylase activity (22), has been replaced by valine (Val94) in MppR. The position occupied by the guanidinium group of CvADC Arg30, which coordinates the carboxylate group of acetoacetate, is occupied by the indole ring of Trp98 in MppR. The aliphatic Leu232 of CvADC is replaced by Glu283 in MppR (the carboxylate is 9.0 Å from the catalytic Lys156). Finally, the conserved Gly107 and Trp112 of CvADC are replaced by Arg148 and Gln152, respectively, in MppR. These two hydrophobic-to-polar substitutions create a polar sub-site in the active site that we propose as a carboxylate-binding site (see below). These amino acid substitutions provide a structural rationale for the lack of decarboxylase activity in MppR. The positive charge that positions the carboxylate of acetoacetate in ADC (Arg30) is missing in MppR, which has the (presumably) positively-charged Arg148 closer to, and on the opposite side of, the catalytic lysine residue. The Glu62 side chain of CvADC that helps create an electrostatic environment that favors decarboxylation is lost in MppR, where it is replaced by the much smaller, hydrophobic valine side chain. Clearly these few changes result in significant changes in the electrostatic environment of the active site.
Figure 4.
Superposition of the active sites of MppR and CvADC (PDB ID: 3BGT). MppR is colored light blue, while CvADC is colored grey. Conserved residues are denoted by black labels. Unconserved positions are labeled with both the MppR (blue) and CvADC (grey) residues.
Structures of MppR complexes with α-keto acids
Since acetoacetate was not a substrate for MppR and the side chain of Arg148 is quite close to the putative nucleophilic ε-amino group of Lys156, we hypothesized that MppR might react with α-keto acids. This is based primarily on the observation that the carboxylate, which we envisioned interacting with the guanidinium of Arg148 analogous to the sulfonate group of HEPES, is too far from the ketone in β-keto acids to permit favorable alignment of the reacting groups. The crystal structure with HEPES bound suggested that a shorter α-keto acid might fit well. To test this possibility, crystals of MppR were soaked in a solution containing PEG 3,350, (NH4)2SO4, and 30 mM sodium pyruvate for 24 hours. Data were collected from a pyruvate-treated crystal and the resulting structure showed pyruvate covalently bound to Lys156 as the Schiff base (Figure 5). Unlike the MppR·HEPES structure, the main chain carbonyl of Pro145 is near enough (2.9 Å), and in a suitable orientation to form a hydrogen bond to the ζ-nitrogen of Lys156, which could stabilize the complex in the iminium form. The α-carboxylate of pyruvate makes a bidentate interaction with Arg148 (2.8 and 2.9Å) and a hydrogen-bonding interaction to the amide nitrogen of Gln152 (3.1 Å). These two residues constitute a carboxylate-binding subsite in the MppR active site. The β-carbon of pyruvate is oriented toward Glu118 and the opening of the active site. Structures were determined using data collected from crystals that had soaked for two weeks in solutions containing pyruvate, and from 3-week-old crystals grown in the presence of pyruvate (data not shown). In both cases the enzyme was in the Schiff base form, suggesting that the imine is long-lived. However, because hydrolysis of the Schiff base would simply regenerate the α-keto acid, we cannot distinguish between kinetic and thermodynamic stability of this complex. Still, the environment around the imine is hydrophobic and quite crowded, which would make it difficult for a water molecule to gain access to the α-carbon. Indeed, in almost every Schiff base complex structure we examined, there are no water molecules proximal to the imine. Thus, it would not be surprising to learn that the rate of hydrolysis of the Schiff base is very slow.
Figure 5.
Stereoview of the MppR·Pyruvate Schiff base complex (A) showing the experimental 2|Fo|-|Fc| electron density (magenta) and the simulated annealing composite omit electron density (green), both contoured at 1.0 σ. The schematic of the complex (B) shows potential interactions and their distances (in Å). Pyruvate is covalently bound to the enzyme as indicated by continuous electron density between Lys156 and the alpha carbon of pyruvate. The side chain of pyruvate is too short to unequivocally differentiate between the imine and enamine forms of the Schiff base. This result suggested that MppR takes an α-keto acid as its substrate.
The structure of the MppR·pyruvate complex led to a series of structures of MppR in complexes with other α-keto acids in order to probe substrate specificity and the possible roles of key residues near the catalytic Lys156. Eight of the nine α-keto acids tested formed a stable Schiff base complex with the enzyme (data not shown). In the interest of clarity and brevity, only the complexes of MppR with 7 and 8 will be discussed in detail.
If an MppR Schiff base with an α-keto acid is involved in the formation of End, compounds 10, and/or 11 are plausible participants in the reaction (Figure 6). These compounds both resemble aldol condensation products between pyruvate and guanidinoacetaldehyde (Figure 1, 5). Such an aldol process could, in principle, lead to End, however a source of guanidinoacetaldehyde (5) is unknown. The reverse reaction, cleavage of 2-oxo-4-hydroxyarginine (10) to pyruvate and 5 has been proposed in the synthesis of the antibiotic 2-aminoimidazole by Streptomyces eurocidicus (47). Based on these proposed intermediates, 4-imidazolecarboxaldehyde (Figure 1, 6) was chosen as a stable analog of 5. To test the hypothesis that MppR might be capable of catalyzing an aldol condensation, we determined the structure of MppR in the presence of both pyruvate and 6.
Figure 6.
Hypothetical enduracididine biosynthetic scheme starting from L-Arg (top). The structures reported here suggest that the biosynthesis could also originate with a metabolite of Arg, like guanidinoacetaldehyde (5) and pyruvate. The bottom row shows a possible synthetic route to 5 and potential byproducts.
Crystals of MppR were soaked for 2.5 hours in a solution containing 30 % PEG 3,350, 0.3 M (NH4)2SO4, and 30 mM pyruvate, after which an equal volume of 100 mM 6 in PEG/(NH4)2SO4 was added. After an additional 4.0 hours, crystals were flash-cooled and diffraction data were collected. The resulting structure showed the condensation product 2-oxo-3-(imidazol-5-yl)pyruvate (Figure 1, 7) bound covalently to Lys156 (Figure 7). The absence of the aldehyde oxygen and the −178.0 ° torsion angle defined by C2-C3-C4-C5 (Figure 7B) is consistent with the observed condensation product being unsaturated. The pyruvate moiety is identical to the MppR·Pyruvate complex. When the MppR·7 and MppR·Pyruvate complexes are superimposed using all Cα atoms, the 5 pyruvate-derived atoms (which do not influence the superposition) overlay with a 0.20 Å root mean square deviation. The imidazole moiety makes a single hydrogen bond to Glu118 (Figure 6B; 3.0 Å), one of the presumed key Schiff base-forming residues based on analogy to ADC. The other imidazole nitrogen atom is close to Glu283, but is too far away (3.7 Å) and likely in the wrong protonation state to make a hydrogen-bonding interaction with that residue. The observed aldol condensation and subsequent dehydration appear to be catalyzed by MppR, since no 7 could be detected by HPLC analysis of crystal-soaking solution that had not been treated with enzyme after standing overnight at RT. Thus, we have demonstrated the first observed enzymatic activity for MppR (Scheme 1). Preliminary kinetic analysis (see supplementary information) suggests that the reaction obeys Michaelis-Menten kinetics, though the KM value for 6 is very high (> 50 mM). We did not observe saturation of the reaction rate below the solubility limit of 6.
Figure 7.
Stereoview of the MppR·7 Schiff base complex (A) showing the experimental 2|Fo|-|Fc| electron density (magenta) and the simulated annealing composite omit electron density (green), both contoured at 1.2 σ. The schematic of the complex (B) shows potential interactions and their distances (in Å). Note the continuous electron density between 7 and Lys156 Nζ, indicating that 7 is covalently bound to the enzyme. The torsion angle defined by C2-C3-C4-C5 is −178.0 °, which puts all four atoms in the same plane. Thus, this structure is most likely in the imine form.
Scheme 1.

However, we were also not able to identify any 7 in overnight reactions containing 10 µM MppR, 30 mM pyruvate, and 100 mM 4-imidazolecarboxaldehyde—a surprising observation in light of our crystallographic data. Further experiments with pyruvate and the alternative aldehyde substrate 3-(2-furyl)acrolein, the condensation product of which is bright yellow and absorbs UV light at 360 nm, cleared up this apparent paradox. In these tests, high concentrations of MppR (> 100 µM) were required in order to accumulate enough of the condensation product to give a measurable absorbance signal, suggesting that the condensation product remained covalently linked to the enzyme. This suspicion was confirmed after filtering the reaction mixture through a centrifugal concentrator to remove the enzyme. The yellow color remained trapped in the concentrator, while the filtrate lacked any absorbance at 360 nm, indicating that the product remained bound to the enzyme. It is currently unclear why the rate of hydrolysis of this Schiff base complex is so slow (it is stable for at least 1 day), but it may be that substance 7 mimics an intermediate, 11a (Figure 6), that must be converted to 8, with concomitant conformational changes of individual active site residues and/or larger structural elements, before it can be released from the enzyme. It is also possible that the observed aldol condensation is a side-reaction resulting from the presence of the Schiff base with pyruvate and is not relevant to End biosynthesis.
Based on analogy to capreomycidine biosynthesis, it is possible that 4-hydroxy-arginine is an intermediate in End biosynthesis, and thus a potential substrate for MppR. To determine if MppR can react with 4-hydroxy-arginine, a diastereomeric mixture of 4R/S-hydroxy-2S–arginine was treated with Crotalus atrox L-amino acid oxidase (LAAO) (48) to convert the 2S–diastereomers to the corresponding α-keto acids (Figure 6, 10). The LAAO was removed from the solution using a centrifugal concentrator and the filtrate was evaporated to an amber residue in a rotary evaporator at 50 °C. When crystals of MppR were soaked in a solution containing this material at a nominal concentration of 300 mM (assuming all of the material was 10), the resulting crystal structure clearly showed 3-[(4R)-2-iminoimidazolidin-4-yl]-2-oxopropanoic acid (Figure 1, 8; i.e. "2-keto-enduracididine") bound in the active site (Figure 8) as the Schiff base. The pyruvate moiety of 8 makes the same interactions with the enzyme as pyruvate (Figure 5B) and 7 (Figure 7B). The two nitrogen atoms in the iminoimidazoline ring of 8 make hydrogen bonding interactions with the carboxylate oxygen atoms of Glu283 (2.8 and 3.0 Å). Significantly, the chirality at C4 of 8 is that observed in the mannopeptimycins (6).
Figure 8.
Stereoview of the MppR·8 Schiff base complex (A) showing the experimental 2|Fo|-|Fc| electron density (magenta) and the simulated annealing composite omit electron density (green), both contoured at 1.2 σ. The schematic of the complex (B) shows potential interactions and their distances (in Å). As expected, 8 is bound covalently to Lys156 Nζ as the Schiff base imine. The fact that the enzyme selected the correct 4R enantiomer from a mixture of 4R/S 8 suggests that 8 (i.e. α-keto-enduracididine) may be the true product of MppR.
Comparison of the NMR spectrum of the LAAO reaction product used to prepare the crystal-soaking solution (Figure S2A) with that of the model compound 2,3-dihydroxypropylguanidinium (Figure S2B) shows that 10 is the major product from the LAAO reaction. Specifically, CH2N signals at δ3.35 and 3.22 ppm show geminal coupling constants of 14.4 Hz and vicinal coupling of 3.6 or 7.2 Hz, matching closely with the linear model compound. Since the sample used to soak crystals contained predominantly the linear molecule 10, the observation of the cyclized 8 bound to the enzyme suggests that MppR catalyzed the cyclization of 10 to produce the iminoimidazoline ring of enduracididine. It should be noted that the sample of 10 does contain some uncharacterized impurities (< 15%), so we cannot rule out the presence of a small amount of 8 in the crystal-soaking solution. Consequently, it is possible, though perhaps less likely, that MppR has simply selected keto acid 8 from the mixture rather than catalyzing its formation.
Catalytic implications and proposed function of MppR
The three structures of MppR in complexes with α-keto acids described here show that the Schiff base intermediate is shared between both the classical ADC enzymes and MppR. More importantly, the MppR·7 structure demonstrates that the enzyme can catalyze the aldol condensation and subsequent dehydration of pyruvate and 4-imidazolecarboxaldehyde (6) to give the corresponding enone 7 (if it were released from the enzyme). While it is not clear at this point that the condensation activity is relevant to the biosynthesis of End, it does raise the possibility that MppR may synthesize an oxidized arginine derivative suitable for cyclization to form the iminoimidazoline ring of End directly from pyruvate and guanidinoacetaldehyde (5). Finally, the fact that the R-enantiomer of 8 was trapped in the MppR·8 structure suggests that, whatever the origin of the presumed oxidized arginine intermediate, MppR may catalyze its cyclization to give the iminoimidazoline ring of 8.
Natural History and Genome Context Analysis of MppR-like domains
Comparative genome analysis of MppR and its closely-related orthologs defined a monophyletic family of MppR-like ADCSF enzymes clearly distinct from the classical ADCs (22) (see Supplemental Material and Methods). MppR family members are found in both bacteria and archaea. The distribution of MppR domains across archaeal lineages, including its presence in the deep-branching korarchaeota lineage (49), relative to their sporadic distribution in bacteria strongly suggests an archaeal origin for MppR-like enzymes followed by horizontal transfer into bacteria. Phylogenetic trees constructed for the family support this scenario (Figure S3), with strong grouping of archaeal sequences to the exclusion of bacterial sequences. Given that bacterial membership is concentrated in actinobacteria and only sporadically found in alpha-, beta-, and delta-proteobacteria and synergistetes; it is likely the transfer occurred initially into actinobacteria followed by subsequent dispersion to additional lineages through horizontal gene transfer events (Figure S3, File S1).
Analysis of the genome context of the MppR family reveals a near-constant association with a gene closely related to the classical aldehyde dehydrogenases comprised of a NAD-binding domain fused to a Rossmannoid catalytic domain (50). Several neighborhoods spanning both archaeal and bacterial lineages additionally associate with a distinct enzyme containing an oxidoreductase-like Rossmannoid domain. Further associations observed almost exclusively in bacteria occur with a variety of genes encoding enzymes likely acting on amino acid or amino acid-derived substrates, including PLP-dependent aminotransferases, GATase-like glutamine amidotransferases, and GlnA-like glutamine synthetases. These associations suggest that archaeal members of the family could act as decarboxylases as part of a pathway involved in the general processing of highly oxidized substrates, while bacterial members were recruited to more specialized functional niches likely involving the processing of amino acids or associated metabolites. The MppR protein itself clusters with several actinobacterial proteins all likely to play a role in End biosynthesis; other bacterial MppR-like domains could act on similar, End-like substrates (File S1).
Conclusions
In spite of significant sequence and structural identity to the classical acetoacetate decarboxylases, as suggested by a comparative genome analysis and shown through targeted biochemical assays, MppR is not a decarboxylase. Rather, it has Type I pyruvate aldolase activity. The demonstrated aldolase-dehydratase activity corroborates the idea that MppR catalyzes the dehydration of an oxidized arginine derivative (e.g. 10) to give the 3–4 unsaturated derivative 11b, which is subsequently cyclized to give 8. It is also possible that MppR may catalyze the aldol condensation of pyruvate and guanidinoacetaldehyde (5) to produce compound 10. From a broader perspective, the contextual analysis of this and additional ADCSF families suggests that the ADC fold displays considerable adaptability for the acquisition of novel biochemical functions. Rapid exploration of substrate space in ADC-like enzymes appears to proceed through retention of core catalytic residues with punctuating loss and gain of ancillary structural and sequence features concomitant with extensive horizontal gene flow. Similar patterns have been observed previously in other rapidly-evolving enzyme families (51–53). The overriding implication of these findings is that the ADC domains represent possible fruitful avenues of biophysical and biochemical investigations, potentially leading to the discovery of useful biocatalysts like the classical ADCs.
, which presumably dehydrates
Supplementary Material
ACKNOWLEDGMENT
FUNDING STATEMENT: Work at Oregon State University was supported by NIAID grant AI073784. This work is presently supported by NSF MRI grant # 0959442.
Use of the Advanced Photon Source was supported by the U. S. Department of Energy, Office of Science, Office of Basic Energy Sciences, under Contract No. DE-AC02-06CH11357. Use of the LS-CAT Sector 21 was supported by the Michigan Economic Development Corporation and the Michigan Technology Tri-Corridor for the support of this research program (Grant 085P1000817).
Glossary
- ADC
acetoacetate decarboxylase
- ADCSF
acetoacetate decarboxylase-like superfamily
- End
enduracididine
- βhEnd
β-hydroxyenduracididine
Footnotes
Supporting Information Available. The methods used in the bioinformatic analysis of the ADCSF are described in detail. Additional images illustrating the 5-NSA labeling of MppR and the distribution are gene context of MppR-like sequences, as well as a text file with sequence alignments, are also provided. This material is available free of charge via the Internet at http://pubs.acs.org.
REFERENCES
- 1.Foglia EE, Fraser VJ, Elward AM. Effect of nosocomial infections due to antibiotic-resistant organisms on length of stay and mortality in the pediatric intensive care unit. Infect. Control Hosp. Epidemiol. 2007;28:299–306. doi: 10.1086/512628. [DOI] [PubMed] [Google Scholar]
- 2.Mauldin PD, Salgado CD, Durkalski VL, Bosso JA. Nosocomial infections due to methicillin-resistant Staphylococcus aureus and vancomycin-resistant enterococcus: relationships with antibiotic use and cost drivers. Ann. Pharmacother. 2008;42:317–326. doi: 10.1345/aph.1K501. [DOI] [PubMed] [Google Scholar]
- 3.Spellberg B, Guidos R, Gilbert D, Bradley J, Boucher HW, Scheld WM, Bartlett JG, Edwards J., Jr. The epidemic of antibiotic-resistant infections: a call to action for the medical community from the Infectious Diseases Society of America. Clin. Infect. Dis. 2008;46:155–164. doi: 10.1086/524891. [DOI] [PubMed] [Google Scholar]
- 4.He H. Mannopeptimycins, a novel class of glycopeptide antibiotics active against gram-positive bacteria. Appl. Microbiol. Biotechnol. 2005;67:444–452. doi: 10.1007/s00253-004-1884-z. [DOI] [PubMed] [Google Scholar]
- 5.Magarvey NA, Haltli B, He M, Greenstein M, Hucul JA. Biosynthetic pathway for mannopeptimycins, lipoglycopeptide antibiotics active against drug-resistant gram-positive pathogens. Antimicrob. Agents Chemother. 2006;50:2167–2177. doi: 10.1128/AAC.01545-05. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Singh MP, Petersen PJ, Weiss WJ, Janso JE, Luckman SW, Lenoy EB, Bradford PA, Testa RT, Greenstein M. Mannopeptimycins, new cyclic glycopeptide antibiotics produced by Streptomyces hygroscopicus LL-AC98: antibacterial and mechanistic activities. Antimicrob. Agents Chemother. 2003;47:62–69. doi: 10.1128/AAC.47.1.62-69.2003. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Ruzin A, Singh G, Severin A, Yang Y, Dushin RG, Sutherland AG, Minnick A, Greenstein M, May MK, Shlaes DM, Bradford PA. Mechanism of action of the mannopeptimycins, a novel class of glycopeptide antibiotics active against vancomycin-resistant gram-positive bacteria. Antimicrob. Agents Chemother. 2004;48:728–738. doi: 10.1128/AAC.48.3.728-738.2004. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Guppi SR, O’Doherty GA. Synthesis of aza-analogues of the glycosylated tyrosine portion of mannopeptimycin-E. J. Org. Chem. 2007;72:4966–4969. doi: 10.1021/jo070326r. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Petersen PJ, Wang TZ, Dushin RG, Bradford PA. Comparative in vitro activities of AC98-6446, a novel semisynthetic glycopeptide derivative of the natural product mannopeptimycin alpha, and other antimicrobial agents against gram-positive clinical isolates. Antimicrob. Agents Chemother. 2004;48:739–746. doi: 10.1128/AAC.48.3.739-746.2004. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.He H, Shen B, Petersen PJ, Weiss WJ, Yang HY, Wang TZ, Dushin RG, Koehn FE, Carter GT. Mannopeptimycin esters and carbonates, potent antibiotic agents against drug-resistant bacteria. Bioorg. Med. Chem. Lett. 2004;14:279–282. doi: 10.1016/j.bmcl.2003.09.071. [DOI] [PubMed] [Google Scholar]
- 11.Sum PE, How D, Torres N, Petersen PJ, Ashcroft J, Graziani EI, Koehn FE, Mansour TS. Synthesis and evaluation of ether and halogenated derivatives of mannopeptimycin glycopeptide antibiotics. Bioorg. Med. Chem. Lett. 2003;13:2805–2808. doi: 10.1016/s0960-894x(03)00542-0. [DOI] [PubMed] [Google Scholar]
- 12.Olivier KS, Van Nieuwenhze MS. Synthetic studies toward the mannopeptimycins: synthesis of orthogonally protected beta-hydroxyenduracididines. Org. Lett. 2010;12:1680–1683. doi: 10.1021/ol100219a. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.He H, Williamson RT, Shen B, Graziani EI, Yang HY, Sakya SM, Petersen PJ, Carter GT. Mannopeptimycins, novel antibacterial glycopeptides from Streptomyces hygroscopicus, LL-AC98. J. Am. Chem. Soc. 2002;124:9729–9736. doi: 10.1021/ja020257s. [DOI] [PubMed] [Google Scholar]
- 14.Haltli B, Tan Y, Magarvey NA, Wagenaar M, Yin X, Greenstein M, Hucul JA, Zabriskie TM. Investigating beta-hydroxyenduracididine formation in the biosynthesis of the mannopeptimycins. Chem Biol. 2005;12:1163–1168. doi: 10.1016/j.chembiol.2005.09.013. [DOI] [PubMed] [Google Scholar]
- 15.Hatano K, Nogami I, Higashide E, Kishi T. Biosynthesis of Enduracidin: Origin of Enduracididine and Other Amino Acids. Agric. Biol. Chem. 1984;48:1503–1508. [Google Scholar]
- 16.Yin X, Zabriskie TM. The enduracidin biosynthetic gene cluster from Streptomyces fungicidicus. Microbiology. 2006;152:2969–2983. doi: 10.1099/mic.0.29043-0. [DOI] [PubMed] [Google Scholar]
- 17.Ju J, Ozanick SG, Shen B, Thomas MG. Conversion of (2S)-arginine to (2S,3R)-capreomycidine by VioC and VioD from the viomycin biosynthetic pathway of Streptomyces sp. strain ATCC11861. Chembiochem. 2004;5:1281–1285. doi: 10.1002/cbic.200400136. [DOI] [PubMed] [Google Scholar]
- 18.Yin X, Zabriskie TM. VioC is a non-heme iron, alpha-ketoglutarate-dependent oxygenase that catalyzes the formation of 3S-hydroxy-L-arginine during viomycin biosynthesis. Chembiochem. 2004;5:1274–1277. doi: 10.1002/cbic.200400082. [DOI] [PubMed] [Google Scholar]
- 19.Yin X, McPhail KL, Kim KJ, Zabriskie TM. Formation of the nonproteinogenic amino acid 2S,3R-capreomycidine by VioD from the viomycin biosynthesis pathway. Chembiochem. 2004;5:1278–1281. doi: 10.1002/cbic.200400187. [DOI] [PubMed] [Google Scholar]
- 20.Highbarger LA, Gerlt JA, Kenyon GL. Mechanism of the reaction catalyzed by acetoacetate decarboxylase. Importance of lysine 116 in determining the pKa of active-site lysine 115. Biochemistry. 1996;35:41–46. doi: 10.1021/bi9518306. [DOI] [PubMed] [Google Scholar]
- 21.Warren S, Zerner B, Westheimer FH. Acetoacetate decarboxylase. Identification of lysine at the active site. Biochemistry. 1966;5:817–823. doi: 10.1021/bi00867a002. [DOI] [PubMed] [Google Scholar]
- 22.Ho MC, Menetret JF, Tsuruta H, Allen KN. The origin of the electrostatic perturbation in acetoacetate decarboxylase. Nature. 2009;459:393–397. doi: 10.1038/nature07938. [DOI] [PubMed] [Google Scholar]
- 23.Meister A. Enzymatic preparation of alpha-keto acids. J. Biol. Chem. 1952;197:309–317. [PubMed] [Google Scholar]
- 24.Otwinowski Z, Minor W. Processing of X-ray Diffraction Data Collected in Oscillation Mode. Methods Enzymol. 1997;276:307–326. doi: 10.1016/S0076-6879(97)76066-X. [DOI] [PubMed] [Google Scholar]
- 25.Battye TG, Kontogiannis L, Johnson O, Powell HR, Leslie AG. iMOSFLM: a new graphical interface for diffraction-image processing with MOSFLM. Acta Crystallogr. Sect. D Biol. Crystallogr. 2011;67:271–281. doi: 10.1107/S0907444910048675. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26.Leslie AGW. Recent changes to the MOSFLM package for processing film and image plate data. Joint CCP4 + ESF-EAMCB Newsletter on Protein Crystallography. 1992;26 [Google Scholar]
- 27.Evans P. Scaling and assessment of data quality. Acta Crystallogr. Sect. D Biol. Crystallogr. 2006;62:72–82. doi: 10.1107/S0907444905036693. [DOI] [PubMed] [Google Scholar]
- 28.Collaborative Computational Project, N. The CCP4 suite: programs for protein crystallography. Acta Crystallogr. Sect. D Biol. Crystallogr. 1994;50:760–763. doi: 10.1107/S0907444994003112. [DOI] [PubMed] [Google Scholar]
- 29.Vonrhein C, Blanc E, Roversi P, Bricogne G. Automated structure solution with autoSHARP. Methods Mol. Biol. 2007;364:215–230. doi: 10.1385/1-59745-266-1:215. [DOI] [PubMed] [Google Scholar]
- 30.Zwart PH, Afonine PV, Grosse-Kunstleve RW, Hung LW, Ioerger TR, McCoy AJ, McKee E, Moriarty NW, Read RJ, Sacchettini JC, Sauter NK, Storoni LC, Terwilliger TC, Adams PD. Automated structure solution with the PHENIX suite. Methods Mol. Biol. 2008;426:419–435. doi: 10.1007/978-1-60327-058-8_28. [DOI] [PubMed] [Google Scholar]
- 31.Terwilliger TC, Grosse-Kunstleve RW, Afonine PV, Moriarty NW, Zwart PH, Hung LW, Read RJ, Adams PD. Iterative model building, structure refinement and density modification with the PHENIX AutoBuild wizard. Acta Crystallogr. Sect. D Biol. Crystallogr. 2008;64:61–69. doi: 10.1107/S090744490705024X. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32.Emsley P, Lohkamp B, Scott WG, Cowtan K. Features and development of Coot. Acta Crystallogr. Sect. D Biol. Crystallogr. 2010;66:486–501. doi: 10.1107/S0907444910007493. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33.Afonine PV, Mustyakimov M, Grosse-Kunstleve RW, Moriarty NW, Langan P, Adams PD. Joint X-ray and neutron refinement with phenix.refine. Acta Crystallogr. Sect. D Biol. Crystallogr. 2010;66:1153–1163. doi: 10.1107/S0907444910026582. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34.Word JM, Lovell SC, Richardson JS, Richardson DC. Asparagine and glutamine: using hydrogen atom contacts in the choice of side-chain amide orientation. J. Mol. Biol. 1999;285:1735–1747. doi: 10.1006/jmbi.1998.2401. [DOI] [PubMed] [Google Scholar]
- 35.Painter J, Merritt EA. Optimal description of a protein structure in terms of multiple groups undergoing TLS motion. Acta Crystallogr. Sect. D Biol. Crystallogr. 2006;62:439–450. doi: 10.1107/S0907444906005270. [DOI] [PubMed] [Google Scholar]
- 36.Chen VB, Arendall WB, 3rd, Headd JJ, Keedy DA, Immormino RM, Kapral GJ, Murray LW, Richardson JS, Richardson DC. MolProbity: all-atom structure validation for macromolecular crystallography. Acta Crystallogr. Sect. D Biol. Crystallogr. 2010;66:12–21. doi: 10.1107/S0907444909042073. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 37.Urzhumtseva L, Afonine PV, Adams PD, Urzhumtsev A. Crystallographic model quality at a glance. Acta Crystallogr. Sect. D Biol. Crystallogr. 2009;65:297–300. doi: 10.1107/S0907444908044296. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 38.McCoy AJ, Grosse-Kunstleve RW, Adams PD, Winn MD, Storoni LC, Read RJ. Phaser crystallographic software. J Appl Crystallogr. 2007;40:658–674. doi: 10.1107/S0021889807021206. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 39.Moriarty NW, Grosse-Kunstleve RW, Adams PD. electronic Ligand Builder and Optimization Workbench (eLBOW): a tool for ligand coordinate and restraint generation. Acta Crystallogr D Biol Crystallogr. 2009;65:1074–1080. doi: 10.1107/S0907444909029436. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 40.Tagaki W, Westheimer FH. Acetoacetate decarboxylase. The molecular weight of the enzyme and subunits. Biochemistry. 1968;7:895–900. doi: 10.1021/bi00843a002. [DOI] [PubMed] [Google Scholar]
- 41.Frey PA, Kokesh FC, Westheimer FH. A reporter group at the active site of acetoacetate decarboxylase. I. Ionization constant of the nitrophenol. J. Am. Chem. Soc. 1971;93:7266–7269. doi: 10.1021/ja00755a024. [DOI] [PubMed] [Google Scholar]
- 42.Kokesh FC, Westheimer FH. A reporter group at the active site of acetoacetate decarboxylase. II. Ionization constant of the amino group. J. Am. Chem. Soc. 1971;93:7270–7274. doi: 10.1021/ja00755a025. [DOI] [PubMed] [Google Scholar]
- 43.Andreeva A, Murzin AG. Structural classification of proteins and structural genomics: new insights into protein folding and evolution. Acta Crystallogr Sect F Struct Biol Cryst Commun. 2010;66:1190–1197. doi: 10.1107/S1744309110007177. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 44.Gallivan JP, Dougherty DA. Cation-pi interactions in structural biology. Proc. Natl. Acad. Sci. U. S. A. 1999;96:9459–9464. doi: 10.1073/pnas.96.17.9459. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 45.Krissinel E, Henrick K. Inference of macromolecular assemblies from crystalline state. J. Mol. Biol. 2007;372:774–797. doi: 10.1016/j.jmb.2007.05.022. [DOI] [PubMed] [Google Scholar]
- 46.Krissinel E, Henrick K. Secondary-structure matching (SSM), a new tool for fast protein structure alignment in three dimensions. Acta Crystallogr D Biol Crystallogr. 2004;60:2256–2268. doi: 10.1107/S0907444904026460. [DOI] [PubMed] [Google Scholar]
- 47.Nakane A, Nakamura T, Eguchi Y. A novel metabolic fate of arginine in Streptomyces eurocidicus. Partial resolution of the pathway and identification of an intermediate. J. Biol. Chem. 1977;252:5267–5273. [PubMed] [Google Scholar]
- 48.Meister A. The alpha-keto analogues of arginine, ornithine, and lysine. J. Biol. Chem. 1954;206:577–585. [PubMed] [Google Scholar]
- 49.Elkins JG, Podar M, Graham DE, Makarova KS, Wolf Y, Randau L, Hedlund BP, Brochier-Armanet C, Kunin V, Anderson I, Lapidus A, Goltsman E, Barry K, Koonin EV, Hugenholtz P, Kyrpides N, Wanner G, Richardson P, Keller M, Stetter KO. A korarchaeal genome reveals insights into the evolution of the Archaea. Proc. Natl. Acad. Sci. U. S. A. 2008;105:8102–8107. doi: 10.1073/pnas.0801980105. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 50.Liu ZJ, Sun YJ, Rose J, Chung YJ, Hsiao CD, Chang WR, Kuo I, Perozich J, Lindahl R, Hempel J, Wang BC. The first structure of an aldehyde dehydrogenase reveals novel interactions between NAD and the Rossmann fold. Nat. Struct. Biol. 1997;4:317–326. doi: 10.1038/nsb0497-317. [DOI] [PubMed] [Google Scholar]
- 51.Burroughs AM, Iyer LM, Aravind L. Natural history of the E1-like superfamily: implication for adenylation, sulfur transfer, and ubiquitin conjugation. Proteins. 2009;75:895–910. doi: 10.1002/prot.22298. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 52.Aravind L, Anantharaman V, Zhang D, de Souza RF, Iyer LM. Gene flow and biological conflict systems in the origin and evolution of eukaryotes. Front Cell Infect Microbiol. 2012;2:89. doi: 10.3389/fcimb.2012.00089. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 53.Iyer LM, Abhiman S, Maxwell Burroughs A, Aravind L. Amidoligases with ATP-grasp, glutamine synthetase-like and acetyltransferase-like domains: synthesis of novel metabolites and peptide modifications of proteins. Mol Biosyst. 2009;5:1636–1660. doi: 10.1039/b917682a. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 54.Fenn TD, Ringe D, Petsko GA. POVScript+: a program for model and data visualization using persistence of vision ray-tracing. J Appl Crystallogr. 2003;36:944–947. [Google Scholar]
- 55.Kraulis P. MOLSCRIPT: a program to produce both detailed and schematic plots of protein structures. J Appl Crystallogr. 1991;24:946–950. [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.








