Abstract
Gellan gum is a widely used commercial material, available in many different forms. Its economic importance has led to studies into the biosynthesis of exopolysaccharide gellan gum, which is industrially prepared in high yields using Sphingomonas elodea ATCC 31461. Glucose-1-phosphate uridylyltransferase mediates the reversible conversion of glucose-1-phosphate and UTP into UDP-glucose and pyrophosphate, which is a key step in the biosynthetic pathway of gellan gums. Here we present the X-ray crystal structure of the glucose-1-phosphate uridylyltransferase from S. elodea. The S. elodea enzyme shares strong monomeric similarity with glucose-1-phosphate thymidylyltransferase, several structures of which are known, although the quaternary structures of the active enzymes are rather different. A detailed comparison between S. elodea glucose-1-phosphate uridylyltransferase and available thymidylyltransferases is described and shows remarkable structural similarities, despite the low sequence identities between the two divergent groups of proteins.
The bacterium Sphingomonas elodea ATCC 31461 (25, 30) is an industrially used microorganism for the high-yield synthesis of exopolysaccharide gellan gum, approved in the United States and European Union as a food gelling, stabilizing, and suspending agent, either on its own or in combination with other hydrocolloids (9). In its native form, gellan is a linear anionic heteropolysaccharide based on a tetrasaccharide repeat unit composed of two molecules of d-glucose (d-Glc), one of d-glucuronic acid (d-GlcA), and one of l-rhamnose (l-Rha). Native gellan is partially esterified, with acetate and glycerate substituents on the d-glucosyl residue adjacent to the d-glucuronyl residue (1 mol of glycerate and 0.5 mol of acetate per repeat unit) (18), with an average molecular mass of about 0.5 MDa. The rheological properties of gellan are significantly modified upon the removal of acyl substituents. They may be turned from soft and elastic gels to more brittle and harder ones, depending on the level of glycerate substituents present in the repeating unit (18).
Gellan gum is available commercially in three chemical forms, no, low, and high acyl content, with the respective denominations of Gelrite, Kelcogel F, and Kelcogel LT100. Gelrite is used as a substitute for agar in microbiological and tissue culture media. The Kelcogels are food-grade gellan gums mainly used as gelling agents in foods and personal care applications. Other biomedical applications of gellan include its use as a pharmaceutical excipient for nasal and ocular drug delivery applications (32) and as material for the construction of three-dimensional (3-D) scaffolds for tissue engineering (10).
The commercial applications of gellan have been a stimulus to study its biosynthesis. The production pathway is a multistep process starting with the intracellular formation of the nucleotide-sugar precursors, UDP-Glc, UDP-GlcA, and dTDP-l-Rha. This is followed by the formation of the repeat unit, with sequential transfer of the sugar donors to an activated lipid carrier by committed glycosyltransferases and, ultimately, by gellan polymerization and export (28). The genes coding for the proteins involved in the synthesis of dTDP-l-Rha, the glycosyltransferases, and the proteins required for gellan polymerization and export are located together in the gel cluster (28, 34). However, the genes (pgmG, ugpG, and ugdG) coding the enzymes required for the synthesis of the nucleotide sugars (UDP-Glc and UDP-GlcA) from Glc-1-phosphate (G1P) are located at different positions on the S. elodea genome (25, 28, 34), perhaps reflecting the broader functions of these enzymes beyond the specific gellan pathway.
G1P uridylyltransferase (UGP) (EC 2.7.7.9) mediates the reversible conversion of G1P and UTP into UDP-Glc and pyrophosphate. This enzyme is widely distributed throughout all domains of life, although the eukaryotic UGPs are unrelated in terms of amino acid sequence to their prokaryotic counterparts (13, 14, 16). UGPs belong to the superfamily of nucleoside diphosphate sugar pyrophosphorylases. The protein sequence most closely homologous to that of bacterial UGP is G1P thymidylyltransferase (RmlA) (EC 2.7.7.24), a bacterial enzyme that mediates the reversible production of dTDP-Glc from G1P and dTTP, the first of a four-step pathway necessary for the formation of the nucleotide sugar dTDP-l-Rha (30). The protein under study, UGP from S. elodea (UgpG), recognizes both UTP and dTTP nucleotides as substrates in vitro (30), while other UGPs have been reported as being unable to use dTTP as a substrate in vitro (5). Studies have been performed to understand why some of these proteins are promiscuous toward both their nucleoside triphosphate (NTP) and the sugar substrates (3, 4), but the reasons are yet to be established. We have previously described the kinetic properties (25, 30) and preliminary structure determination of UgpG from the industrial microorganism S. elodea ATCC 31461 (2). Recently, the coordinates of a crystallographic structure of Escherichia coli UGP were deposited in the PDB under accession number 2E3D, but no associated publication was reported. The study presented here reports the structure analysis of UgpG and highlights the main differences between UGPs and RmlAs.
MATERIALS AND METHODS
Expression, purification, and crystallization of UgpG.
UgpG was expressed, purified, and crystallised as previously described (2). Briefly, for the production of native and seleno-methionine-derivatized proteins, the expression vector was transformed into E. coli BL21 Gold(DE3) and B834(DE3), respectively, grown to an optical density at 640 nm of 0.6, and induced with 0.1 mM isopropyl-β-d-thiogalactopyranoside for 3 h at 37°C. The bacteria were harvested and lysed (French press), and the cell debris removed. The supernatant was applied to a nickel affinity column and eluted with a gradient of increasing imidazole concentration. The chromatographic peak corresponding to UgpG was separated and concentrated to 10 mg/ml in a buffer containing 30 mM NaCl and 50 mM Tris-HCl (pH 7.6), and crystallized by vapor diffusion in hanging drops at 20°C. The UgpG-G1P crystals were obtained by the vapor diffusion method, equilibrating drops of 5 μl of the protein solution complemented with 5 mM G1P and 2 μl of reservoir solution against the reservoir containing 500 μl of 0.1 M sodium citrate (pH 4.6), 0.1 M ammonium acetate, and 15% polyethylene glycol monomethyl ether. A heavy atom soak was carried out on the native crystals with 10 mM K2OsO4 for 30 min.
Data collection, structure determination, and refinement.
X-ray diffraction data were collected at the European Synchrotron Radiation Facility (ESRF), Grenoble, France, beamlines ID14-2 and ID29. The diffraction images were integrated with MOSFLM (24) and the intensities scaled, merged together, and reduced to structure factor magnitudes using the CCP4 suite of programs (8), which was also used for further data handling and scaling purposes. Reflections for the calculation of Rfree were grouped in thin resolution shells, due to the presence of eightfold noncrystallographic symmetry (NCS). Diffraction data statistics are presented in Table 1.
TABLE 1.
Refinement statistics for UgpG at 2.65 Å
| Parameter | Result(s) |
|---|---|
| Resolution limits (Å) | 102.0-2.65 |
| Highest resolution shell | 2.67-2.65 |
| Rwork (no. of observations) | 0.245 (76,179) |
| Rfree (no. of observations) | 0.295 (1,992) |
| Highest resolution shell | |
| Rwork (no. of observations) | 0.384 (1,883) |
| Rfreea (no. of observations) | 0.403 (492) |
| Solvent water molecules | 160 |
| G1P atoms | 8 × 16 |
| Residues not located | |
| A1 | 1, 162-172, 187-200, 223-230 |
| A1′ | 223-229 |
| B1 | 1, 223-229 |
| B1′ | 1, 223-230 |
| A2 | 1, 162-182, 205-219, 221-229 |
| A2′ | 1, 222-229 |
| B2 | 1 |
| B2′ | 1, 224-229 |
| Average Biso (Å2) | |
| All atoms (solvent) | 54 (39) |
| Results for monomers (A1, A1′, B1, B1′, A2, A2′, B2, B2′) | |
| Biso Protein atoms (Å2) | 61, 52, 56, 53, 52, 48, 54, 59 |
| Biso Main chain (Å2) | 61, 50, 55, 52, 50, 46, 52, 57 |
| Biso Side chain (Å2) | 62, 54, 56, 56, 54, 49, 56, 61 |
| No. amino acids | 255, 282, 281, 280, 244, 280, 288, 282 |
| No. protein atoms | 1,940, 2,145, 2,135, 2,126, 1,866, 2,131, 2,174, 2,144 |
| Deviations from the ideal | |
| Bond distances (Å) | 0.012 |
| Bond angles (o) | 1.84 |
The Rwork highest resolution shell differs from that of the Rfree set, 2.75Å to 2.77 Å, because the last was selected in thin shells in order to minimize the bias due to the restrained noncrystallographic symmetry refinement.
The UgpG G1P structure was determined (2) by the multiple isomorphous replacement and anomalous scattering phasing method. Data were collected from a native crystal, from an osmium salt-soaked crystal (collected at the peak of the osmium L-III absorption edge) and from a seleno-methionine-derivatized crystal (collected at the peak of the selenium K absorption edge). The program autoSHARP (35) was used to identify the locations of 12 osmium atoms and produce a set of initial phases with an overall isomorphous phasing power of 0.73. With these initial phases, 80 selenium sites were found by SHARP (7), using a phased anomalous difference map calculated with the selenium data. The combined phases were improved using the NCS information with the program DM (11) to average the electron density maps. A preliminary model was produced with COOT (15) and was iteratively fed back to SHARP (7), allowing an improvement of the overall isomorphous phasing power from 1.3 to 1.8 in the range of 100 to 3.5 Å resolution. The molecular model was completed and refined with the program CNS (1) using the NCS restraints, simulating annealing and a maximum-likelihood target.
Comparison of 3-D structures.
All of the RmlA and the E. coli UGP coordinate files deposited in the Protein Data Bank were aligned with the UgpG structure presented here by using the secondary structure matching algorithm (22) implemented in COOT (15). This initial alignment was further refined with MODELLER (29), with a threshold of 3.5 Å for interprotein residue matching.
Since this paper was submitted, the 1.9-Å-resolution description of the glucose-1-phosphate uridylyltransferase from E. coli has been published (32a).
Protein structure accession numbers.
The coordinates and structure factors of UgpG have been deposited in the Protein Data Bank (PDB accession numbers, 2UX8 and 2UX8SF).
RESULTS
Crystal contents and structure quality.
The UgpG crystals belong to space group P21, with the following cell dimensions: a = 105.5 Å, b = 85.7 Å, c = 151.8 Å, and β = 105.2°. They diffract to a resolution of 2.65 Å and contain eight molecules in the asymmetric unit, which corresponds to a calculated solvent content of 51.7% (26). The asymmetric unit contains 2,219 residues, out of a possible total of 2,312, with monomers A1 and A2 being those with fewer residues visible in the electron density maps, with only 86% and 85% of their possible atoms modeled, respectively (Table 1). The histidine tag present in the protein sequence is not visible in the density for any of the monomers. Other undetected residues belong to loops and were not built into the model when not visible in 1-σ electron density maps, probably due to disorder. The asymmetric unit consists of two tetramer units, with one G1P molecule per UgpG monomer and a total of 160 modeled water molecules.
An enzyme gel filtration profile and the results of dynamic light scattering experiments show a single species in solution of a mass of ∼120 kDa (data not shown). This corresponds to approximately four times the expected molecular mass of the 289-residue protein monomer and suggests that the crystallographic tetramers are the functional units of UgpG.
The refined structure converged to a final Rwork and Rfree of 24.5% and 29.5%, respectively. A stereochemical analysis of the refined model using PROCHECK (23) revealed 86%, 13%, 1%, and 1% residues in the most favorable, additional allowed, generously allowed, and disallowed regions of the Ramachandran plot, respectively. The residues in the disallowed areas are partially supported by the electron density maps and lie in regions with high B factors. The overall geometry was better than for the set of PROCHECK comparative structures at a resolution of 2.65 Å, with a G factor of 0.13.
Overall polypeptide fold.
The monomer (Fig. 1a) has an overall size of around 60 Å, 45 Å, 35 Å and is classified by CATH (27) as an alpha/beta complex fold (CATH classification of 3.90). Each monomer is made up of 10 helices and 15 β-strands (20), labeled consecutively from 1 to 10 and A to O, respectively (Fig. 1b). The UgpG monomer is built by a large, mixed β-sheet, reminiscent of the Rossman fold but including nine β-strands, spanning the whole molecule and surrounded by10 helices and two additional anti-parallel β-sheets. The molecule may be divided into three functional regions or subdomains (12). The nucleotide binding subdomain (NBSD) (residues 1 to 132) is composed of the first four β-strands of the main β-sheet sandwiched between their interconnecting helices. It resembles other nucleotide binding domains (6) and is strongly involved in intermonomer interactions. Subdomain 2, the sugar binding subdomain (SBSD) (residues 133 to 256), binds G1P and is the most compact of the three subdomains. It is composed of the second half of the main β-sheet, including five mixed strands, the two extra β-sheets with two and four strands, and four α-helices. Finally, subdomain 3, the dimerization subdomain (DSD) (residues 257 to 289) is composed of two α-helices arranged like a “V” and, together with subdomain 1, is responsible for the formation of the active tetramer (see below).
FIG. 1.
UgpG monomer. (a) Ribbon representation of UgpG showing G1P in ball-and-stick representation (black). The molecule may be divided into three functional regions, the NBSD, the SBSD, and the DSD, which are colored in light gray, medium gray, and dark gray, respectively. (b) Topological diagram of UgpG using the same color scheme as described for panel a for subdomain identification. Arrows represent β-strands and cylinders represent helices. (c) Superposition of UgpG (black ribbon) and RmlA structures (gray) showing their overall similarity. The loop consisting of residues 219 to 229 lies above the catalytic cavity. The labeled helix bundle, only present in the RmlA structures, contains a second allosteric NTP binding site.
Quaternary structure and oligomerisation interfaces.
The asymmetric unit contains two homotetramers that can be described as dimers of dimers (A1A1′ and B1B1′ and A2A2′ and B2B2′, with subscripts denoting the tetramer, letters identifying primary dimers, and primes identifying monomers), with the monomers showing positional root-mean-square differences (RMSDs) between homologous Cα's within 0.2 to 0.4 Å for 237 positions to a cutoff of 3.5 Å. This assembly is responsible for an overall buried surface area upon tetramer formation of 12 kÅ2, slightly larger than the 10-kÅ2 surface of one monomer alone (calculated using AREAIMOL) (8). The interdimer assembly that produces the tetramer (Fig. 2a and b) is dominated by the stacking of helices 2 and β-strand C between monomers A and B (in a total of 36 to 46 van der Waals interactions [vdW]; 13 to 20 hydrogen bonds). The interdimer interfaces are strongly bound due to two particular interactions of the NBSD and DSD. From the NBSD, residues 15 to 39 form a set of hydrogen bonds (defined as such up to 3.2 Å) and favorable apolar interactions (vdW; defined as such up to 3.6 Å) with their symmetry equivalents, and in particular, methionines 27 and 31 and 27′ and 31′ create a four-methionine apolar region (Fig. 2c). These interactions are responsible for 28 to 52 vdW interactions and 6 to 10 hydrogen bonds between monomers, depending on the particular dimer. The DSD is responsible for the helix-helix interactions between helices 9 and 10 and 9′ and 10′, with one set with the “V” shape fitted into the inverted “V” of the neighbor monomer (Fig. 2b, middle top and middle bottom, and d). This interaction is responsible for 8 to 17 vdW interactions and one or two hydrogen bonds within each dimer. The two interactions account, on average, for 90% of the total vdW interactions and 70% of the total hydrogen bonds that create the dimer. Upon tetramer formation, the parallel alignment of the C β-strands allows the extension of the monomers' wide β-sheet into two interdimer supra-β-sheets (Fig. 2e, stacks of near-parallel strands represented by lines), contributing to the tetramer stability. These interactions account for 82% of the vdW interactions and 98% of the hydrogen bonds of the total dimer-dimer contact interface.
FIG. 2.
UGP tetramer. (a, b, and d) Perpendicular views of the UgpG oligomer, colored according to the monomers, in ribbon representation, with G1P in ball-and-stick representation. Dimers AA′ and BB′ are represented by the green and magenta and the yellow and blue monomers, respectively. While panel b shows the whole tetramer, panels a and d represent only the front monomers, for purposes of clarity. (c) A-A′ interactions (residues 15 to 37) with residues 27 and 27′ and 31and 31′, creating a four-methionine apolar region. (e and f) Schemes of UGP and RmlA tetramer arrangements, respectively. The rectangles represent monomers, color coded as described for panel b. The lines inside the monomer rectangles represent β-strands, which in UGP form an extended β-sheet across monomers A and B. The schemes highlight the different quaternary structures adopted by UGP and RmlA.
Catalytic cavity.
A loop between the K and L β-strands over the catalytic cavity contains the conserved sequence V190-E191-K192-P193 (UgpG numbering) of UGPs and RmlAs (Fig. 3) (30, 33), where K192 is thought to stabilize the negative charge of the phosphate group when the reaction takes place. The catalytic cavity spreads through both NBSD and SBSD and is solvent accessible (Fig. 4a). In this structure the substrate, G1P, is located at one side of the cavity, and although it does not undergo any direct interactions with K192, hydrogen bonds are formed with residues D134, G172, E191, and V204 (Fig. 4b). Glc oxygen O3 forms a hydrogen bond to the carboxyl moiety of E191 and to the nitrogen of G172. Oxygen O4 is also bound to the same G172 main-chain atom and has an additional bond to the carbonyl of V204. Oxygen O6 makes a hydrogen bond with OD1 of D134, and finally, in four of the eight monomers, O2 forms a hydrogen bond to a visible water molecule bridging to T234. The side chains of residues L111, L131, V204, Y208, and L231 form the boundaries of the catalytic cavity by providing a hydrophobic cap to the base of the sugar ring and are generally conserved in RmlA sequences (Fig. 3). The NBSD part of the catalytic cavity is empty, with sufficient space to allow a nucleotide molecule to bind (Fig. 4a to c, and see Modeling of nucleotide binding, below).
FIG. 3.
3-D-structure alignment of all available RmlA and UGP structures upon superposition with MODELLER (29). Residues boxed in white form H bonds with G1P, residues highlighted in gray form H bonds with nucleotides, and residues highlighted in black constitute a hydrophobic cap to the base of the sugar ring at the catalytic cavity. RmlA PDB accession numbers 1IIN, 1IIM, 1MP3, 1MP4, and 1MP5 are from Salmonella enterica; 1G0R, 1G23, 1G3L, 1G1L, and 1FXO are from Pseudomonas aeruginosa; 1H5R and 1MC3 are from E. coli; and 1LVW is from Methanobacterium thermoautotrophicum. UGP PDB accession numbers are 2UX8 (S. elodea) and 2E3D (E. coli).
FIG. 4.
UgpG catalytic cavity. (a) UgpG transparent solvent-accessible surface around the catalytic cavity, with bound G1P (ball-and-stick) and catalytic H-bonding residues (sticks). On the left of G1P, there is enough space for an NTP molecule. The side chains, atoms of residues M109 and V204, and the main chains, atoms of residues Q105, D133, D134, and E191 were removed for clarity.(b) G1P H-bonding network with UgpG catalytic residues, where dashed lines represent proton-donor to proton-acceptor distances of up to 3.2 Å. (c) The Rmla-deoxythymidine complex structure 1H5R (36) with its H-bonding network. (d) A deoxythymidine molecule was manually fitted, using a molecular graphics workstation, into the empty UgpG catalytic cavity using the complex structure shown in panel c as a template. M109 is represented by its main-chain atoms only, for purposes of clarity. Carbon is represented in black, oxygen in white, nitrogen in light gray, and phosphorus in dark gray.
Substrate affinity and specificity.
Trials to cocrystallise UgpG with dTTP, UTP, and UDP-Glc failed. To check that our protein could indeed bind nucleotides, preliminary isothermal calorimetry studies using the available protein were made at 25°C with UTP in the presence of 20 mM Mg2+ and, separately, with G1P (D. Aragão, unpublished results). A qualitative analysis of the results obtained hinted at favorable bindings with similar free energy gains for both substrates. UTP binding has a small enthalpy contribution and is mainly controlled by an increase in entropy upon binding, probably from protein conformation changes and water displacement, while on the other hand, G1P binding is controlled by a considerable enthalpy gain. A possible reason for our failure to cocrystallise UgpG with any of the nucleotides tested is that their binding is favored by the presence of Mg2+ as a cofactor, and unfortunately, successful crystallization conditions were only found at low pH where protonated side chains thwart Mg2+ coordination.
Structural neighbors.
A structural similarity search using DALI (17) against the UgpG monomer shows a number of structurally related proteins, many of which belong to the superfamily of bacterial nucleotide diphosphate sugar pyrophosphorylases. The 12 structures with z scores between 22 (best) and 14 (worst) and RMSDs from 2.2 to 3.3 Å, respectively, are shown in Table S1 of the supplemental material. The model with the highest z score and, as it happens, the highest sequence identity (21% for 214 3-D homologous Cα positions) to UgpG is PDB accession number 1G0R, a Pseudomonas aeruginosa RmlA structure complexed with thymidine and G1P. Many of the other high-scoring structures share the NTP binding subdomain with UgpG.
Modeling of nucleotide binding.
The structural superpositions described above show that the proteins' NBSDs are very similar. This fact, together with the existence of several complexes with nucleotides, was used to help model thymidine in the UgpG catalytic cavity in the proximity of the bound G1P. Using the E. Coli RmlA structure complexed with both deoxythymidine and G1P (PDB accession number 1H5R [36]) as a reference (Fig. 4c), a homology model was built, as shown in Fig. 4d, that shows that also in UgpG thymidine is also likely to form hydrogen bonds with residues A14, G15, N105, M109, G110, and D133 in UgpG. UgpG can employ either thymidine or uradine as substrates. They differ in their structures only by a methyl group, and our structure has space to accommodate it, leading to little specificity between the two bases. One can expect the binding of either nucleotide to be very similar.
DISCUSSION
Both UgpG and RmlA from S. elodea ATCC 31461 are able to catalyze, in vitro, the same enzymatic reactions (30). Several structures of RmlAs have been determined to date and display the same Rossmann-like fold with an NBSD plus an SBSD.
The 3-D structural comparison of the monomers of 13 published X-ray crystal structures of RmlAs with UgpG shows similar monomer folds, with RMSDs within 1.5 and 1.7 Å for more than 200 fitted Cα positions with 3-D structural identities below 20% (see Table S2 in the supplemental material). Indeed, the sequence analysis made by Silva et al. (30) of 17 RmlA proteins and three putative or confirmed UGPs showed the two enzymes to form two divergent groups in a phylogenetic tree. The UgpG structure strongly suggests that dTDP and UDP Glc pyrophosphorylase enzymes share a common3-D structure, as Silva et al. suggested, albeit with low sequence conservation. Local inspection of the structure superpositions shows that the active-site cavities and side-chain spatial orientations of the catalytic residues are conserved (Fig. 3 and 4). A loop consisting of residues 219 to 229 and located between helices 7 and 8 closes the active site in all of the RmlA structures, whether substrates are bound or not, while in the UgpG structure, this loop adopts an “open” conformation, allowing access to the active site where G1P is already bound in place (Fig. 1c). It should be noted, though, that this loop in the UgpG structure is visible in the electron density maps and was modeled unambiguously for only one monomer. The lack of clear electron densities for the remaining monomers is a sign of the mobility of this loop, which acts as gatekeeper to the active site and which by holding both substrates may then help to stabilize the ternary complex.
When compared to UgpG, all RmlAs have a C-terminal extension of 30 residues forming a three-helix bundle that is absent in UgpG (Fig. 1c, bottom right). The volume corresponding to the missing helix bundle is partially occupied by helix 3 in UgpG. This region is particularly significant as it accommodates a second, presumed to be allosteric, NTP binding site on some RmlAs (31). There is, however, no biochemical, sequence, or structural evidence to date that UgpG has a similar second site.
The results of the DALI search to locate structural neighbors of the UgpG monomer show that similar proteins cover diverse quaternary structures with monomers and multimeric associations to form dimers and tetramers. However, none of the tetrameric structures shows a quaternary arrangement similar to that found in UgpG, which reveals a new type of oligomerisation compared to any of its DALI neighbors and, indeed, to any of the RmlA structures published to date. Oligomers 1G0R (RmlA (4) and 1QWJ (murine CMP-5-N-acetylneuraminic acid synthetase (21) have RmlA-like tetrameric arrangements (see below), and the tetrameric arrangement of 1YP2 (ADP-Glc pyrophosphorylase) (19) is different from that of UgpG due to an extra subdomain in 1YP2 that is the principal component for forming its monomer-monomer interactions.
While the UgpG tetramer has the four symmetrically related helices 2, one from each monomer, arranged in the center of the tetramer, all RmlA structures show their oligomerization equivalent to be helix 9. The interface that creates the dimer in UgpG remains the same as in RmlA, but the interface that builds the dimer of dimers is from the opposite side of the monomer (Fig. 2e and f). Consequently, the RmlA tetramers do not contain the long continuous stack of β-chains between adjacent monomers (see “Quaternary structure and oligomerisation interfaces” in Results).
A comparison of UgpG with the recently deposited E. coli UGP (PDB accession number 2D3D) shows a 1.4 Å RMSD between homologous Cα's to a cutoff of 3.5 Å, including 238 positions with a 35% identity in a 3-D-based sequence alignment. It is noteworthy that none of the four crystallographically independent molecules in 2D3D could be completely modeled. Most significantly, their ternary and quaternary structures coincide with those of UgpG.
The analysis of the UgpG catalytic cavity structure and its comparison with that of known RmlA structures reveal considerable similarities in sugar and nucleotide binding, as might be expected for similar catalytic functions. Moreover, in mechanistic terms, conserved residues can be identified in 3-D positions equivalent to those proposed to act as the catalytic residues in RamlAs, which points to a similar catalytic mechanism. Although space is available at the catalytic cavity of UgpG to accommodate either UTP or TTP, the presence of a flexible loop (residues 219 to 229) that could close the active site and the results of calorimetry suggest that protein conformation changes may play an important role in the different specific activities observed in vitro (25, 30), which may be determining for the UgpG preference for UTP.
Supplementary Material
Acknowledgments
We thank IBS and the EMBL Grenoble Outstation for the use of high-throughput crystallization robots, the ESRF for the provision of synchrotron radiation, and CERMAV for ITC experiments. We gratefully acknowledge Anne Imberty (CERMAV) for valuable discussions.
D.A. acknowledges a grant from FCT, SFRH (BD/6480/2001). This work was partially supported by FEDER and Fundação para a Ciência e a Tecnologia (FCT), Portugal (POCTI/BME/44441/2002 and POCTI/BIO/58041/2004).
Footnotes
Published ahead of print on 13 April 2007.
Supplemental material for this article may be found at http://jb.asm.org/.
REFERENCES
- 1.Adams, P. D., N. S. Pannu, R. J. Read, and A. T. Brunger. 1997. Cross-validated maximum likelihood enhances crystallographic simulated annealing refinement. Proc. Natl. Acad. Sci. USA 94:5018-5023. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2.Aragao, D., A. R. Marques, C. Frazao, F. J. Enguita, M. A. Carrondo, A. M. Fialho, I. Sa-Correia, and E. P. Mitchell. 2006. Cloning, expression, purification, crystallization and preliminary structure determination of glucose-1-phosphate uridylyltransferase (UgpG) from Sphingomonas elodea ATCC 31461 bound to glucose-1-phosphate. Acta Crystallogr.F 62:930-934. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.Barton, W. A., J. Lesniak, J. B. Biggins, P. D. Jeffrey, J. Jiang, K. R. Rajashankar, J. S. Thorson, and D. B. Nikolov. 2001. Structure, mechanism and engineering of a nucleotidylyltransferase as a first step toward glycorandomization. Nat. Struct. Biol. 8:545-551. [DOI] [PubMed] [Google Scholar]
- 4.Blankenfeldt, W., M. Asuncion, J. S. Lam, and J. H. Naismith. 2000. The structural basis of the catalytic mechanism and regulation of glucose-1-phosphate thymidylyltransferase (RmlA). EMBO J. 19:6652-6663. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Bonofiglio, L., E. Garcia, and M. Mollerach. 2005. Biochemical characterization of the pneumococcal glucose 1-phosphate uridylyltransferase (GalU) essential for capsule biosynthesis. Curr. Microbiol. 51:217-221. [DOI] [PubMed] [Google Scholar]
- 6.Brandeen, C., and J. Tooze. 1991. Introduction to protein structure. Garland Publishing Inc., New York, NY.
- 7.Bricogne, E. D. L. F. G. 1997. Maximum-likelihood heavy-atom parameter refinement for the multiple isomorphous replacement and multiwavelength anomalous diffraction methods, vol. 276. Academic Press, New York, NY. [DOI] [PubMed]
- 8.Collaborative Computational Project, Number 4. 1994. The CCP4 suite: programs for protein crystallography. Acta Crystallogr. Sect. D 50:760-763. [DOI] [PubMed] [Google Scholar]
- 9.Chandrasekaran, R., and A. Radha. 1995. Molecular architectures and functional properties of gellan gum and related polysaccharides. Trends Food Sci. Technol. 6:143-148. [Google Scholar]
- 10.Ciardelli, G., V. Chiono, G. Vozzi, M. Pracella, A. Ahluwalia, N. Barbani, C. Cristallini, and P. Giusti. 2005. Blends of poly-(epsilon-caprolactone) and polysaccharides in tissue engineering applications. Biomacromolecules. 6:1961-1976. [DOI] [PubMed] [Google Scholar]
- 11.Cowtan, K. 1994. DM—density modification package. Joint CCP4/ESF-EACBM Newsl. Protein Crystallogr. 31:34-38. [Google Scholar]
- 12.Creighton, T. E. 1993. Proteins: structures and molecular properties, 2nd ed. W. H. Freeman, New York, NY.
- 13.Daran, J. M., N. Dallies, D. Thines-Sempoux, V. Paquet, and J. Francois. 1995. Genetic and biochemical characterization of the UGP1 gene encoding the UDP-glucose pyrophosphorylase from Saccharomyces cerevisiae. Eur. J. Biochem. 233:520-530. [DOI] [PubMed] [Google Scholar]
- 14.Eimert, K., P. Villand, A. Kilian, and L. A. Kleczkowski. 1996. Cloning and characterization of several cDNAs for UDP-glucose pyrophosphorylase from barley (Hordeum vulgare) tissues. Gene 170:227-232. [DOI] [PubMed] [Google Scholar]
- 15.Emsley, P., and K. Cowtan. 2004. Coot: model-building tools for molecular graphics. Acta Crystallogr. D. 60:2126-2132. [DOI] [PubMed] [Google Scholar]
- 16.Flores-Diaz, M., A. Alape-Giron, B. Persson, P. Pollesello, M. Moos, C. von Eichel-Streiber, M. Thelestam, and I. Florin. 1997. Cellular UDP-glucose deficiency caused by a single point mutation in the UDP-glucose pyrophosphorylase gene. J. Biol. Chem. 272:23784-23791. [DOI] [PubMed] [Google Scholar]
- 17.Holm, L., and C. Sander. 1993. Protein structure comparison by alignment of distance matrices. J. Mol. Biol. 233:123-138. [DOI] [PubMed] [Google Scholar]
- 18.Jay, A. J., I. J. Colquhoun, M. J. Ridout, G. J. Brownsey, V. J. Morris, A. M. Fialho, J. H. Leitão, and I. Sa-Correia. 1988. Analysis of structure and function of gellans with different substitution patterns. Carbohydr. Polym. 35:179-188. [Google Scholar]
- 19.Jin, X., M. A. Ballicora, J. Preiss, and J. H. Geiger. 2005. Crystal structure of potato tuber ADP-glucose pyrophosphorylase. EMBO J. 24:694-704. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20.Kabsch, W., and C. Sander. 1983. Dictionary of protein secondary structure: pattern recognition of hydrogen-bonded and geometrical features. Biopolymers 22:2577-2637. [DOI] [PubMed] [Google Scholar]
- 21.Krapp, S., A. K. Munster-Kuhnel, J. T. Kaiser, R. Huber, J. Tiralongo, R. Gerardy-Schahn, and U. Jacob. 2003. The crystal structure of murine CMP-5-N-acetylneuraminic acid synthetase. J. Mol. Biol. 334:625-637. [DOI] [PubMed] [Google Scholar]
- 22.Krissinel, E., and K. Henrick. 2004. Secondary-structure matching (SSM), a new tool for fast protein structure alignment in three dimensions. Acta Crystallogr. D. 60:2256-2268. [DOI] [PubMed] [Google Scholar]
- 23.Laskowski, R. A., M. W. MacArthur, D. S. Moss, and J. M. Thornton. 1993. PROCHECK: a program to check the stereochemical quality of protein structures. J. Appl. Crystallogr. 26:283-291. [Google Scholar]
- 24.Leslie, A. G. W. 1992. Recent changes to the MOSFLM package for processing film and image plate data, p. 26. In Joint CCP4 and ESF-EACMB newsletter on protein crystallography. SERC, Daresbury Laboratory, Warrington, United Kingdom.
- 25.Marques, A. R., P. B. Ferreira, I. Sa-Correia, and A. M. Fialho. 2003. Characterization of the ugpG gene encoding a UDP-glucose pyrophosphorylase from the gellan gum producer Sphingomonas paucimobilis ATCC 31461. Mol. Genet. Genomics 268:816-824. [DOI] [PubMed] [Google Scholar]
- 26.Matthews, B. 1968. Solvent content of protein crystals. J. Mol. Biol. 33:491-497. [DOI] [PubMed] [Google Scholar]
- 27.Pearl, F. M., C. F. Bennett, J. E. Bray, A. P. Harrison, N. Martin, A. Shepherd, I. Sillitoe, J. Thornton, and C. A. Orengo. 2003. The CATH database: an extended protein family resource for structural and functional genomics. Nucleic Acids Res. 31:452-455. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28.Sá-Correia, I., A. M. Fialho, P. Videira, L. M. Moreira, A. R. Marques, and H. Albano. 2002. Gellan gum biosynthesis in Sphingomonas paucimobilis ATCC 31461: genes, enzymes and exopolysaccharide production engineering. J. Ind. Microbiol. Biotechnol. 29:170-176. [DOI] [PubMed] [Google Scholar]
- 29.Sali, A., and T. L. Blundell. 1993. Comparative protein modelling by satisfaction of spatial restraints. J. Mol. Biol. 234:779-815. [DOI] [PubMed] [Google Scholar]
- 30.Silva, E., A. R. Marques, A. M. Fialho, A. T. Granja, and I. Sa-Correia. 2005. Proteins encoded by Sphingomonas elodea ATCC 31461 rmlA and ugpG genes, involved in gellan gum biosynthesis, exhibit both dTDP- and UDP-glucose pyrophosphorylase activities. Appl. Environ. Microbiol. 71:4703-4712. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31.Sivaraman, J., V. Sauve, A. Matte, and M. Cygler. 2002. Crystal structure of Escherichia coli glucose-1-phosphate thymidylyltransferase (RffH) complexed with dTTP and Mg2+. J. Biol. Chem. 277:44214-44219. [DOI] [PubMed] [Google Scholar]
- 32.Sultana, Y., M. Aqil, and A. Ali. 2006. Ion-activated, Gelrite-based in situ ophthalmic gels of pefloxacin mesylate: comparison with conventional eye drops. Drug Deliv. 13:215-219. [DOI] [PubMed] [Google Scholar]
- 32a.Thoden, J. B., and H. M. Holden. 2007. The molecular architecture of glucose-1-phosphate uridylyltransferase. Protein Sci. 16:432-440. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33.Thorson, J. S., T. M. Kelly, and H. W. Liu. 1994. Cloning, sequencing, and overexpression in Escherichia coli of the alpha-d-glucose-1-phosphate cytidylyltransferase gene isolated from Yersinia pseudotuberculosis. J. Bacteriol. 176:1840-1849. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34.Videira, P. A., L. L. Cortes, A. M. Fialho, and I. Sa-Correia. 2000. Identification of the pgmG gene, encoding a bifunctional protein with phosphoglucomutase and phosphomannomutase activities, in the gellan gum-producing strain Sphingomonas paucimobilis ATCC 31461. Appl. Environ. Microbiol. 66:2252-2258. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 35.Vonrhein, C., E. Blanc, P. Roversi, and G. Bricogne. 2005. Automated structure solution with autoSHARP. Humana Press, Totowa, NJ. [DOI] [PubMed]
- 36.Zuccotti, S., D. Zanardi, C. Rosano, L. Sturla, M. Tonetti, and M. Bolognesi. 2001. Kinetic and crystallographic analyses support a sequential-ordered bi bi catalytic mechanism for Escherichia coli glucose-1-phosphate thymidylyltransferase. J. Mol. Biol. 313:831-843. [DOI] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.




