Abstract
Protein geranylgeranyltransferase-I (GGTase-I) catalyzes the transfer of a 20-carbon isoprenoid lipid to the sulfur of a cysteine residue located near the C terminus of numerous cellular proteins, including members of the Rho superfamily of small GTPases and other essential signal transduction proteins. In humans, GGTase-I and the homologous protein farnesyltransferase (FTase) are targets of anticancer therapeutics because of the role small GTPases play in oncogenesis. Protein prenyltransferases are also essential for many fungal and protozoan pathogens that infect humans, and have therefore become important targets for treating infectious diseases. Candida albicans, a causative agent of systemic fungal infections in immunocompromised individuals, is one pathogen for which protein prenylation is essential for survival. Here we present the crystal structure of GGTase-I from C. albicans (CaGGTase-I) in complex with its cognate lipid substrate, geranylgeranylpyrophosphate. This structure provides a high-resolution picture of a non-mammalian protein prenyltransferase. There are significant variations between species in critical areas of the active site, including the isoprenoid-binding pocket, as well as the putative product exit groove. These differences indicate the regions where specific protein prenyltransferase inhibitors with antifungal activity can be designed.
Candida albicans is an opportunistic pathogen associated with a variety of diseases, ranging from common vaginal yeast infections and oral thrush, to systemic fungal infections in immunocompromised individuals, such as AIDS patients and transplant recipients (1, 2). Estimates place the morbidity rate for systemic C. albicans infections that have reached the bloodstream at between 20 and 47% (1, 2). In addition, emerging resistance of C. albicans to existing antifungal therapies such as fluconazole has prompted a search for new drugs (3).
The protein prenylation pathway may be a potential new target for antifungal treatment (4-8). Prenylation is an essential post-translational lipid modification for more than 120 cellular proteins, including the signal transduction proteins in the Ras, Rho, and Rab families, PRL-family tyrosine phosphatases, centromeric proteins, and nuclear lamins (9, 10). Most eukaryotes possess three protein prenyltransferases: protein farnesyltransferase (FTase) adds a 15-carbon isoprenoid lipid. Protein geranylgeranyltransferase-I (GGTase-I)2 a single 20-carbon isoprenoid lipid, and protein geranylgeranyltransferase-II (GGTase-II or Rab GGTase) successively adds two 20-carbon isoprenoid lipids. In each case, the lipid is transferred to the γ sulfur of the cysteine residue. All three protein prenyltransferases are obligate heterodimers and share a homologous architecture. Both FTase and GGTase-I prenylate a C-terminal CaaX tetrapeptide recognition sequence of their substrate proteins (C, cysteine; aa, two generally aliphatic residues; and X, a specificity determining residue). The X-residue usually determines the cognate substrate for a given enzyme (11). GGTase-II recognizes CC or CXC (C, cysteine; X; variable residue) (12).
Crystal structures of major steps along the reaction cycle for mammalian FTase and GGTase-I have been elucidated (13, 14) and are summarized in Fig. 1. Kinetic analysis of the mechanism reveals that substrate binding is ordered (15-18). The lipid complex forms first, and the CaaX tetrapeptide second. The chemical step is followed by binding of the lipid substrate of the next turn of the reaction cycle, displacing the isoprenoid moiety of the product from its original binding site into an “exit groove.” (13, 14, 16) The new CaaX substrate binds after the product dissociates from the active site to continue the new cycle. Kinetic studies reveal that product release is the rate-limiting step in the reaction (19-21), an observation supported by the crystal structures of the displaced prenylated product complexes in FTase and GGTase-I (13, 14, 21).
FIGURE 1.
A, protein prenyltransferase reaction scheme. Alkylation of the cysteine γ sulfur by the isoprenoid (C1 position) produces prenylated CaaX tetrapeptide product; pyrophosphate is the leaving group. B, protein prenyltransferase reaction cycle, adapted from Ref. 13, 14. Enzyme binds the lipid substrate (red) first (complex 1), followed by CaaX tetrapeptide substrate (blue) binding to generate a ternary substrate complex 2. Prenylated product complex 3 is formed, and pyrophosphate is released. A fresh lipid substrate molecule (red) then binds to initiate a new turn of the cycle and to complete the current by displacing prenylated product complex into a product exit groove (green) to generate 4. Displaced product is released from the active site allowing binding of a new CaaX substrate and continuing the cycle.
Prenylation of human oncogenic mutants of the Ras, Rho, and Rap small GTPase families is essential for transformation (10, 22). Consequently, FTase and GGTase-I have been a major focus of cancer chemotherapeutic research for over a decade. Two FTase inhibitors, Lonafarnib (Schering) and Tipifarnib (Johnson & Johnson) have advanced to late-stage clinical trials (10) for the treatment of cancer.
In lower eukaryotes, prenylation of essential signal transduction proteins is also required for function. In pathogenic micro-organisms, such as C. albicans, disruption of protein prenylation of such essential cellular proteins has the potential for the development of new antifungal medications (5-8, 23). The Ram2 gene in C. albicans encodes the common α-subunit of FTase and GGTase-I; knock-out mutations of this gene are lethal (5). As in mammalian cells, knock-out of a single β-subunit is not lethal, because the remaining CaaX prenyltransferase can cross-prenylate non-cognate substrates (7). Even so, at least one series of selective CaGGTase-I inhibitors has shown significant antifungal activity, suggesting that impairment of one of the CaaX prenyltransferase may be sufficient for effective treatment (23). In C. albicans, Rho family substrates of the GGTase-I regulate cell wall biogenesis, while Ras family substrates of CaFTase have been strongly implicated in virulence by regulating the transition from yeast to hyphae (5). Selective inhibitors of C. albicans CaaX prenyltransferases would therefore provide tools for understanding morphological changes in this yeast as well as potential antifungal treatments.
Here we present the crystal structure of the CaGGTase-I at 1.8-Å resolution in complex with its cognate lipid substrate, geranylgeranylpyrophosphate (GGPP). This structure reveals the structure of a non-mammalian protein prenyltransferase and provides high-resolution structural insights into the evolution of the protein prenyltransferase mechanism in lower eukaryotes and mammals. Although the yeast enzyme shares a similar overall architecture with its mammalian ortholog, close inspection of the active site reveals areas of significant divergence in critical regions. In particular, mechanisms of isoprenoid selection differ in the two enzymes; the putative product exit groove also varies significantly. We postulate that the unique structural features of CaGGTase-I will provide opportunities to design selective inhibitors for the development of new anti-fungal therapeutics.
EXPERIMENTAL PROCEDURES
Cloning and Protein Expression—C. albicans genomic DNA (strain SC5314) was obtained from the American Type Culture Collection. The RAM2 gene encoding the α-subunit was amplified from the genomic DNA using standard PCR methods and Platinum Pfx High Fidelity polymerase (Invitrogen). The forward primer sequence (SalI restriction site underlined) was 5′-CTGACGCCATGGATGACAGACTCCAAATATGAC-3′ and the reverse primer sequence (NotI site underlined) was 5′-CATTATGCGGCCGCTTACACCGATGTGAG-3′. The insert was digested with restriction enzymes SalI and NotI (New England Biolabs) for subcloning into the expression vector.
Amplification of the CDC43 gene encoding β-subunit was challenging because of the particularly AT rich sequence. The insert was amplified in two steps. First, primers were designed to amplify from regions flanking the gene, ∼250 bases upstream and downstream of the coding sequence. PCR was performed using the Phusion High Fidelity DNA polymerase (New England Biolabs) using the manufacturer's suggested procol, with the forward primer 5′-TCAAACCGGCTTCTTCAAGT-3′ and the reverse primer 5′-TGTTGATTGTGTGTGTGGGA-3′. The resulting fragment of ∼1.7 kb was used as the PCR template for another round of PCR amplification using TaqDNA polymerase (Invitrogen) according to the manufacturer's protocol with the following primers: forward, 5′-GCGCTGCATATGAACCAACTGCTGATTAACAAACATGAGAAATTTTT-3′ (NdeI restriction site underlined); reverse, 5′-GCGCTGCTCGAGTTAATACTTTATTTTTTCTTTAAAAAATTGATACGATTCTTTTGTAATTG-3′ (XhoI site underlined). The resulting PCR product was cloned into the PCR2.1 TOPO vector using the TOPO TA cloning kit from Invitrogen. Plasmids isolated from colonies positive for insert incorporation were digested using restriction enzymes XhoI and NdeI (New England Biolabs) to liberate the CDC43 insert.
The pCDFDuet-I vector (Novagen) was chosen to co-express both subunits of the enzyme in Escherichia coli (E. coli); the vector contains two multiple cloning sites (MCS) under the control of separate isopropyl-1-thio-β-d-galactopyranoside-inducible T7 promoters for robust co-expression of two gene products. The digested RAM2 gene encoding the α-subunit was subcloned into the SalI and NotI restriction sites in MCSI of the pCDFDuet-I vector. The digested CDC43 gene encoding the β-subunit was subcloned into the NdeI and XhoI restriction sites of MCSII of the pCDFDuet-I vector to achieve the final expression construct. The Duke University Medical Center DNA Analysis Facility performed the sequence analysis to confirm error-free construction of the expression vector.
The final construct was transformed into C41 (DE3) Escherichia coli (AVIDIS, S.A.) for expression. A single colony was picked from the plate and grown overnight in a 50-ml LB culture supplemented with 50 mg/ml streptomycin. The 50-ml LB culture was used to inoculate 2 liters of LB media supplemented with streptomycin until the A600 reached 0.8, at which point the culture was induced with a final concentration of 1 mm of isopropyl-1-thio-β-d-galactopyranoside for 4 h at 37 °C. The culture was supplemented with 300 μm Zn(SO4)2 at induction. Cells were harvested at 6000 × g, and the cell paste could be flash-frozen in liquid nitrogen and stored at -80 °C for several months.
Protein Purification—The cell paste was resuspended in a 10-fold volume of Buffer A (20 mm Tris, pH 7.7, 5 mm dithiothreitol, 5 μm ZnCl2) supplemented with SigmaFast general use protease inhibitor tablets (Sigma). Cells were lysed using a pressurized homogenizer (Microfluidics Corp.) and the resulting crude lysate was clarified by centrifugation at 45,000 × g for 30 min. The lysate was first applied to a DEAE Sepharose column and fractionated using gradient from Buffer A + 150 mm NaCl to Buffer A + 300 mm NaCl over 8 column volumes. The fractions containing CaGGTase-I as determined by SDS-PAGE were pooled and brought to a final concentration of 0.8 m (NH4)2SO4 by 2-fold dilution with Buffer A + 1.6 m (NH4)2SO4. As with the purification of mammalian GGTase-I, a 2-fold molar excess of GGPP was added to the pooled fractions at this point as well, and the mixture was stirred at 4 °C for 20 min before application to the column. The GGPP putatively displaces nonspecifically bound lipids in the active site of the molecule and results in a narrower elution peak from the phenyl-Sepharose column. After incubation with GGPP, the protein was applied to a phenyl-Sepharose column pre-equilibrated in Buffer A + 0.8 m (NH4)2SO4, and fractionated using a gradient of Buffer A + 0.8 m (NH4)2SO4 to Buffer A over 8 column volumes. The fractions containing CaGGTase-I (SDS-PAGE) were pooled and adjusted to 10 mS/cm conductivity by dilution in Buffer A (Thermo Scientific conductivity meter) and applied to a Q-Sepharose column pre-equilibrated in Buffer A + 150 mm NaCl. The protein was fractionated using a gradient from Buffer A + 150 mm NaCl to Buffer A + 350 mm NaCl over 8 column volumes. The fractions containing CaGGTase-I (SDS-PAGE) were concentrated to 0.5 ml total volume using a centrifugal concentrator (50-kDa MWCO, Amicon), and applied to a 120-ml Superdex 16/10 gel filtration column equilibrated in Buffer A. The final fractions containing CaGGTase-I (SDS-PAGE) were concentrated again in a centrifugal concentrator (50 kDa MWCO, Amicon) to 15 mg/ml, flash-frozen in liquid nitrogen, and stored at -80 °C. Typical yield was ∼3-5 mg of purified CaGGTase-I per liter of E. coli culture.
Crystallization and Data Collection—GGPP was added to an aliquot of protein at a 0.5:1 GGPP/protein ratio for 30 min prior to setting up the crystallization drop. CaGGTase-I crystals were grown in hanging drop format in which 1 μl of protein solution was mixed with 1 μl of well solution consisting of 25% PEG 1500 and 1× PCB buffer, pH 7.0 (100 ml of 10× PCB contains 3.84 g of sodium propionate, 4.28 g of sodium cacodylate, and 11.29 g of Bis-Tris propane). Crystals appeared within 2-3 days and grew as long thin rods, with typical dimensions 400 μm × 50 μm × 50 μm. Prior to data collection, crystals were transferred to a stabilizing solution containing 1× PCB buffer pH 7.0, 30% PEG 1500, followed by cryoprotection in stabilizing solution plus 10% ethylene glycol. Crystals were flash-frozen in liquid nitrogen. Diffraction data were collected at Southeast Regional Collaborative Access Team (SER-CAT) 22-ID beamline at the Advanced Photon Source, Argonne National Laboratory at 100 K. Crystals diffract beyond 1.7-Å resolution; a complete data set was collected to 1.8-Å resolution (Table 1). CaGGTase-I crystallized in space group C2 (a = 132.3 Å, b = 66.05 Å, c = 82.8 Å, α = γ = 90.0°, β = 100.0°) with one molecule in the asymmetric unit. HKL2000 was used for data reduction and scaling.
TABLE 1.
Data collection and refinement statistics
| Resolution range Å | 37.0-1.8 |
| Data collection | ANL-APS SER-CAT |
| Reflections, unique/total | 63,895/230,601 |
| I/σ1a | 32.3 (3.10) |
| Rsym, %b | 9.3 (37.6) |
| Unit cell | |
| Dimensions (Å) | a = 132.3 |
| b = 66.1 | |
| c = 82.9 | |
| Angles (°) | α = γ = 90; β = 100.1 |
| Space group | C2 |
| Completeness %a | 100 (77.0) |
| Non-hydrogen atoms in refinement | |
| Protein | 5582 |
| Solvent | 471 |
| Rcryst/Rfreec | 19.2/23.0 |
| R.m.s.d. from ideal bond length, Å | 0.009 |
| R.m.s.d. from ideal bond angles, ° | 1.16 |
| Overall isotropic B factor, Å2 | 27.3 |
| Ramachandran plot | |
| Favored, % | 97.7 |
| Allowed, % | 100 |
Values in parentheses indicate value from high resolution bin.
Rsym = (∑|(I - <I>)|)/(∑I), where <I> is the average intensity of multiple measurements.
Rcryst and Rfree = (∑|Fobs - Fcalc|)/(∑Fobs|). Rfree was calculated over 5% of the amplitudes not used in refinement.
Structure Solution and Refinement—The CaGGTase-I structure was determined by molecular replacement using PHASER (24). A homology model derived from the rat GGTase-I structure (PDB code 1N4P, chains A and B, (14)) was constructed using MODELLER (25) and subsequently used as the search model. A cycle of simulated annealing was then performed on the solution in CNS (26), which gave an R factor of 41.4%. ARP/wARP (27) was then used to retrace the model, properly fitting regions not successfully fit into the density by the simulated annealing refinement; iterative automated chain tracing with ARP/wARP and refinement in REFMAC5 (28) produced a model with an R factor of 22.3%. Manual building and refinement in COOT (29) and REFMAC5, respectively, were carried out to correctly build termini, rebuild several loop regions, and fit ligands and water molecules. The final R/Rfree of the model is 19.8%/23.7%. MOLPROBITY (30), and composite omit electron density maps were used to validate the quality of the model. PYMOL was used to perform structure-based alignments and to prepare all figures (31). The atomic coordinates and structure factors (PDB code 3DRA) have been deposited with the Protein Data Bank, Research Collaboratory for Structural Bioinformatics, Rutgers University, New Brunswick, NJ.
RESULTS
Overall Structure of C. albicans GGTase-1—CaGGTase-I is an 82-kDa heterodimer consisting of 37-kDa α-subunit (306 residues) and a 45-kDa β-subunit (390 residues). Fig. 2A shows the overall structural features of the enzyme. The sequence identities with respect to the mammalian ortholog are 28 and 25% for the α- and β-subunits, respectively. Despite this low overall sequence identity, the structure of CaGGTase-I is quite similar in overall architecture to mammalian GGTase-I (1.58 Å r.m.s.d. calculated over all aligned α-carbon atoms, Fig. 2B).
FIGURE 2.
A, C. albicans protein geranylgeranyltransferase-I (α-subunit, red; β-subunit, blue). Secondary structure elements for each subunit are labeled, as are the lipid substrate GGPP (green) and catalytic Zn2+ ion (pink). B, CaGGTase-I superimposed on mammalian GGTase-I (PDB code 1N4P, gray).
The CaGGTase-I α-subunit is smaller than the mammalian protein (306 versus 377 amino acids). CaGGTase-I lacks an N-terminal domain rich in proline and glutamine residues present in the mammalian enzyme. This domain is disordered in all mammalian FTase and GGTase-I structures determined to date.
Like the mammalian enzyme, the CaGGTase-I α-subunit consists of α-helices. There are 16 helices altogether, one short helix more than in mammalian enzyme, arranged in antiparallel pairs to form a crescent that envelopes part of the β-subunit. We observe variation in helix length in the α-subunit. Helices 5α and 8α extend one helical turn longer than the corresponding mammalian helix, and helix 12α is approximately one helical turn shorter. Five amino acids are inserted between the equivalents to12α and 13α in the mammalian enzyme and form an additional short helix in CaGGTase-I (13α).
The β-subunit is slightly larger than that of mammalian GGTase-I (390 residues versus 377 for human GGTase-I). Variation in loop length between the helices in the β-subunit accounts for the additional residues. The β-subunit is predominantly α-helical forming an α-α barrel with a central, largely hydrophobic cavity, which contains most of the residues that bind substrate and coordinate the catalytic Zn2+ ion (Fig. 1). The CaGGTase-I β-subunit also has two short antiparallel β sheet regions (residues 67β-76β and 141β-154β) remote from the active site, which do not appear in the mammalian enzyme.
Two loops (residues 176β-188β, 261β-272β) and several residues at the termini (1α-2α, 305α-306α;1β) are not visible in the final electron density maps. The disordered loops correspond to two areas of allelic and strain variation in the β-subunit: the region 176-188 contains a polyasparagine tract that can vary between 6 and 17 residues, depending on strain and allele. The variant described here contains six asparagines. The region 261-272 contains a region rich in asparagine, aspartate, and glycine. The number of repeats of these residues also varies across strains and alleles (8); the amino acid sequence of the variant reported here is KDGNGDNGNGDN.
Isoprenoid Substrate Binding and Selection—Like mammalian GGTase-I (14), CaGGTase-I binds its isoprenoid substrate in a hydrophobic groove on one side of the active site cavity. The diphosphate moiety forms hydrogen bonds to lysine, arginine, and histidine residues, which are all mostly conserved across species. Although conformation of the GGPP lipid substrate is similar to mammalian GGTase-I (Fig. 3A), the binding mode of the fourth isoprene in CaGGTase-I varies significantly (Fig. 3A). In mammalian GGTase-I, this fourth isoprene is directed toward the CaaX substrate-binding pocket adjacent to the isoprenoid pocket; in the yeast GGTase-I, the phenylalanine 99β is bulkier than threonine 127β at the equivalent position in rat GGTase-I, directing the fourth isoprene away from the CaaX-binding site toward the edge of the active site cavity.
FIGURE 3.
Isoprenoid substrate binding and selection. A, top; a superposition of GGPP binding modes in CaGGTase-I (isoprenoid, green) and rat GGTase-I (isoprenoid, gray). The shift in binding mode of isoprene 4 is observed where the lipid contacts Phe99β in CaGGTase-I. This residue substitutes for the smaller T127β in rat GGTase-I. Bottom, a closeup of shift in isoprene 4 binding mode. View is rotated ∼45° from top. B, GGPP-binding pocket in gray surface representation illustrates the channel for isoprene 4 binding. The leucine residues (L98β and L352β in red), which line this channel permit the longer GGPP lipid substrate to bind. In CaFTase, these residues are replaced by tyrosines (Y259β and Y502β, cyan), which impinge upon the channel and occlude binding of the 20-carbon GGPP substrate, thereby selecting for the 15-carbon FPP substrate.
This conformational difference suggests that the C. albicans prenyltransferases have an alternative mechanism for isoprenoid substrate selection. In mammalian FTase and GGTase-I a single residue (W102β in FTase and T49β in GGTase-I) dominates selection of isoprenoid length (14, 32): the tryptophan in FTase presents a steric block to any isoprenoid lipid longer than a 15-carbon FPP, while the smaller threonine in GGTase permits binding of the additional five-carbon fourth GGPP isoprene. Mutagenesis of W102β to threonine converts FTase to a geranylgeranyltransferase (32).
In CaGGTase-I, isoprenoid selection appears to be governed by two residues (L98β and L352β), for which there are no equivalent in mammalian GGTase I. Inspection of a sequence alignment of CaGGTase-I β-subunit with the CaFTase β-subunit reveals variation at the positions equivalent to these leucine residues. CaFTase is predicted to have two tyrosines, 502β and 259β, in this position. A homology model suggests that these residues impinge on the binding site for the fourth isoprene, excluding GGPP from the binding site and selecting for the shorter FPP (Fig. 3B).
Zinc and Magnesium Dependence of the Prenylation Reaction—Like all protein prenyltransferases studied to date, CaGGTase-I is a Zn2+-dependent metalloenzyme (17, 33-37), with the Zn2+ activating the cysteine thiolate of the CaaX substrate for attack on the C1 carbon of the isoprenoid substrate. The Zn2+ coordination sphere is conserved and consists of an aspartic acid (D294β), cysteine (C296β), and histidine (H349β), forming a distorted pentacoordinate geometry with two ligands contributed by D294β at 2.23 and 2.38Å, a ligand from C296β at 2.00Å, and a ligand from the Nε of H349β at 2.29Å. Consistent with the structures of mammalian FTase and GGTase-I (14, 38), a water molecule occupies the position of a fifth ligand. In the mammalian enzymes, the water is displaced by the γ sulfur of the cysteine residue of the CaaX substrate upon peptide binding.
Unlike the mammalian GGTase-I, CaGGTase-I is dependent on millimolar levels of Mg2+ for its maximum reaction rate (8). This Mg2+ dependence is also observed in both mammalian and Saccharomyces cerevisiae FTases, as well as S. cerevisiae GGTase-I (17, 39-41). A Mg2+ ion is hypothesized to stabilize the diphosphate leaving group in the chemical step of the reaction (Fig. 4A) (13, 42). Modeling of a Mg2+ in the crystal structure of mammalian FTase indicates that aspartate D352β is positioned to coordinate this ion next to the diphosphate leaving group in the modeled transition state (Fig. 4A) (13). Mutagenesis studies further support that this residue binds Mg2+ (40). The structure of mammalian GGTase-I reveals that the terminal amine of a lysine side chain at the equivalent position could effectively substitute for Mg2+ at this position (Fig. 4A) (14, 39).
FIGURE 4.
Metal dependence of CaGGTase-I. A, comparison of mammalian FTase D352β (Mg2+ ligand) and mammalian GGTase-I K311β, shows a model in which the lysine could substitute for a Mg2+ in the mammalian GGTase-I reaction. A proposed transition state is shown (gray sticks with red/green diphosphate moiety) interacting with a Mg2+ or K311β to stabilize the negative charge buildup prior to pyrophosphate release. B, comparison of mammalian GGTase-I (gray) and CaGGTase-I (green). W312β of mammalian GGTase-I restricts the conformations of K311β and orients it toward the phosphate. In CaGGTase-I, aspartate D340β occupies the equivalent position to mammalian W312β but does not orient CaGGTase-I R339β for phosphate stabilization. Furthermore, D340β may serve to decreases the charge positive density in this region, thus necessitating higher Mg2+ concentration to stabilize the phosphate during chemistry.
The CaGGTase-I structure reveals that the equivalent position to D352β or K311β in the mammalian enzymes is arginine 339β (Fig. 4B). At neutral pH, the positive charge of arginine is delocalized over the guanidinium group. In the mammalian GGTase-I, the orientation of K311β is restricted by a tryptophan, W312β, effectively directing the positive charge toward the diphosphate (14) (Fig. 4B). In CaGGTase-I, the residue adjacent to R339β is an aspartate, D340β (Fig. 4B). We propose a 2-fold effect: first, with a smaller neighbor (aspartate), the arginine can explore multiple conformations, which is supported by the observation that its guanidinium group is poorly ordered in the electron density; and second, the negatively charged D340β diminishes the positive charge density in this region, requiring higher Mg2+ levels.
The Mg2+-dependent S. cerevisiae GGTase-I has a lysine at the position equivalent to mammalian K311β and CaGGTase-I R339β, but the mammalian tryptophan is replaced by asparagine N332β (isosteric with the CaGGTase-I D340β), suggesting that the lysine conformation is less restricted, similar to CaGGTase-I R339β.
CaaX Substrate-binding Pocket—In the mammalian FTase and GGTase-I, the CaaX protein substrate binds in an extended conformation (Fig. 1B) with the cysteine coordinating the catalytic Zn2+ ion and the C terminus anchored at the bottom of the pocket by a glutamine residue (Q167α) (11, 13, 14, 43-45). In addition, the CaaX substrate makes significant van der Waals contact with the lipid substrate (Fig. 1B) (11, 13, 14, 44, 45). We expect that the extended conformation will be recapitulated in CaGGTase-I, with the Zn2+ and Q104α (equivalent to Q167α in mammalian GGTase-I) acting as anchor points.
The CaGGTase-I β-subunit barrel forms a large central cavity, which binds the two substrates similar to mammalian GGTase-I (Figs. 1B and 2). Despite the similar overall architecture, CaGGTase-I exhibits significant variation in the identities of residues comprising the CaaX X-residue-binding site compared with the mammalian GGTase-I.
Most of the residues within a 5-Å radius of the X-residue are non-conservative substitutions compared with the mammalian enzyme (14) (Fig. 5). This arrangement suggests that either there is degeneracy in the recognition of the CaaX tetrapeptide (6, 8), or that the peptide substrates adopt a different binding mode, particularly with respect to the C-terminal X-residue. The structural adaptations of the binding site to accommodate the GGPP fourth isoprene are also likely to effect binding of the X-residue, because the two substrate-binding pockets are adjacent to each other and share several residues within van der Waals distance of both substrates (Fig. 1B).
FIGURE 5.
The CaaX peptide-binding site. Superposition of CaGGTase-I binary structure with rat GGTase-I ternary complex (PDB code 1N4Q of a GGPP analog and a CVIL CaaX substrate, yellow) shows the significant variation in the CaaX substrate-binding site. CaGGTase-I residues (orange) are those which are non-conservatively substituted compared with the mammalian GGTase-I (gray sticks).
Product Exit Groove—The reaction path determined for mammalian GGTase-I and FTase reveals the presence of a displaced prenylated product intermediate that precedes product release (Fig. 1B) (13, 14). The isoprenoid portion of this intermediate is bound in a solvent-exposed product prenylated product exit groove located adjacent to the CaaX substrate-binding site (Fig. 1B). Product release for the mammalian protein prenyltransferases is the slowest step (300-fold slower relative to the catalytic step) in the reaction (19, 21). The structurally determined pre-release intermediate product complex is consistent with this kinetic scheme.
Aside from mammalian prenyltransferases, only S. cerevisiae FTase (46) has been characterized by pre-steady state kinetic analysis. The kinetically defined reaction path of S. cerevisiae FTase (46) is similar to the mammalian enzymes, with the notable exception that the product release step is no longer the clearly dominant slow step (3-fold slower relative to the catalytic step). This suggests that the product is not stably bound in this enzyme. Inspection of sequence alignment between the mammalian and S. cerevisiae FTase sequences reveals there is significant variation in the residues lining the exit groove, suggesting that the latter lacks this groove, accounting for the large difference in product release rate constants.
Only the steady state kinetics parameters for the CaGGTase-I have been reported, and its overall turnover rate is not significantly different than is reported for mammalian GGTase-I (0.076 s-1 for CaGGTase versus 0.051 s-1) (6, 39). The rate constants for product release are not available for CaGGTase-I. The crystal structure reveals significant variation in the exit groove compared with the mammalian enzyme (Fig. 6). In particular, residues 17-24 in CaGGTase-I, which make up one side of the putative exit groove, are positioned on average 4 Å closer to the other wall of the exit groove than is seen in the mammalian enzymes (13, 14). This arrangement narrows this groove to 6.4 Å at its narrowest point, thereby presenting a steric block to a displaced isoprenylated product modeled in a similar position to that observed in mammalian GGTase-I. If a displaced isoprenylated product were part of the CaGGTase-1 reaction path, it therefore must bind shallowly in the exit groove. This arrangement is expected to change the rate constants for product release relative to the mammalian counterpart.
FIGURE 6.
The prenylated product exit groove region. Superposition of CaGGTase-I (green ribbon) and mammalian GGTase-I (gray) reveals a significant narrowing of the exit groove. Residues 17-24β in CaGGTase-I (cyan) are ∼4 Å closer to the other wall of the exit groove than seen in mammalian GGTase-I, creating a gap of only 6.4 Å at the narrowest point (between proline residues 21β and 347β, cyan). The isoprenoid moiety (red) of the displaced product cannot be accommodated in the same binding mode as seen in mammalian GGTase-I (PDB code 1N4S used for superposition to model the displaced product).
DISCUSSION
The CaGGTase-I structure reveals a high-resolution picture of a non-mammalian protein prenyltransferase. The Zn2+ coordination sphere, essential for all protein prenyltransferases, is completely conserved. Despite relatively low sequence identity, there is a remarkable degree of structural conservation in regions of the protein apparently non-essential for activity.
By contrast, there is variation in parts of the structure involved in the molecular recognition of both the lipid and CaaX substrates. Despite these differences in the CaaX protein substrate-binding pocket, CaGGTase-I exhibits nearly identical substrate preferences to the mammalian enzyme to the extent that different sequences have been tested (6, 8).
Furthermore, the CaGGTase-I structure shows that the recognition and release of isoprenylated products varies across species. In particular, the product exit groove that is observed in the mammalian enzyme and confers an unusual interplay between substrate binding and product release is lacking in the CaGGTase-I enzyme. This exit groove may not be required for monoprenylation but may instead be a necessary feature to confer the processive diprenylation observed in the in Rab GGTases (13, 14).
We propose that the structural differences observed between mammalian GGTase-I and CaGGTase-I will be sufficient to devise a structure-based design strategy to develop C. albicans-selective GGTase-I inhibitors. The variation in the CaaX-binding pocket, particularly the X-residue recognition residues, as well as the uniquely shaped exit groove provide opportunities to define ligands, which will be selective for the C. albicans enzyme.
Acknowledgments
We thank C. Nelson and S. Armstrong for assistance with cloning the genes for C.albicans GGTase-I and H. W. Hellinga for critical discussions.
The atomic coordinates and structure factors (code 3DRA) have been deposited in the Protein Data Bank, Research Collaboratory for Structural Bioinformatics, Rutgers University, New Brunswick, NJ (http://www.rcsb.org/).
This work was supported, in whole or in part, by National Institutes of Health Grant GM52382 (to L. S. B.). Use of the Advanced Photon Source was supported by the United States Dept. of Energy, Office of Science, Office of Basic Energy Sciences, under Contract No. W-31-109-Eng-38. The costs of publication of this article were defrayed in part by the payment of page charges. This article must therefore be hereby marked “advertisement” in accordance with 18 U.S.C. Section 1734 solely to indicate this fact.
Footnotes
The abbreviations used are: GGTase-I, protein geranylgeranyltransferase-I; FTase, protein farnesyltransferase; CaGGTase-I, C. albicans protein geranylgeranyltransferase-I; FPP, farnesylpyrophosphate; GGPP, geranylgeranylpyrophosphate; GGTase-II, protein geranylgeranyltransferase-II or rab geranylgeranyltransferase; Rab GGTase, protein geranylgeranyltransferase-II or rab geranylgeranyltransferase; CaaX, cysteine, two generally aliphatic residues, and X specificity determining variable residue; X-residue, the specificity determining residue of the CaaX motif; MWCO, molecular weight cut-off; PEG, polyethylene glycol; r.m.s.d., root mean square deviation; PDB, Protein Data Bank.
References
- 1.Pappas, P. G., Rotstein, C. M., Betts, R. F., Nucci, M., Talwar, D., De Waele, J. J., Vazquez, J. A., Dupont, B. F., Horn, D. L., Ostrosky-Zeichner, L., Reboli, A. C., Suh, B., Digumarti, R., Wu, C., Kovanda, L. L., Arnold, L. J., and Buell, D. N. (2007) Clin. Infect. Dis. 45 883-893 [DOI] [PubMed] [Google Scholar]
- 2.Zaoutis, T. E., Argon, J., Chu, J., Berlin, J. A., Walsh, T. J., and Feudtner, C. (2005) Clin. Infect. Dis. 41 1232-1239 [DOI] [PubMed] [Google Scholar]
- 3.White, T. C., Marr, K. A., and Bowden, R. A. (1998) Clin. Microbiol. Rev. 11 382-402 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.Murthi, K. K., Smith, S. E., Kluge, A. F., Bergnes, G., Bureau, P., and Berlin, V. (2003) Bioorg. Med. Chem. Lett. 13 1935-1937 [DOI] [PubMed] [Google Scholar]
- 5.Song, J. L., and White, T. C. (2003) Microbiology 149 249-259 [DOI] [PubMed] [Google Scholar]
- 6.Smalera, I., Williamson, J. M., Baginsky, W., Leiting, B., and Mazur, P. (2000) Biochim. Biophys. Acta 1480 132-144 [DOI] [PubMed] [Google Scholar]
- 7.Kelly, R., Card, D., Register, E., Mazur, P., Kelly, T., Tanaka, K. I., Onishi, J., Williamson, J. M., Fan, H., Satoh, T., and Kurtz, M. (2000) J. Bacteriol. 182 704-713 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Mazur, P., Register, E., Bonfiglio, C. A., Yuan, X., Kurtz, M. B., Williamson, J. M., and Kelly, R. (1999) Microbiology 145 1123-1135 [DOI] [PubMed] [Google Scholar]
- 9.Lane, K. T., and Beese, L. S. (2006) J. Lipid Res. 47 681-699 [DOI] [PubMed] [Google Scholar]
- 10.Basso, A. D., Kirschmeier, P., and Bishop, W. R. (2006) J. Lipid Res. 47 15-31 [DOI] [PubMed] [Google Scholar]
- 11.Reid, T. S., Terry, K. L., Casey, P. J., and Beese, L. S. (2004) J. Mol. Biol. 343 417-433 [DOI] [PubMed] [Google Scholar]
- 12.Seabra, M. C., Goldstein, J. L., Sudhof, T. C., and Brown, M. S. (1992) J. Biol. Chem. 267 14497-14503 [PubMed] [Google Scholar]
- 13.Long, S. B., Casey, P. J., and Beese, L. S. (2002) Nature 419 645-650 [DOI] [PubMed] [Google Scholar]
- 14.Taylor, J. S., Reid, T. S., Terry, K. L., Casey, P. J., and Beese, L. S. (2003) EMBO J. 22 5963-5974 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Hightower, K. E., Huang, C.-C., Casey, P. J., and Fierke, C. A. (1998) Biochemistry 37 15555-15562 [DOI] [PubMed] [Google Scholar]
- 16.Tschantz, W. R., Furfine, E. S., and Casey, P. J. (1997) J. Biol. Chem. 272 9989-9993 [DOI] [PubMed] [Google Scholar]
- 17.Zhang, F. L., and Casey, P. J. (1996) Biochem. J. 320 925-932 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18.Dolence, J. M., Cassidy, P. B., Mathis, J. R., and Poulter, C. D. (1995) Biochemistry 34 16687-16694 [DOI] [PubMed] [Google Scholar]
- 19.Pickett, J. S., Bowers, K. E., Hartman, H. L., Fu, H. W., Embry, A. C., Casey, P. J., and Fierke, C. A. (2003) Biochemistry 42 9741-9748 [DOI] [PubMed] [Google Scholar]
- 20.Huang, C., Hightower, K. E., and Fierke, C. A. (2000) Biochemistry 39 2593-2602 [DOI] [PubMed] [Google Scholar]
- 21.Furfine, E. S., Leban, J. J., Landavazo, A., Moomaw, J. F., and Casey, P. J. (1995) Biochemistry 34 6857-6862 [DOI] [PubMed] [Google Scholar]
- 22.Bishop, W. R., Kirschmeier, P., and Baum, C. (2003) Cancer Biol. Ther. 2 S96-104 [PubMed] [Google Scholar]
- 23.Sunami, S., Ohkubo, M., Sagara, T., Ono, J., Asahi, S., Koito, S., and Morishima, H. (2002) Bioorg. Med. Chem. Lett. 12 629-632 [DOI] [PubMed] [Google Scholar]
- 24.Storoni, L. C., McCoy, A. J., and Read, R. J. (2004) Acta Crystallogr. Sect. D-Biol. Crystallogr. 60 432-438 [DOI] [PubMed] [Google Scholar]
- 25.Eswar, N., Webb, B., Marti-Renom, M. A., Madhusudhan, M. S., Eramian, D., Shen, M. Y., Pieper, U., and Sali, A. (2007) Unit 2.9 Curr. Protoc. Protein Sci. [DOI] [PubMed]
- 26.Brünger, A. T., Adams, P. D., and Rice, L. M. (1997) Structure 5 325-336 [DOI] [PubMed] [Google Scholar]
- 27.Langer, G., Cohen, S. X., Lamzin, V. S., and Perrakis, A. (2008) Nat. Protoc. 3 1171-1179 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28.Murshudov, G. N., Vagin, A. A., and Dodson, E. J. (1997) Acta Crystallogr. Sect. D 53 240-255 [DOI] [PubMed] [Google Scholar]
- 29.Emsley, P., and Cowtan, K. (2004) Acta Crystallogr. Sect. D 60 2126-2132 [DOI] [PubMed] [Google Scholar]
- 30.Davis, I. W., Murray, L. W., Richardson, J. S., and Richardson, D. C. (2004) Nucleic Acids Res. 32 W615-619 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31.Delano, W. L. (2002) DeLano Scientific, San Carlos, CA
- 32.Terry, K. L., Casey, P. J., and Beese, L. S. (2006) Biochemistry 45 9746-9755 [DOI] [PubMed] [Google Scholar]
- 33.Chen, W. J., Moomaw, J. F., Overton, L., Kost, T. A., and Casey, P. J. (1993) J. Biol. Chem. 268 9675-9680 [PubMed] [Google Scholar]
- 34.Dolence, J. M., Rozema, D. B., and Poulter, C. D. (1997) Biochemistry 36 9246-9252 [DOI] [PubMed] [Google Scholar]
- 35.Witter, D. J., and Poulter, C. D. (1996) Biochemistry 35 10454-10463 [DOI] [PubMed] [Google Scholar]
- 36.Mayer, M. L., Caplin, B. E., and Marshall, M. S. (1992) J. Biol. Chem. 267 20589-20593 [PubMed] [Google Scholar]
- 37.Kalinin, A., Thoma, N. H., Iakovenko, A., Heinemann, I., Rostkova, E., Constantinescu, A. T., and Alexandrov, K. (2001) Protein Expr. Purif. 22 84-91 [DOI] [PubMed] [Google Scholar]
- 38.Park, H. W., Boduluri, S. R., Moomaw, J. F., Casey, P. J., and Beese, L. S. (1997) Science 275 1800-1804 [DOI] [PubMed] [Google Scholar]
- 39.Hartman, H. L., Bowers, K. E., and Fierke, C. A. (2004) J. Biol. Chem. 279 30546-30553 [DOI] [PubMed] [Google Scholar]
- 40.Saderholm, M. J., Hightower, K. E., and Fierke, C. A. (2000) Biochemistry 39 12398-12405 [DOI] [PubMed] [Google Scholar]
- 41.Stirtan, W. G., and Poulter, C. D. (1995) Arch. Biochem. Biophys. 321 182-190 [DOI] [PubMed] [Google Scholar]
- 42.Pickett, J. S., Bowers, K. E., and Fierke, C. A. (2003) J. Biol. Chem. 278 51243-51250 [DOI] [PubMed] [Google Scholar]
- 43.Reid, T. S., Long, S. B., and Beese, L. S. (2004) Biochemistry 43 9000-9008 [DOI] [PubMed] [Google Scholar]
- 44.Long, S. B., Casey, P. J., and Beese, L. S. (2000) Structure Fold Des. 8 209-222 [DOI] [PubMed] [Google Scholar]
- 45.Long, S. B., Hancock, P. J., Kral, A. M., Hellinga, H. W., and Beese, L. S. (2001) Proc. Natl. Acad. Sci. U. S. A. 98 12948-12953 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 46.Mathis, J. R., and Poulter, C. D. (1997) Biochemistry 36 6367-6376 [DOI] [PubMed] [Google Scholar]






