Abstract
The eukaryotic translation initiation factor 4A (eIF4A) is a member of the DEA(D/H)-box RNA helicase family, a diverse group of proteins that couples an ATPase activity to RNA binding and unwinding. Previous work has provided the structure of the amino-terminal, ATP-binding domain of eIF4A. Extending those results, we have solved the structure of the carboxyl-terminal domain of eIF4A with data to 1.75 Å resolution; it has a parallel α-β topology that superimposes, with minor variations, on the structures and conserved motifs of the equivalent domain in other, distantly related helicases. Using data to 2.8 Å resolution and molecular replacement with the refined model of the carboxyl-terminal domain, we have completed the structure of full-length eIF4A; it is a “dumbbell” structure consisting of two compact domains connected by an extended linker. By using the structures of other helicases as a template, compact structures can be modeled for eIF4A that suggest (i) helicase motif IV binds RNA; (ii) Arg-298, which is conserved in the DEA(D/H)-box RNA helicase family but is absent from many other helicases, also binds RNA; and (iii) motifs V and VI “link” the carboxyl-terminal domain to the amino-terminal domain through interactions with ATP and the DEA(D/H) motif, providing a mechanism for coupling ATP binding and hydrolysis with conformational changes that modulate RNA binding.
Eukaryotic initiation factor 4A (eIF4A) is a member of a protein family referred to as DEA(D/H)-box RNA helicases. [The “DEA(D/H)” motif by which this family is named is the Walker “B” motif that participates in binding MgATP]. Their identification as helicases historically is derived from conserved signature sequences first described by Gorbalenya and Koonin that are characteristic of RNA and DNA helicase proteins in general (1). Sequences for more than 500 representatives of this protein family, ranging in size from ≈400 aa to >1,200 aa, are available [see, for example, Pfam family 00270 (2)]; what they have in common is a conserved, ≈400-residue polypeptide segment that includes the helicase signature sequences. These proteins participate in diverse activities that involve interactions with RNA; representative activities of these proteins include RNA splicing (3), ribosome biogenesis (4), and RNA degradation (5). Moreover, the activities of many proteins that are identified by sequence similarity as members of this family remain obscure. Our present subject, the 394-residue yeast eIF4A protein, is a prototype, minimal DEA(D/H)-box helicase.
eIF4A participates in the initiation of polypeptide synthesis. Its apparent role is to facilitate the “melting” of secondary structure in mRNAs that might otherwise impede translation initiation [for reviews, see Pain (6) and Linder et al. (7)]. To accomplish this, eIF4A works in concert with other initiation factors; specifically, it is targeted to the 5′ cap region of mRNA by eIF4E (the cap-binding protein) and eIF4G, and it effects its helicase activity in cooperation with eIF4B (8); eIF4A has only a very weak helicase activity on its own (9). Thus, the intrinsic biochemical activities of eIF4A, which include RNA binding and an ATPase activity, are a necessary, but not sufficient component for the complex biological activities in which it participates. An analogous situation arises with many other proteins of this family, where a minimal DEA(D/H)-box helicase domain often works in conjunction with other proteins or, alternatively, with other domains within a large (≫400-residue), single polypeptide to effect a particular activity on a specific RNA substrate. By implication, the minimal DEA(D/H)-box helicase domain is a necessary, but often not sufficient, requirement for the activities in which these proteins participate.
In the classification of Gorbalenya and Koonin, helicases can be divided into several different “superfamilies” (1). Many DNA helicases fall into superfamily 1 (SF-1); among this group, crystallographic structures are available for the Escherichia coli Rep helicase (10) and the Bacillus stearothermophilus PcrA helicase (11–13). Crystallographic structures also are available for the UvrB DNA helicase (14–16) and hepatitis C virus (HCV) RNA helicase (17–20), which are members of superfamily 2 (SF-2). Although the DNA helicases and the HCV RNA helicase have substantial differences in structure—for example, the Rep and PcrA have four distinct structural domains, whereas HCV helicase has only three—it was found that the two domains that harbor the conserved helicase motifs have similar folding topologies and tertiary structures and that the conserved motifs, despite differing in sequence between the SF-1 and SF-2 families, superimpose at the same spatial positions within the structures (21).
The DEA(D/H)-box helicases fall into the SF-2 family in the classification of Gorbalenya and Koonin (1). The structure of the amino-terminal domain (residues 1–223) of eIF4A has been reported (22, 23); it is a parallel α-β domain, a subfragment of which is identical in folding topology and tertiary structure with the other helicases [and also with the recA protein (24)]. In this work, we report the structure of the carboxyl-terminal domain of eIF4A and discuss its relationship to other helicases. We also report the structure of full-length eIF4A. Using this information, we model hypothetical oligonucleotide-binding structures for eIF4A on the templates of other helicase structures. These models suggest involvement of the conserved helicase motifs of eIF4A in its functional activities.
Materials and Methods
Protein Expression and Purification.
A plasmid for recombinant expression of a carboxyl-terminal domain of yeast eIF4A (amino acid residues 230–394 with a L230 M mutation at the “start” codon) was derived from an expression plasmid for full-length eIF4A (23) essentially by deletion of codons 2–230 as follows: the original plasmid has an NdeI restriction site (CATATG) at the “start” codon (underlined); a second NdeI site was introduced at codons 229–230 by using the Stratagene “Quik-change” mutagenesis method; the resulting plasmid was digested with NdeI and religated to yield the desired deletion. Protein was expressed at 37°C in BL21(DE3) cells; expression was induced by the addition of 0.4 mM isopropyl-β-d-thio-galactoside (IPTG) when cells reached an OD600 of 0.5 and was allowed to continue for 5 h (25). Cells then were harvested by centrifugation, resuspended in 100 mM KCl/25 mM Bis-Tris, pH 7.4 (buffer A), and lysed by sonication. Lysate was clarified by centrifugation, and crude supernatant was applied to a Q-Sepharose column preequilibrated with buffer A. The column was eluted with a gradient of 0.1–0.6 M KCl. Fractions containing eIF4A carboxyl-terminal domain were combined and purified further by gel-filtration chromatography on a Superdex-75 column (Pharmacia). Eluant containing carboxyl-terminal protein was combined and concentrated to ≈22 mg/ml for crystallization.
Crystallization and Data Collection.
Initial crystallization trials with Hampton Screens I and II in the absence of reducing agents proved unsuccessful. Analysis by PAGE under native conditions revealed the presence of multimers of the protein; formation of multimers was suppressed by the addition of 50 mM 2-mercaptoethanol (BME). In the presence of reducing agent, native crystals grew as elongated rods from 20% polyethylene glycol monomethyl ether of average molecular weight of 550 (PEG-MME550), 1 mM ZnSO4, and 100 mM Mes, pH 6.5; the presence of Zn2+, or a similar divalent metal ion, was mandatory for crystallization. For a heavy atom derivative, 2 mM SmCl3 was used instead of ZnSO4 in the crystallization.
For data collection, the concentration of PEG-MME550 in the mother liquor was increased to 40% for cryoprotection and crystals were flash-cooled in a stream of nitrogen gas at ≈100 K. Native crystals are orthorhombic, space group P212121, a = 34.7 Å, b = 52.1 Å, c = 82.3 Å, with one molecule per asymmetric unit.
Native diffraction data were collected to a resolution limit of 1.75 Å on beamline 9–2 of the Stanford Synchrotron Radiation Laboratory (SSRL) at a wavelength of 0.980 Å. Multiwavelength anomalous dispersion (MAD) data for a Sm3+ derivative were collected to a resolution limit of 3.0 Å at the Lawrence Berkeley Laboratory Advanced Light Source (ALS) synchrotron at wavelengths 1.84533 Å, 1.84616 Å, and 1.77120 Å for the Sm3+ absorption peak, dispersive edge, and remote energies, respectively. In both cases, data were recorded on Quantum IV charge-coupled device detectors and processed with the programs hklview, denzo, and scalepack (26).
Structure Determination of the Carboxyl-Terminal Domain of eIF4A.
The structure of the carboxyl-terminal domain of eIF4A was solved by using experimental phases from the Sm3+ derivative. The location of a single Sm3+ ion was derived from anomalous difference Patterson peaks. MAD phases were computed by using the program package cns (27); the overall figure of merit to 3.0 Å resolution was 0.80; phasing statistics are summarized in Table 1. After solvent flattening, much of the polypeptide chain could be traced in the experimental electron density map. Subsequent rounds of model refinement against the native data, phase combination between model and experimental phases, and phase extension with model phases yielded a model that includes residues 233–351 and 357–394, 122 water molecules, and 1 Zn2+ ion. The loop from Arg-352 through Lys-356, as well as the first three amino-terminal residues, apparently is disordered and could not be traced. Residues 321–327 and 349–351 near the Zn2+ site have relatively high average B factors, suggesting that zinc binding may distort the structure locally. Only one residue (Glu-241) is in the disallowed region of the Ramachandran plot; well defined electron density and a distinct hydrogen bond network support the unorthodox conformation of this residue. Refinement statistics are summarized in Table 2.
Table 1.
Wavelength, Å | Observations (total/unique) | Completeness | Rsym* | <l>/<σ(l)> | Phasing power | f′ | f" |
---|---|---|---|---|---|---|---|
λ1 = 1.77120 | 13729/5570 | 0.950 (0.945) | 0.040 (0.056) | 15.15 (15.02) | 2.48 | −12.1 | 10.9 |
λ2 = 1.84616 | 14174/5667 | 0.971 (0.980) | 0.041 (0.062) | 15.34 (13.93) | 3.04 | −36.1 | 15.2 |
λ3 = 1.84533 | 14176/5666 | 0.967 (0.954) | 0.047 (0.069) | 15.32 (14.03) | 3.77 | −21.5 | 35.8 |
Resolution (Å) | |||||||||
---|---|---|---|---|---|---|---|---|---|
5.96 | 4.74 | 4.14 | 3.77 | 3.5 | 3.29 | 3.13 | 2.99 | Total | |
Reflections | 641 | 700 | 672 | 701 | 708 | 741 | 716 | 659 | 5538 |
Figure of merit | 0.810 | 0.833 | 0.775 | 0.798 | 0.794 | 0.776 | 0.795 | 0.764 | 0.799 |
Sm3+ derivative of the carboxyl-terminal domain of yeast eIF4A. Resolution range (last shell), 26.6–3.0 (3.11–3.00) Å.
Space group: P212121; unit cell (Å) a = 34.7, b = 52.1, c = 82.3. Values of f′ and f" were initially estimated from an EXAFS scan and refined in cns.
Rsym = ∑|Ihkl − <Ihkl>|/∑<Ihkl>, where Ihkl = single value of measured intensity of hkl reflection, and <I> = mean of all measured value intensity of hkl reflection.
Table 2.
Carboxyl-terminal domain of eIF4A* | Full-length eIF4A† | |
---|---|---|
Wavelength, Å | 0.980 | 0.980 |
Resolution range (last shell), Å | 30–1.75 (1.81–1.75) | 40–2.8 (2.9–2.8) |
Observations (total/unique) | 41481/15307 | 53337/17192 |
Completeness, % | 96.5 (93.2) | 92.0 (80.4) |
Completeness, I>3σ | 87.3 (64.4) | NA |
Rsym‡ | 0.029 (0.151) | 0.071 (0.097) |
<I>/<σ(l)> | 17.4 | 18.3 |
Rcryst§ | 0.221 | 0.244 |
Rfree | 0.252 | 0.273 |
rmsd bond length, Å | 0.010 | 0.010 |
rmsd angles, ° | 1.51 | 1.69 |
No. of residues modeled | 157 | 593 |
No. of waters modeled | 122 | 146 |
Space Group: P212121, unit cell (Å) a = 34.7, b = 52.1, c = 82.3 (Å). One protomer per asymmetric unit.
Space group P1, unit cell a = 38.8, b = 71.3, c = 73.2 (Å), α = 94.0°, β = 89.6°, γ = 101.0°. Two protomers per asymmetric unit.
Rsym is defined in Table 1.
Rcryst = ∑|Fobs − Fcalc|/∑Fobs, where Fobs = observed structure factor amplitude and Fcalc = structure factor calculated from model. Rfree is computed in the same manner as Rcryst, using a test set composed of 15% (carboxyl-terminal domain) or 10% (full-length eIF4A) of the observed reflections, which were randomly selected and omitted from all model-phase calculations.
Structure Determination of Full-Length eIF4A.
The structure of one protomer of the full-length eIF4A molecule was completed by using molecular replacement with the model of the carboxyl-terminal domain and data from triclinic crystals described previously (23). These crystals have two eIF4A molecules per asymmetric unit, related by an approximate noncrystallographic 2-fold screw axis. Earlier work described the complete structures of both amino-terminal domains and, additionally, a partial polypeptide backbone model for one carboxyl-terminal domain; however, the carboxyl-terminal domains are significantly disordered in this crystal form, defeating a de novo determination of their structure. One copy of the refined molecular model of the carboxyl-terminal domain of eIF4A was placed initially by using the partial backbone trace as a guide; subsequent rigid body refinement, model building, and model refinement resulted in a model for this copy of eIF4A that includes residues 11–350 and 356–394. Placing the carboxyl-terminal domain for the second eIF4A protomer in these crystals yielded a molecular replacement solution that improved the overall refinement statistics; however, the resulting 2Fo − Fc and Fo − Fc electron density maps in this region could not be interpreted reliably, and the average B factor for this domain was >100 Å2, confirming that this domain is substantially more disordered than its counterpart in the first protomer. Hence, the carboxyl-terminal domain of this protomer was deemed too uncertain to be included in the final model. Refinement statistics for the crystals of full-length eIF4A are summarized in Table 2.
Results and Discussion
We first solved the crystallographic structure of the carboxyl-terminal domain of eIF4A with high-resolution data. The resulting model was used in molecular replacement to complete the structure of full-length eIF4A at lower resolution. Finally, we have superimposed the model of the carboxyl-terminal domain on the helicase “template” that is provided by structures of the Rep, PcrA, and HCV helicases to develop a working hypothesis for the involvement of specific amino acid residues in the biochemical activities of eIF4A.
Structure of the Carboxyl-Terminal Domain of eIF4A.
The crystallographic structure of the carboxyl-terminal domain of eIF4A has been solved to a resolution of 1.75 Å. The average B factor for the structure is 35.7 Å2. The anticipated coordinate error, estimated by the method of Luzzati (28), is 0.28 Å. A single Zn2+ ion is coordinated by the side chains of residues Glu-248 and Asp-252 of one molecule and Glu-341 and His-345 of a neighboring molecule.
The carboxyl-terminal domain has a parallel α-β structure (Fig. 1) with the same topology as the equivalent domain of other helicases [domain 2A of the Rep (10) and PcrA (11) DNA helicases; domain 2 of the HDV RNA helicase (17)]. The α-carbon backbone of this domain of eIF4A superimposes on the backbones of the Rep, PcrA, HCV, and UvrB helicases with root mean square differences of 1.8 Å, 1.9 Å, 1.8 Å, and 2.0 Å for 86, 84, 85, and 112 equivalent Cα atoms, respectively. The conserved topological “core” shared by these helicases is composed of the six parallel β-strands plus helices 1, 2, 4, and 5. Helix 3, which follows β-strand 3 of eIF4A carboxyl-terminal domain, is not found in DNA helicases of SF-1, for which the polypeptide emigrates to another structural domain (domain 2B) after β-strand 3, and reenters the domain at β-strand 4. α-Helix 6 is not present in HCV. Additionally, eIF4A has a short segment of ≈16 residues after helix 6, which adds a seventh β-strand and a well defined coil at the carboxyl terminus.
The conserved helicase sequence motifs IV, V, and VI, as originally described by Gorbalenya and Koonin (1), are highlighted in Fig. 1. Their positions in the eIF4A domain are topologically equivalent to the positions of these motifs in other helicases (note that motif IV would be equivalent to the recently described motif IVa of DNA helicases, as well as to motif IV of HCV helicase, as discussed in ref. 21). In addition, we observe an arginine residue, Arg-298 in helix 3, whose side chain is well ordered in the structure and whose spatial position suggests an involvement in RNA binding. Inspection of sequence alignments of DEA(H/D) RNA helicases [e.g., Pfam version 5.4, family 00271 (2)] reveal that this arginine is strictly conserved and, further, that it occurs as part of a QXXR motif, where the X amino acids often are charged. Notably, it is not present in the DNA helicases of SF-1, which lack this helix (and, hence, would not have been identified as a motif in sequence alignments looking for correlations across all helicases). The suggestion that Arg-298 may participate in RNA binding is discussed in more detail below.
After this structure was completed, we searched for similar structures in the Protein Data Bank through the Dali server (www2.ebi.ac.uk/dali). The search identified a domain of the UvrB helicase as the most closely related structure. In addition, we submitted the amino acid sequence to several “fold recognition” servers. The pdb-blast method (bioinformatics.burnham-inst.org/pdb_blast) identified UvrB as the closest homolog for which a structure is known; this method correctly aligned eIF4A and UvrB over a stretch of 80 residues for which the structures superimpose without insertions or deletions; however, the method failed to accurately identify insertions and deletions that would have correctly superimposed shorter segments of the polypeptide backbone.
Structure of Full-length eIF4A.
In previous work on crystals of full-length eIF4A protein (23), the amino-terminal domains of the two crystallographically independent protomers in the asymmetric unit were modeled successfully, but it proved impossible to complete a polypeptide trace of the carboxyl-terminal domains because of the high noise level of the electron density maps in those regions, indicative of significant disorder. In this work, we have used molecular replacement with the refined, high-resolution model of the carboxyl-terminal domain to complete the structure of the more well ordered of the two full-length molecules in these crystals. The polypeptide linker between the domains was traced, and minor adjustments were made within the carboxyl-terminal domain by interpreting 2Fo − Fc and Fo − Fc maps during the course of refinement. The average B factor for this domain is 78.6 Å2, as compared with 29.9 Å2 for the amino-terminal domains, illustrating the substantial disorder. Segments of polypeptide whose continuity could be ascertained reliably, but in which conformational details were clouded by noise, were residues 256–259, 284–285, 317–323, and 365–368. The resulting model of full-length eIF4A protein reliably shows the overall tertiary structure of the molecule, despite the relatively low precision in prescribing some of the conformational details of the carboxyl-terminal domain.
In this crystal form, eIF4A has a “dumbbell” structure in which the amino- and carboxyl-terminal domains are connected by an extended, 11-residue linker (Fig. 2). The end-to-end length of the molecule is ≈80 Å; the linker is ≈18 Å long. The conserved helicase motifs on the amino- and carboxyl-terminal domains have no spatial relationship that suggest a mechanism of coupling ATP binding/hydrolysis with RNA binding/release in this structure. This structure and the intrinsic disorder of the carboxyl-terminal domain within it suggest that, in solution, eIF4A can be a distended molecule and that the linker between domains is relatively flexible (although it does not have an unusually high fraction of glycine, alanine, or serine, which is characteristic of many flexible polypeptides). (Implicitly, we assume that crystallization has selected and stabilized, through packing contacts, one representative conformation from the many that the molecule can access in solution.)
Modeling a Compact RNA-Binding Structure on a Helicase Template.
The extended dumbbell structure of eIF4A is consistent with some published data on eIF4A activity, but inconsistent with other data. Under some conditions, the size of the single-stranded RNA-binding site is approximately 15 nt per eIF4A monomer (29), which is compatible with the extended structure. However, mutations in motif VI of the carboxyl-terminal domain (cyan in Fig. 2) reduce the hydrolysis rate for ATP bound to the amino-terminal domain (specifically, bound to motif I, blue in Fig. 2) (30); the extended structure is incompatible with this observation. To account for this observation, it is necessary to suggest that when ATP binds eIF4A, the protein undergoes a conformational change that results in a compact structure in which the two domains interact directly with each other. Changes in proteolytic digestion patterns have demonstrated that eIF4A undergoes a series of nucleotide- and RNA-dependent conformational changes (31, 32), which are consistent with this suggestion, although the proteolysis patterns do not, by themselves, demonstrate juxtaposition of the two domains.
The two domains of eIF4A share a structural similarity with their counterparts in other helicases (this work and refs. 22 and 23), and the conserved helicase motifs occur at similar positions in the structures. It is likely that the similarity in domain structure is mandated by similarity in the mechanisms of interaction with ATP and oligonucleotides. In this context, we suggest that eIF4A forms a compact structure similar to that of other helicases at some point in its ATP-dependent helicase cycle. Although this suggestion may seem inconsistent with the extended structure we observe in our crystals, it should be borne in mind that the eIF4A molecule by itself has an intrinsically weak helicase activity (9), which can be enhanced substantially by the collaboration of eIF4B (8) or eIF4H (33). By implication, the collaborating proteins may facilitate formation and stabilization of a compact eIF4A structure. We may then ask hypothetically, “What would such a model suggest about interactions of eIF4A with RNA and ATP, and are there data that would justify the model?”
We have superimposed both the amino- and carboxyl-terminal domains of eIF4A onto their counterparts in the crystallographic structures of PcrA (PDB id 3PJR), Rep (PDB id 1UAA), and HCV (PDB id 1A1V) helicases complexed with oligonucleotides, as well as UvrB (PDB id 1D9X) without oligonucleotide. Representative models are shown in Fig. 3, with the orientation of the amino-terminal domain of eIF4A approximately the same as in Fig. 2. In these models, the distance from eIF4A residue 223 to residue 233 is 18–20 Å, which can be accommodated readily by the linker polypeptide. Such modeling is necessarily imprecise, because (i) the positioning of the carboxyl-terminal domain relative to the amino-terminal domain varies by several Angstroms from one helicase template to another and (ii) for both PcrA and HCV helicases, several structures have been solved in different crystal forms and with different ligands (DNA, AMPPNP, ADP) bound to the proteins; comparison of these structures reveals substantial intramolecular conformational changes that are thought to be related to the helicase mechanism (12, 20). Nonetheless, the models share several features: (i) an extended single-stranded oligonucleotide-binding region across the “top” of domains 1 and 2 of other helicases (in the orientation shown in Fig. 3) and (ii) juxtaposition of motifs I and II of domain 1 (the classical Walker A and B nucleotide-binding motifs, colored blue and red, respectively, in Fig. 3) with motifs V and VI of domain 2 (colored magenta and cyan, respectively). For eIF4A, these features translate into (i) a putative RNA-binding region with an extended network of arginines, viz., Arg-98 (of motif IA) and 148 (of the TPGR motif) of the amino-terminal domain, and Arg-269, -270, and -298 (plus, perhaps, 321) from the carboxyl-terminal domain, along the interface of the single-stranded oligonucleotides, and (ii) juxtaposition of side chains of several amino acid residues of motifs V and VI with the ATP-binding site and the DEAD motif of the amino-terminal domain, suggesting a role in coupling interactions with ATP or ADP to conformational changes in the protein. Representative specific interactions observed in other helicase structures, and the extrapolated interactions that they suggest for the carboxyl-terminal motifs of eIF4A, are summarized in Table 3. [The probable role of the motifs in the amino-terminal domain if eIF4A has been discussed previously (22, 23) and will not be repeated here.]
Table 3.
Motif | Sequence | Interactions | |
---|---|---|---|
IV | PcrA (355–365): | AVLYRTNAQSR | R359 binds DNA and forms salt bridge to E600; N361 interacts with ssDNA; R365 near dsDNA. |
Rep (346–356): | AILYRGNHQSR | R350, N352* H353*, R356* interact with ssDNA. | |
HCV (365–372): | LIFCHSKK | K371 interacts with ssDNA. | |
UvrB (444–451): | LVTVKTVR | R451 may interact with DNA. | |
IF4A (263–270): | VIFCNTRR | R269 and/or R270 may interact with ssRNA. | |
V | PcrA (565–571): | HAAKGLE | H565 interacts with ssDNA; K568 forms salt bridge with E224 and D227 of DEYQD227 motif†; E571 interacts with ribose of AMPPNP. |
Rep (558–564): | HASKGLE | H558 interacts with DNA; K561 forms salt bridge with D218 of DEYQD218 motif†. | |
HCV (413–419): | ALMTGFT | No specific interactions for these residues. | |
UvrB (501–505): | REGLD | R501 may interact with DNA; D505 interacts with ATP. | |
IF4A (321–324): | RGID | R321 may interact with DEAD172 motif† or with ssRNA; D324 may interact with ATP. | |
VI | PcrA (606–613): | VGITRAEE | R610 interacts with γ phosphate of AMPPNP. |
Rep (598–605): | VGITRAQK | R602 is near the P-loop and may interact with ATP. | |
HCV (460–467): | QRRGRTGR | Q460 interacts with H290 of DECH290 motif; a conformational change could allow R464 and R467 to interact with phosphates of ATP. | |
UvrB (531–538): | QTIGRAAR | Q531 interacts with H341 of DESH341 motif; R535 and R538 interact with phosphates of ATP. | |
IF4A (345–352): | HRIGRGGR | H345 may interact with D172 of DEAD172 motif; R349 and R352 could interact with phosphates of ATP. |
Amino acid residues in PcrA, Rep, HCV and UvrB helicases showing specific interactions that are likely to be involved in biochemical activity are boldface underlined; residues for which one can propose hypothetical interactions in eIF4A and other helicases are shown in boldface italics. Motif sequences are aligned vertically. For UvrB, Thermus thermophilus sequence and numbering are used. ssDNA, single-stranded DNA; dsDNA, double-stranded DNA.
In at least one of the two independent copies of Rep in the crystallographic asymmetric unit.
The classical DExx (Walker “B”) motif is four residues in length; interactions are seen with the residue following the DExx motif.
Motif IV participates in oligonucleotide binding in all of the helicase–DNA complexes. Arg-269 and Arg-270 of eIF4A (colored green in Figs. 1–3) may interact with RNA.
The side chain of Arg-298 (colored purple in Figs. 1–3) is well ordered in the eIF4A structure, and sequence alignments suggest that this residue belongs to a “QXXR” motif that is specific to the DEA(D/H)-box helicases. In the models shown in Fig. 3, it would interact with oligonucleotides.
In the DNA helicases, residues from motif V “bridge” the ATP-binding site and the oligonucleotide-binding site. A histidine at the beginning of the motif (see Table 3) interacts with DNA; a lysine in the middle interacts with the DEYQD motif (the equivalent of the DEAD motif of eIF4A, plus the aspartic acid that follows it), and in PcrA with ATP bound, the carboxyl of Glu-571 interacts with the ribose of ATP.
In motif V of the DEA(D/H)-box helicases, only the four-residue RGID is conserved. Arg-321 of eIF4A could interact with either the DEAD motif or with the oligonucleotide. Asp-324 may interact with ATP similar to Glu-571 of PcrA. This motif therefore could play a role in coupling ATP binding and hydrolysis to RNA binding and/or a conformational change.
In mouse eIF4A, mutation of any of the three arginines in motif VI to glutamine reduces the ATPase activity to less than 20% of its wild-type value, whereas mutation of either the second or third arginine to lysine yielded a protein that retained >60% of its wild-type activity. Each of these mutations abolishes helicase activity in an assay employing both eIF4A and eIF4B. These data support an involvement of residues in this motif in both ATP hydrolysis and coupling of ATP binding or hydrolysis to RNA binding and helicase activity (30).
In motif VI of PcrA, the Arg-610 binds the γ-phosphate of ATP. In UvrB, Arg-535 and Arg-538 both interact with the phosphates of ATP. These interactions suggest that in eIF4A, Arg-349 and/or Arg-352 may bind the phosphates of ATP, although modeling the interactions is hampered by the fact that we have been unable to trace residues 352–356 in the structures.
His-345 of eIF4A is in a position to form a salt bridge to Asp-172, the final aspartic acid of the DEAD motif, resulting in a histidine–aspartate interaction. In both the HCV and UvrB helicases, we see a similar interaction, but with a “switch” of residues; to wit, H → Q in motif VI and D → H in the DExx motif, resulting in a glutamine–histidine interaction. It is interesting to note that in the sequences of most DEAH-box helicases, a glutamine is conserved at the sequence position of His-345 in eIF4A, which also would result in the same glutamine–histidine interaction; this “bridge” between motif VI and the DExx motif is found in many helicases of SF-2 in one or the other of these two alternative versions.
Although the interactions we propose are tentative, they reconcile results of previous mutagenesis studies with the structural information and provide a working model for examining the role of residues whose functional significance heretofore has not been tested.
Conclusions
The high-resolution structure of the carboxyl-terminal domain of eIF4A reveals the structural similarity of this domain in DEA(D/H)-box helicases to the equivalent domain in other helicases. In our crystals, full-length eIF4A protein is a “dumbbell” molecule with two globular domains connected by an extended (and in all probability, flexible) linker. Modeling a hypothetical compact, oligonucleotide-binding structure for eIF4A on the templates provided by other helicase structures gives us a testable, working model for intermolecular interactions with RNA and for intramolecular interactions that may couple the ATPase cycle to conformational changes and helicase activity in eIF4A.
Acknowledgments
We thank Ram Samudrala for assistance with fold recognition programs and beamline staff members of SSRL and ALS for assistance with synchrotron data collection. This work was supported by award MCB-9874582 from the National Science Foundation to D.B.M. and a Stanford Dean's Postdoctoral Fellowship to J.M.C. This work is based on research conducted at SSRL, which is funded by the Department of Energy (BES, BER) and National Institutes of Health (NCRR, National Institute of General Medical Sciences), and at Advanced Light Source, which is funded by the Department of Energy.
Abbreviations
- eIF4A
eukaryotic initiation factor 4A
- HCV
hepatitis C virus
- SF-1
helicase superfamily 1
- SF-2
helicase superfamily 2
Footnotes
Data Deposition: The atomic coordinates and structure factors have been deposited in the Protein Data Bank, www.rcsb.org (PDB ID codes 1FUU and 1FUK for full-length eIF4a and its carboxyl-terminal domain, respectively).
References
- 1.Gorbalenya A E, Koonin E V. Curr Opin Struct Biol. 1993;3:419–429. [Google Scholar]
- 2.Bateman A, Birney E, Durbin R, Eddy S R, Howe K L, Sonnhammer E L. Nucleic Acids Res. 2000;28:263–266. doi: 10.1093/nar/28.1.263. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.Staley J P, Guthrie C. Cell. 1998;92:315–326. doi: 10.1016/s0092-8674(00)80925-3. [DOI] [PubMed] [Google Scholar]
- 4.Venema J, Tollervey D. Yeast. 1995;11:1629–1650. doi: 10.1002/yea.320111607. [DOI] [PubMed] [Google Scholar]
- 5.Py B, Higgins C F, Krisch H M, Carpousis A J. Nature (London) 1996;381:169–172. doi: 10.1038/381169a0. [DOI] [PubMed] [Google Scholar]
- 6.Pain V M. Eur J Biochem. 1996;236:747–771. doi: 10.1111/j.1432-1033.1996.00747.x. [DOI] [PubMed] [Google Scholar]
- 7.Linder P, Gasteiger E, Bairoch A. Yeast. 2000;16:507–509. doi: 10.1002/(SICI)1097-0061(200004)16:6<507::AID-YEA549>3.0.CO;2-N. [DOI] [PubMed] [Google Scholar]
- 8.Jaramillo M, Dever T E, Merrick W C, Sonenberg N. Mol Cell Biol. 1991;11:5992–5997. doi: 10.1128/mcb.11.12.5992. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Rogers G W, Jr, Richter N J, Merrick W C. J Biol Chem. 1999;274:12236–12244. doi: 10.1074/jbc.274.18.12236. [DOI] [PubMed] [Google Scholar]
- 10.Korolev S, Hsieh J, Gauss G H, Lohman T M, Waksman G. Cell. 1997;90:635–647. doi: 10.1016/s0092-8674(00)80525-5. [DOI] [PubMed] [Google Scholar]
- 11.Subramanya H S, Bird L E, Brannigan J A, Wigley D B. Nature (London) 1996;384:379–383. doi: 10.1038/384379a0. [DOI] [PubMed] [Google Scholar]
- 12.Velankar S S, Soultanas P, Dillingham M S, Subramanya H S, Wigley D B. Cell. 1999;97:75–84. doi: 10.1016/s0092-8674(00)80716-3. [DOI] [PubMed] [Google Scholar]
- 13.Soultanas P, Dillingham M S, Velankar S S, Wigley D B. J Mol Biol. 1999;290:137–148. doi: 10.1006/jmbi.1999.2873. [DOI] [PubMed] [Google Scholar]
- 14.Machius M, Henry L, Palnitkar M, Deisenhofer J. Proc Natl Acad Sci USA. 1999;96:11717–11722. doi: 10.1073/pnas.96.21.11717. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Nakagawa N, Sugahara M, Masui R, Kato R, Fukuyama K, Kuramitsu S. J Biochem (Tokyo) 1999;126:986–990. doi: 10.1093/oxfordjournals.jbchem.a022566. [DOI] [PubMed] [Google Scholar]
- 16.Theis K, Chen P J, Skorvaga M, Van Houten B, Kisker C. EMBO J. 1999;18:6899–6907. doi: 10.1093/emboj/18.24.6899. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17.Yao N, Hesson T, Cable M, Hong Z, Kwong A D, Le H V, Weber P C. Nat Struct Biol. 1997;4:463–467. doi: 10.1038/nsb0697-463. [DOI] [PubMed] [Google Scholar]
- 18.Cho H S, Ha N C, Kang L W, Chung K M, Back S H, Jang S K, Oh B H. J Biol Chem. 1998;273:15045–15052. doi: 10.1074/jbc.273.24.15045. [DOI] [PubMed] [Google Scholar]
- 19.Kim J L, Morgenstern K A, Griffith J P, Dwyer M D, Thomson J A, Murcko M A, Lin C, Caron P R. Structure. 1998;6:89–100. doi: 10.1016/s0969-2126(98)00010-0. [DOI] [PubMed] [Google Scholar]
- 20.Yao N, Reichert P, Taremi S S, Prosise W W, Weber P C. Struct Fold Des. 1999;7:1353–1363. doi: 10.1016/s0969-2126(00)80025-8. [DOI] [PubMed] [Google Scholar]
- 21.Korolev S, Yao N, Lohman T M, Weber P C, Waksman G. Protein Sci. 1998;7:605–610. doi: 10.1002/pro.5560070309. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.Benz J, Trachsel H, Baumann U. Structure. 1999;7:671–679. doi: 10.1016/s0969-2126(99)80088-4. [DOI] [PubMed] [Google Scholar]
- 23.Johnson E R, McKay D B. RNA. 1999;5:1526–1534. doi: 10.1017/s1355838299991410. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24.Story R M, Weber I T, Steitz T A. Nature (London) 1992;355:318–325. doi: 10.1038/355318a0. [DOI] [PubMed] [Google Scholar]
- 25.Studier F W, Rosenberg A H, Dunn J J, Dubendorff J W. Methods Enzymol. 1990;185:60–89. doi: 10.1016/0076-6879(90)85008-c. [DOI] [PubMed] [Google Scholar]
- 26.Otwinowski Z, Minor W. Methods Enzymol. 1997;276:307–326. doi: 10.1016/S0076-6879(97)76066-X. [DOI] [PubMed] [Google Scholar]
- 27.Brunger A T, Adams P D, Clore G M, DeLano W L, Gros P, Grosse-Kunstleve R W, Jiang J S, Kuszewski J, Nilges M, Pannu N S, et al. Acta Crystallogr D. 1998;54:905–921. doi: 10.1107/s0907444998003254. [DOI] [PubMed] [Google Scholar]
- 28.Luzzati P V. Acta Crystallogr. 1952;5:802–810. [Google Scholar]
- 29.Peck M L, Herschlag D. RNA. 1999;5:1210–2121. doi: 10.1017/s1355838299990817. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30.Pause A, Methot N, Sonenberg N. Mol Cell Biol. 1993;13:6789–6798. doi: 10.1128/mcb.13.11.6789. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31.Lorsch J R, Herschlag D. Biochemistry. 1998;37:2194–2206. doi: 10.1021/bi9724319. [DOI] [PubMed] [Google Scholar]
- 32.Lorsch J R, Herschlag D. Biochemistry. 1998;37:2180–2193. doi: 10.1021/bi972430g. [DOI] [PubMed] [Google Scholar]
- 33.Richter-Cook N J, Dever T E, Hensold J O, Merrick W C. J Biol Chem. 1998;273:7579–7587. doi: 10.1074/jbc.273.13.7579. [DOI] [PubMed] [Google Scholar]
- 34.Kraulis P. J Appl Crystallogr. 1991;24:946–950. [Google Scholar]
- 35.Merritt E A, Bacon D J. Methods Enzymol. 1997;277:505–524. doi: 10.1016/s0076-6879(97)77028-9. [DOI] [PubMed] [Google Scholar]