Abstract
The Ras superfamily comprises many guanine nucleotide-binding proteins (G proteins) that are essential to intracellular signal transduction. The guanine nucleotide-dependent intrinsic flexibility patterns of five G proteins were investigated in atomic detail through Molecular Dynamics simulations of the GDP- and GTP-bound states (SGDP and SGTP, respectively). For all the considered systems, the intrinsic flexibility of SGDP was higher than that of SGTP, suggesting that Guanine Exchange Factor (GEF) recognition and nucleotide switch require higher amplitude motions than effector recognition or GTP hydrolysis. Functional mode, dynamic domain, and interaction energy correlation analyses highlighted significant differences in the dynamics of small G proteins and Gα proteins, especially in the inactive state. Indeed, SGDP of Gαt, is characterized by a more extensive energy coupling between nucleotide binding site and distal regions involved in GEF recognition compared to small G proteins, which attenuates in the active state. Moreover, mechanically distinct domains implicated in nucleotide switch could be detected in the presence of GDP but not in the presence of GTP. Finally, in small G proteins, functional modes are more detectable in the inactive state than in the active one and involve changes in solvent exposure of two highly conserved amino acids in switches I and II involved in GEF recognition. The average solvent exposure of these amino acids correlates in turn with the rate of GDP release, suggesting for them either direct or indirect roles in the process of nucleotide switch. Collectively, nucleotide binding changes the information flow through the conserved Ras-like domain, where GDP enhances the flexibility of mechanically distinct portions involved in nucleotide switch, and favors long distance allosteric communication (in Gα proteins), compared to GTP.
Author Summary
The Ras superfamily comprises many guanine nucleotide-binding proteins (G proteins) that are essential to intracellular signal transduction. These proteins act biologically as molecular switches cycling between ON and OFF states, thereby controlling a variety of processes ranging from cell growth and differentiation to vesicular and nuclear transport. In spite of the extremely high biological and medical relevance of the Ras GTPase superfamily, a comprehensive structural/dynamic view of the trans-family and family-specific functioning mechanisms is still lacking. In this study, we gained insights into the functional dynamics of Ras GTPases by deciphering the dynamic information encrypted in the topology of these proteins depending on the nucleotide-bound state, i.e. GDP- or GTP-bound (SGDP and SGTP, respectively). Collectively, nucleotide binding changes the information flow through the conserved Ras-like domain, where GDP enhances the flexibility of mechanically distinct portions involved in nucleotide switch, and favors long distance allosteric communication (in Gα proteins), compared to GTP. Functional dynamics is instrumental in GDP switch, which for the members of the Gα family, different from small G proteins, requires allosteric communication between nucleotide and Guanine Exchange Factor binding sites.
Introduction
The Ras superfamily comprises many guanine nucleotide-binding proteins (G proteins) that are essential to intracellular signal transduction [1], [2]. These proteins act biologically as molecular switches cycling between ON and OFF states, thereby controlling a variety of processes ranging from cell growth and differentiation to vesicular and nuclear transport [1]. The switch-on process requires the release of the bound Guanosine Di-Phosphate (GDP) and the subsequent binding of the Guanosine Tri-Phosphate (GTP), an intrinsically slow process catalyzed by Guanine nucleotide Exchange Factors (GEFs) [1]. In the GTP-bound active state (SGTP), the G proteins display high affinity for binding downstream effectors, interactions through which they exert their specific biologic functions. The switch-off process involves the hydrolysis of GTP to GDP, reaction that is accelerated by Guanine nucleotide Activating Proteins (GAPs) and leads to release of effector proteins (due to reduced affinity) and attenuation of downstream signaling [1]. Peculiar to the members of the Gα family is the fact that in the inactive GDP-bound state (SGDP) they form membrane-associated αβγ heterotrimers, with GDP bound to the α-subunit [3].
Nucleotide switch and hydrolysis in Ras GTPases are played by the conserved core, Ras-like domain (see Figure 1 legend for structure description). Selected members of the superfamily, such as the members of the Gα family, hold an extra-Ras α-helical domain (Figures 1 and 2) constituted by a long central helix surrounded by five shorter helices. This feature makes Gα proteins significantly larger than all the other members of the Ras superfamily, which are, hence, indicated as “small G proteins”.
The central role of Ras GTPases in cell function is testified by the proved involvement of selected members like Ras and Rho in many aspects of cancer development and tumor progression, which makes these proteins very interesting targets in cancer therapy [2]. This is why oncogenic Ras mutants have been the target of computational experiments aimed at unraveling the dynamic information encoded into the structure [4], [5].
In a recent study, we combined Elastic Network Model (ENM) coarse grained simulations with Principal Component Analysis (PCA) on the experimental structures of representatives Ras GTPases to decipher the physical and evolutionary deformability patterns that enable switching between active and inactive states [6], [7]. The analysis highlighted functional separation of the conserved core into two lobes, as previously suggested by others [5], [8]. The deformation modes involved in the switching function are conserved along evolution and are localized in lobe 1 portions close to the nucleotide (Figure 1). These modes lead to functional specialization when associated with evolution-driven deformations of protein portions essentially located in lobe 2, distal from the nucleotide, and involved in specific interactions with membrane, GEFs, or effectors [6]. Additional evidence that the Ras superfamily members share a set of switching dynamics was inferred by the identification of conserved hinge points throughout all subfamilies, which remark the bi-lobate dynamics of the conserved core [6].
In this study, we analyze in atomistic detail the intrinsic flexibility patterns of G proteins through Molecular Dynamics (MD) simulations of selected members of the five major families of Ras GTPases (i.e. Arf1, Gαt (also called transducin), Sec4, H-Ras, and RhoA). Functional dynamics of these molecular switches was investigated through the analyses of essential motions, functional modes, interaction energy correlations and dynamic domains in relation to the functional states, i.e. SGDP or SGTP. Our simulations confirm the complexity of the deformability patterns of Ras GTPases and the careful tuning that evolution has made on it. In G proteins, functional dynamics is suggested to be instrumental in GDP switch, which, for the members of the Gα family, different from the small G proteins, requires allosteric communication between nucleotide and GEF binding sites [3], [9], [10].
Results
Analysis of the intrinsic flexibility of Ras GTPases
Remarkable functional-state dependent features shared by the five homologous GTPases reside in the Cα-Root Mean Square Fluctuation (Cα-RMSF) profiles, showing higher flexibility for the inactive forms compared to the active ones, with swI and swII being the lobe 1 portions that better define the intrinsic trans-family flexibility, as they account for major differences between SGDP and SGTP (Table 1 and Figure 3). Such an increase in flexibility of swI and swII has already been shown for Ras p21, as a feature linked to the absence of the γ-phosphate [11]. For Gαt, the higher flexibility of SGDP resides also in the α-helical domain (Figures 3 and S1). Such a flexibility trend between inactive and active states is mostly shown also by the average Cα-Root Mean Square Deviations (Cα-RMSD; Table S1), which for all GTPases but H-Ras is higher in SGDP compared to SGTP.
Table 1. Correlations between shape descriptor and PCs.
Family | State | Vara | Rb | SASATGavg (Å2)c | GDP released |
Arf1 | SGDP | 91.00 | 0.85 | 91.71±20.64 | 0.04 [12] |
SGTP | 65.63 | 0.32 | 8.34±2.37 | ||
Gαt | SGDP | 390.37 | 0.60 | 87.32±18.21 | 0.00072 [13] |
SGTP | 285.11 | 0.40 | 16.85±4.48 | ||
Sec4 | SGDP | 157.88 | 0.86 | 151.27±29.10 | 0.21 [14] |
SGTP | 79.31 | 0.52 | 18.59±4.05 | ||
H-Ras | SGDP | 87.19 | 0.79 | 120.08±21.58 | 0.025 [16] |
SGTP | 60.96 | 0.65 | 20.93±7.04 | ||
RhoA | SGDP | 111.70 | 0.83 | 107.68±20.76 | 0.0072 [17] |
SGTP | 103.93 | 0.57 | 20.49±6.88 |
Total variance obtained by summing the eigenvalues from PCA.
Correlation coefficient between SASATG and a combination of the first twenty PCs.
SASATG index averaged over the 40000 frames constituting the 40 ns trajectories.
Rate of GEF-independent GDP release (min−1); the relative bibliographic source is in square brackets.
The analysis of concerted motions was carried out both on single and on concatenated Cα-atom trajectories of the inactive and active forms. Consistent with the Cα-RMSF profiles, the total variance computed by summing the eigenvalues from PCA on single trajectories is always higher for the inactive forms compared to the active ones (Table 1, Figure 3). As for PCA on the concatenated trajectories, the first eigenvector (i.e. principal component 1; PC1) separated the structures visited along the trajectory into two clusters, corresponding to SGDP and SGTP (this separation is especially clear for all small G-proteins; Figure 4 and Figure S1). Displacements along second and third eigenvectors (PC2 and PC3) generally reflect the higher motility of SGDP compared to SGTP (Figure 4). For the small G proteins, Cα-atom projections along the three PCs concern collective motions of swI, swII, and inter-switch (Figures S1, S2, and S3). In contrast, for Gαt the collective motions of switches and inter-switch are associated with motions of the α/β loops and of the α-helical domain (Figures S1, S2 S3).
Collectively, Ras GTPases share a similar essential dynamics that is more amplified in SGDP and involves swI, swII, and inter-switch. Such a functional state-dependent dynamics is presumably linked to the nucleotide switch mechanism that pertains to SGDP. The additional essential motions differentiating Gαt from the small G proteins may be considered as expressions of family-specific functional dynamics finalized to the nucleotide switch.
Functional modes in the small G proteins are associated with changes in solvent accessibility of lobe 1 portions
In an attempt to find a trans-family structural indicator of the differences between SGDP and SGTP, we computed the Solvent Accessible Surface Area (SASA) of all the highly conserved amino acids in the G-boxes (Figure 1). The analysis highlighted the SASA computed over T(G2∶4) and G(G3∶4) (SASATG; see Figure 1B legend for the numbering explanation) as the only functional state descriptor valid for all the five GTPase families (Table 1). Indeed, the SASATG averaged over all the trajectory frames (SASATGavg in Table 1) or plotted as time series (Figure 5) is always greater for the inactive form than for the active one. This is due to the breakage of the interactions between the side chain of T(G2∶4) and both the Mg2+ ion and a γ-phosphate oxygen atom, and between the backbone nitrogen atom of G(G3∶4) and a γ-phosphate oxygen atom on going from SGTP to SGDP (Figure 6). Such a breakage of intermolecular interactions is expected to contribute, at least in part, to the higher flexibility of the two switches in the inactive state compared to the inactive one (Figure 3), suggesting the existence of a link between SASATG and RMSF profiles.
According to the results of Functional Mode Analysis (FMA) (see Methods), for the small G proteins SASATG correlates extremely well with the first twenty PCs derived from PCA on the single Cα-trajectories of SGDP, Gαt showing a lower correlation coefficient (Table 1). Collectively, correlations are lower for SGTP compared to SGDP (Table 1).
The Cα-atom projections of the linear combination of the first twenty PCs shows that the portions that contribute the most to the combined essential motions in the SGDP state include swI, swII, and inter-switch (Figure 7). Gαt shows additional collective motions of α-helical domain and α4/β6 loop, whereas RhoA is characterized by additional essential motions of the α-insert (Figures 1 and 7). Remarkably, the highly conserved amino acids that contribute to SASATG lay just on swI (T(G2∶4)) and on swII (G(G3∶4)). Incidentally, in small G proteins, the movement of swI and, by a lower extent, of swII, marked, respectively, by the increase in solvent accessibility of T(G2∶4) and G(3∶4) on going from SGTP to SGDP, is instrumental in GEF recognition (Figure 6 and Figure S4). The fact that those modes, which correlate with SASATG, concern portions of the small G proteins implicated in GEF recognition, and that correlations pertain mostly to the SGDP state, which is engineered to recognize GEFs, suggests that the correlated modes are, indeed, functional modes related to the nucleotide switch mechanism in the small G proteins. These inferences are also supported by the correlation between SASATGavg and rate of GDP release (Table 1, the linear correlation coefficient for the five considered systems being 0.86) [12]–[17]. In this respect, Sec4 shows the highest SASATGavg and GDP release rates in the absence of GEF (Table 1).
Comparative analyses of interaction energy correlations
Further insights into the dynamic properties of SGDP and SGTP involved in functional specialization of selected families were gained through the analysis of correlated non-bonded interactions energies. Highly correlated interacting pairs are markers of protein regions that communicate between each other. A functional state-independent feature of the five G proteins detected by correlated energies is the lack of interaction energy correlations between β4 amino acids and any amino acid from lobe 1 (Figures S5 and S6), which reflects the low flexibility of the β-strand [6]. Other functional-state independent features, shared by Arf1 and Gαt, are the diffuse energy correlations involving α4/β6 loop and both lobe 1 and lobe 2 portions (Figures S5 and S6).
The patterns of correlated interaction energies show that the representative members of the five G protein families considered in this study, less evident in Arf1, share intra-lobe 1 energy correlations in their inactive state, which are lost upon nucleotide exchange. On the contrary, inter-lobe energy coupling between α3, on one side, and swII and inter-switch, on the other one, is more marked in SGTP compared to SGDP. A singularity of H-Ras with respect to the other small G proteins is the energy coupling between β2/β3 turn and C-term of α5, which is more marked in SGDP compared to SGTP (Figure S5).
Family specialization in the context of the energetic coupling between amino acid pairs is more evident by comparing Gαt with the four small G proteins (Figures S5 and S6). In this respect, a singularity of Gαt is the extended coupling between C-term of α5 and lobe 1 portions in SGDP, which is lost upon activation (Figure S6). Other singularities of the inactive state of Gαt is the inter-domain coupling between αF/linker1, on one side, and inter-switch, swII, α3, α4/β6 loop and α5, on the other, which is more marked in the inactive state. Finally, in SGDP of Gαt, the α/β loops that are involved in GEF recognition show more marked energy coupling compared to the β/α loops. Taken together, these features reflect the bi-domain structural organization of the protein as well as the singular GEF recognition mode and GEF-catalyzed nucleotide switch mechanism, which makes the difference from the small G proteins.
We also analyzed functional-state dependent changes in the patterns of the amino acid pairs whose interaction energies correlate with the pairwise interaction energies involving the nucleotide (i.e. amino acid pairs energetically coupled with nucleotide-mediated interaction(s)). Incidentally, the average interaction energy profile of the nucleotide in the two different functional states of the five representatives shows remarkable trans-family conservation (Figure S7). In more detail, for both the SGDP and SGTP forms of the small G proteins, the G1 and G4 boxes (Figure 1) give, respectively, the first and second strongest contributions in terms of attractive interaction energy values, whereas the G2 and G3 boxes contribute only in the SGTP state. Peculiar to both functional forms of Gαt is the fact that the G2 box gives the second strongest contribution due to R174(G2∶1).
In general, in SGDP of the small G proteins, the amino acids involved in direct interaction with the nucleotide essentially lay on α1 and the P-loop, whereas in SGTP, the nucleotide interaction sphere extends to swI and swII amino acids (Figure 8). Upon activation, for Arf1, Sec4, and RhoA the energy coupling between nucleotide binding site and other protein portions increases. In contrast, for H-Ras, GTP substitution for GDP reduces the network of electrostatic interactions energetically coupled with GDP found between the β2/β3 hairpin and α5 (i.e. including the salt bridges between D47 in the β2/β3 turn and both R161(α5∶10) and R164(α5∶13) and between E49(β3∶-2) and R164(α5∶13); Figure 8).
The energetic coupling in Gαt shows a singular behavior compared to the small G proteins (Figures 8 and 9). In fact, both SGDP and SGTP show a significantly higher number of correlated pairs compared to the small G proteins (Figure 9). These pairs are both inter- and intra-domain located and undergo a change in distribution upon activation (Figure 9). As for SGDP, inter-domain correlated pairs essentially involve R172(αF∶6) and the fully conserved D268(G4∶4), which interacts also with the nucleotide, as well as D146 (in the αD/αE loop) and K266 (in the β4/α3 loop) (Figure 9). As for the intra-α-helical domain-located pairs, only ten amino acid pairs correlate with the nucleotide interaction(s). Finally, intra-Ras correlated pairs are both intra-lobe and inter-lobe located (see Figure 9 legend for deep detail). Activation causes a general reduction in energy coupling, which is more significant in lobe 1 and between the two lobes of the Ras-like domain. Indeed, the correlated pairs characterizing SGTP are essentially located in lobe 2 portions distal from the nucleotide (i.e. in α3, α4 as well as the α3/β5 and α4/β6 loops (Figure 9)).
Collectively, the analysis of correlated interaction energies highlights the presence of an allosteric mechanism associated with the nucleotide switch in Gαt but not in the small G proteins.
Comparative analysis of dynamic domains
The five G proteins (in both active and inactive forms) were also compared by the Dynamic Domain (DD) method, which clusterizes the Cα-atoms of the system according to their propensity to be part of mechanically coherent domains in the trajectory frames [18]. Not surprisingly, the first cluster separates active vs inactive G-proteins, stressing the existence of differential effects of GDP and GTP binding on G protein dynamics. In SGDP of the small G proteins, mechanically distinct domains include swI and swII, which are directly implicated in GEF recognition and the nucleotide switch, whereas SGTP does not show common clustering patterns. Furthermore, the SGDP of Gαt shows unique features since the α-helical domain appears mechanically distinct from the Ras-like domain (Figure 10).
Discussion
MD simulations done in this study helped unraveling the functional dynamics of Ras superfamily GTPases, providing atomistic details that were not reached by previous evolutionary or coarse-grained studies [6]. Indeed, current simulations clearly show that the intrinsic flexibility of SGDP is higher than that of SGTP, suggesting that GEF recognition and nucleotide switch mechanism require higher amplitude motions than effector recognition or GTP hydrolysis.
Novel trans-family features pertaining to functional dynamics were inferred from the analysis of interaction energy correlations. The latter revealed intra-lobe 1 correlations in all the five G proteins; such correlations attenuated upon activation. This may relate with lobe 1 being heavily involved in function-retention dynamics. Furthermore, interaction energy correlations highlighted almost complete lack of correlations involving the β4 segment in both functional states of the five G proteins. This feature remarks the bi-lobate nature of the conserved Ras-like domain, which is related to β4 being the holder of the strongest and most conserved hinge point [6].
The results of this study suggest that, in small G proteins, functional modes, i.e. collective motions directly related to function, are more evident in the inactive state rather than in the active one. These modes, which involve swI and swII in lobe 1, correlate with changes in solvent exposure of T(G2∶4) and G(G3∶4), which, in the small G proteins, with emphasis on T(G2∶4), are involved in GEF recognition. These results, together with the existence of a correlation between SASATGavg and rate of GDP release, suggest that the two conserved G-box amino acids participate either directly or indirectly in the mechanism of nucleotide switch in small G proteins. Functional modes involving swI and swII are less detectable for Gα proteins, suggesting that biologically relevant modes in large and small G proteins are different. Indeed, the analyses of essential motions and dynamic domains support previous inferences that in Gα proteins the nucleotide switch involves concerted motions of the α-helical domain with respect to the Ras-like domain, following allosteric GEF recognition by the α/β loops [19]. Consistent with these results, the inactive state of Gαt is characterized by a significantly more extensive communication between nucleotide binding site and distal regions involved in GEF recognition compared to the small G proteins. This was inferred from SGDP of Gαt showing an evident energy coupling between nucleotide binding site and distal portions like the β2/β3 turn, α5, and the three α/β loops, which participate in GEF recognition [3]. Remarkably, this energy coupling is absent in SGTP, in which function, i.e. effector binding and GTP-hydrolysis, does not require long distance communication as it involves regions like swI and swII that participate in the nucleotide binding site. Thus, the energy coupling between nucleotide and GEF binding sites likely pertains to the nucleotide switch mechanism in Gα proteins and not in small G proteins. The evident communication between nucleotide binding site and inter-switch-C term of Gαt is in line with the results of previous simulations of the receptor-G protein complex suggesting that the receptor-induced detachment of α5 from the inter-switch promotes a cascade of structural changes in Gα that propagate from the C-term to the α-helical domain through lobe 1 portions of the Ras-like domain [19]. These changes, indeed, culminate with the formation of a nucleotide exit route in between the αF-helix and β6/α5 loop [19].
The analysis of the interaction energy correlations highlights novel family-specific features. These include interaction energy correlations involving α4/β6 loop and the remaining portions of Arf1 and Gαt, related to the higher length and flexibility of this loop in the two proteins compared to the other three G proteins. Moreover, a singularity of H-Ras compared to the other small G proteins is a reduction in energy coupling between nucleotide binding site and distal regions like α5, on going from SGDP to SGTP. This may relate, at least in part, to the postulated implication of inter-switch and α5 of H-Ras in a novel switch mechanism involving a nucleotide-dependent change in the membrane anchoring of the protein, operated through α4 and the hypervariable region (HVR) in α5 [20].
In summary, in Ras GTPases, the intrinsic dynamics oriented to functional specialization essentially pertains to the inactive state rather than to the active one and clearly separates the small G proteins from Gαt. Indeed, mechanically distinct domains implied in the mechanism of nucleotide switch could be detected in the presence of GDP but not in the presence of GTP. SGDP is engineered to respond to the GEF's request to release GDP and this follows different mechanisms in the small G proteins compared to the members of the Gα family. Whereas in small G proteins GEF binds directly to swI and swII, which are implicated in nucleotide exit, in the Gα family GEF binds to protein portions that are distal from the nucleotide, thus implying allosteric communication for GEF-catalyzed nucleotide release. The latter is expected to involve also a concerted motion of the α-helical domain with respect to the Ras-like domain, feature intrinsic to SGDP and amplified by GEF. In contrast, for SGTP of all G proteins, in which nucleotide and effector binding sites are quite close, the functioning mechanism does not require long distance communication nor inter-domain motions that are, hence, attenuated compared to SGDP. In this respect, the intrinsic flexibility of small G proteins and Gα proteins shares more commonalties in the active state compared to the inactive one.
Current and previous results [6] suggest that the Ras superfamily utilizes a hierarchical organization of its structural flexibilities; lobe 1 motions associated with its switching function must be retained in order to accomplish the primary G protein function of changing its affinity for GEFs, effector proteins, and GAPs through different bound nucleotides, but additional motions across both lobes of the protein are family-specific and play a role in determining the unique functional characteristics of specific members. Molecular communication between lobe 1 and lobe 2 portions (intra-Ras communication) and between α-helical and Ras-like domains, for Gαt, represents the way to accomplish functional specialization.
Taken as a whole the results of this investigation reveal that the topology of the conserved Ras-like domain is such that it allows for differential flow of information depending on the bound nucleotide, i.e. GDP enhances the flexibility of mechanically distinct portions involved in nucleotide switch, and favors long distance allosteric communication (in Gα proteins), compared to GTP.
Methods
Simulated proteins
The following SGDP and SGTP representatives of the Arf1, Gαt, Sec4, H-Ras, and RhoA families were selected as input of MD simulations: Arf1 (PDB codes: 1HUR and 1O3Y), Gαt (PDB codes: 1TAG and 1TND), Sec4 (PDB codes: 1G16 and 1G17), H-Ras (PDB codes: 4Q21 and 5Q21), and RhoA (PDB codes: 1FTN and 1KMQ) based on the following criteria: i) homogeneity of the molecular specie, i.e. same sequence, for the two functional states, and ii) existence of a crystal structure for the isolated state, i.e. not in complex with other proteins. In a few cases, wild-type structures (in one or both SGDP and SGTP states) were generated by manipulation of mutant structures: L71Q mutant for Arf1 SGTP, N25F, and N25F/L63Q for the SGDP, and SGTP states of RhoA, respectively. Due to the intrinsic GTPase activity, the GTP-bound forms had been crystallized in complex with an hydrolysis-resistant analogue of GTP, i.e. GppNHp for 5P21, 1G17, and 1KMQ, and GTPγS for 1TND. In our simulations the GTP analogues were converted to GTP.
Finally, all the simulated systems contained the Mg2+ ion together with the coordinating water molecules. In the case of the GDP-bound form of Sec4, the Mg2+ ion substitutes for the Co2+ ion originally present in the 1G16 structure. The two different metal ions are expected to be interchangeable with no substantial structural effects [21].
Set-up of MD simulations
MD simulations on the five representative Ras GTPases (Arf1, Gαt, Sec4, H-Ras, and RhoA) were carried out using the GROMACS4 simulation package [22] with the AMBER03 all atoms force field [23], [24], by using the TIP3P water model to describe the solvent. AMBER parameters to describe the GDP and GTP molecules were taken from literature [25]. Depending on the dimensions of the systems, a variable number of Na+ and Cl− ions placed at optimum electrostatic positions were added in order to neutralize the system (Table S1). Periodic Boundary Conditions (PBC) were applied using an octahedric box as a unit cell, imposing a minimum distance of 12 Å between the solute and the box boundaries.
All the input crystallographic structures were subjected to energy minimization keeping restricted the positions of main chain atoms, all the Cβ atoms, the nucleotide, the Mg2+ cation and the coordinating water molecules. The systems were then equilibrated at 300 K for 4 ns of backbone-restricted MD simulations. The Particle Mesh Ewald (PME) method was employed to compute the electrostatic interactions. Short range repulsive and attractive interactions were computed using a Lennard-Jones potential with a cutoff of 10 Å. The LINCS algorithm [26] was used to constrain all bond lengths except those in water molecules, allowing for an integration time step of 2 fs by the leap-frog algorithm. The v-rescale thermostat [27] was employed to keep the system at a constant temperature of 300 K, by using a coupling constant (τt) of 0.1 ps. The pressure of the system was kept fixed at 1 atm, using the Berendsen weak coupling algorithm [28] with a coupling constant (τp) of 1 ps. The pre-equilibrated systems were then released for 1 ns prior to 40 ns of unrestrained isothermal-isobaric (T = 300K, P = 1 atm) MD simulations.
For each system, trajectory replicas of 40 ns were achieved by reassigning velocities to the frame extracted from the 20000th frame of the original trajectory. Due to overall consistency between original and replicated trajectories, this work shows only the results of the original one.
Analysis of trajectories
Trajectories were subjected to a variety of analyses. RMSDs, RMSF, and SASA were performed using the Wordom software [29]. Detailed information concerning the essential motions of the proteins was obtained by using Essential Dynamics (ED) analyses. In this respect, covariance matrices (Cα-atoms in our case) were built, using either isolated or concatenated MD trajectories, and diagonalized to a set of eigenvectors (describing the essential deformation modes) and associated eigenvalues (which explain the amount of each-mode variance). As for ED on the concatenated trajectories, the Cα-atom structure averaged over all the concatenated frames was used as a reference for the building of the co-variance matrix. In order to facilitate comparison along different proteins, a minimum consensus length for the inactive and active states was chosen for each family, i.e. 18-177 for Arf1, 29-339 for Gαt, 20-183 for Sec4, 3-165 for H-Ras, and 5-178 for RhoA.
The search for correlations between structural features and essential modes was carried out through the FMA [30] tool, by using the Linear Mutual Information (LMI) estimator [31]. In this framework, a number of size/shape and intermolecular interaction descriptors were correlated with linear combinations of a variable number of PCs.
The dynamical/mechanical properties of the simulated systems were characterized by means of the DD method implemented into the PCASUITE package (http://mmb.pcb.ub.es/software/pcasuite/) [32].
Non bonded interaction energies involving each protein residue, the nucleotide, and the Mg2+ ion were monitored every 20 ps along the trajectory. Only the non bonded interactions whose average values along the simulations were greater than 2 kcal mol−1 were considered relevant for the analysis and were processed to find potential pair-correlations according to the following equation [33]:
(1) |
Where and (as well as and ) stand for instantaneous (i.e. frameth-associated) and average interaction energy values between particles i and j (as well as k and l). A N × N correlation matrix is thus built, where N is the total number of relevant interacting pairs. Only the elements with absolute correlation coefficient ≥0.4 were further considered in the analysis. Note that by mapping the inter-atomic energy correlation matrix into the residue space, a residue correlation matrix can be derived:
(2) |
where N is the dimension of the energy correlation matrix and k is the number of relevant interactions, whose correlation value is above the correlation threshold, in which residues i and j are involved. is 1 if residue i and j are involved in interactions with m and n (n and m), otherwise is set to 0.
Supporting Information
Acknowledgments
We are grateful to Michele Seeber for his technical support. We also thank Ongun Onaran for helpful discussion.
Footnotes
The authors have declared that no competing interests exist.
This study was supported by a Telethon-Italy grant n. S00068TELU, a grant from the Spanish Ministry of Science (BIO2009-10964, Consolider E-science project, the COMBIOMED ISCIII project), and Fundacion Marcelino Botin. This work was carried out under the HPC-EUROPA2 project (project number: 228398) with the support of the European Commission Capacities Area Research Infrastructures Initiative. All calculations were carried out at BSC Marenostrum supercomputer and local workstations. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
References
- 1.Vetter IR, Wittinghofer A. The guanine nucleotide-binding switch in three dimensions. Science. 2001;294:1299–1304. doi: 10.1126/science.1062023. [DOI] [PubMed] [Google Scholar]
- 2.Colicelli J. Human RAS superfamily proteins and related GTPases. Sci STKE. 2004;2004:RE13. doi: 10.1126/stke.2502004re13. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.Oldham WM, Hamm HE. Structural basis of function in heterotrimeric G proteins. Q Rev Biophys. 2006;39:117–166. doi: 10.1017/S0033583506004306. [DOI] [PubMed] [Google Scholar]
- 4.Futatsugi N, Tsuda M. Molecular dynamics simulations of Gly-12—>Val mutant of p21(ras): dynamic inhibition mechanism. Biophys J. 2001;81:3483–3488. doi: 10.1016/S0006-3495(01)75979-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Gorfe AA, Grant BJ, McCammon JA. Mapping the nucleotide and isoform-dependent structural and dynamical features of ras proteins. Structure. 2008;16:885–896. doi: 10.1016/j.str.2008.03.009. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Raimondi F, Orozco M, Fanelli F. Deciphering the deformation modes associated with function retention and specialization in members of the Ras superfamily. Structure. 2010;18:402–414. doi: 10.1016/j.str.2009.12.015. [DOI] [PubMed] [Google Scholar]
- 7.Van Wynsberghe AW, Cui Q. Conservation and variation of structural flexibility in protein families. Structure. 2010;18:281–283. doi: 10.1016/j.str.2010.02.001. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Grant BJ, Gorfe AA, McCammon JA. Ras Conformational Switching: Simulating Nucleotide-Dependent Conformational Transitions with Accelerated Molecular Dynamics. PLoS Comput Biol. 2009;5:e1000325. doi: 10.1371/journal.pcbi.1000325. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Oldham WM, Hamm HE. Heterotrimeric G protein activation by G-protein-coupled receptors. Nat Rev Mol Cell Biol. 2008;9:60–71. doi: 10.1038/nrm2299. [DOI] [PubMed] [Google Scholar]
- 10.Sayar K, Ugur O, Liu T, Hilser VJ, Onaran O. Exploring allosteric coupling in the alpha-subunit of heterotrimeric G proteins using evolutionary and ensemble-based approaches. BMC Struct Biol. 2008;8:23. doi: 10.1186/1472-6807-8-23. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Ma JP, Karplus M. Ligand-induced conformational changes in ras p21: A normal mode and energy minimization analysis. J Mol Biol. 1997;274:114–131. doi: 10.1006/jmbi.1997.1313. [DOI] [PubMed] [Google Scholar]
- 12.Paris S, Beraud-Dufour S, Robineau S, Bigay J, Antonny B, et al. Role of protein-phospholipid interactions in the activation of ARF1 by the guanine nucleotide exchange factor Arno. J Biol Chem. 1997;272:22221–22226. doi: 10.1074/jbc.272.35.22221. [DOI] [PubMed] [Google Scholar]
- 13.Thomas TO, Bae H, Medkova M, Hamm HE. An intramolecular contact in Galpha transducin that participates in maintaining its intrinsic GDP release rate. Mol Cell Biol Res Commun. 2001;4:282–291. doi: 10.1006/mcbr.2001.0293. [DOI] [PubMed] [Google Scholar]
- 14.Kabcenell AK, Goud B, Northup JK, Novick PJ. Binding and hydrolysis of guanine nucleotides by Sec4p, a yeast protein involved in the regulation of vesicular traffic. J Biol Chem. 1990;265:9366–9372. [PubMed] [Google Scholar]
- 15.Feuerstein J, Kalbitzer HR, John J, Goody RS, Wittinghofer A. Characterisation of the metal-ion-GDP complex at the active sites of transforming and nontransforming p21 proteins by observation of the 17O-Mn superhyperfine coupling and by kinetic methods. Eur J Biochem. 1987;162:49–55. doi: 10.1111/j.1432-1033.1987.tb10540.x. [DOI] [PubMed] [Google Scholar]
- 16.Neal SE, Eccleston JF, Hall A, Webb MR. Kinetic analysis of the hydrolysis of GTP by p21N-ras. The basal GTPase mechanism. J Biol Chem. 1988;263:19718–19722. [PubMed] [Google Scholar]
- 17.Hutchinson JP, Eccleston JF. Mechanism of nucleotide release from Rho by the GDP dissociation stimulator protein. Biochemistry. 2000;39:11348–11359. doi: 10.1021/bi0007573. [DOI] [PubMed] [Google Scholar]
- 19.Raimondi F, Seeber M, De Benedett PG, Fanelli F. Mechanisms of inter- and intramolecular communication in GPCRs and G proteins. J Am Chem Soc. 2008;130:4310–4325. doi: 10.1021/ja077268b. [DOI] [PubMed] [Google Scholar]
- 20.Abankwa D, Hanzal-Bayer M, Ariotti N, Plowman SJ, Gorfe AA, et al. A novel switch region regulates H-ras membrane orientation and signal output. Embo J. 2008;27:727–735. doi: 10.1038/emboj.2008.10. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Stroupe C, Brunger AT. Crystal structures of a Rab protein in its inactive and active conformations. J Mol Biol. 2000;304:585–598. doi: 10.1006/jmbi.2000.4236. [DOI] [PubMed] [Google Scholar]
- 22.Hess B, Kutzner C, Van Der Spoel D, Lindahl E. GROMACS 4: Algorithms for Highly Efficient, Load-Balanced, and Scalable Molecular Simulation. J Chem Theory Comput. 2008;4:435–447. doi: 10.1021/ct700301q. [DOI] [PubMed] [Google Scholar]
- 23.Case DA, Cheatham TE, Darden T, Gohlke H, Luo R, et al. The Amber biomolecular simulation programs. J Comput Chem. 2005;26:1668–1688. doi: 10.1002/jcc.20290. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24.Sorin EJ, Pande VS. Exploring the helix-coil transition via all-atom equilibrium ensemble simulations. Biophys J. 2005;88:2472–2493. doi: 10.1529/biophysj.104.051938. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.Meagher KL, Redman LT, Carlson HA. Development of polyphosphate parameters for use with the AMBER force field. J Comput Chem. 2003;24:1016–1025. doi: 10.1002/jcc.10262. [DOI] [PubMed] [Google Scholar]
- 26.Hess B, Bekker H, Berendsen HJC, Fraaije JGEM. LINCS: A linear constraint solver for molecular simulations. J Comput Chem. 1997;18:1463–1472. [Google Scholar]
- 27.Bussi G, Donadio D, Parrinello M. Canonical sampling through velocity rescaling. J Chem Phys. 2007;126:014101. doi: 10.1063/1.2408420. [DOI] [PubMed] [Google Scholar]
- 28.Berendsen HJC, Postma JPM, Vangunsteren WF, Dinola A, Haak JR. Molecular-Dynamics with Coupling to an External Bath. J Chem Phys. 1984;81:3684–3690. [Google Scholar]
- 29.Seeber M, Cecchini M, Rao F, Settanni G, Caflisch A. Wordom: a program for efficient analysis of molecular dynamics simulations. Bioinformatics. 2007;23:2625–2627. doi: 10.1093/bioinformatics/btm378. [DOI] [PubMed] [Google Scholar]
- 30.Hub JS, de Groot BL. Detection of functional modes in protein dynamics. PLoS Comput Biol. 2009;5:e1000480. doi: 10.1371/journal.pcbi.1000480. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31.Lange OF, Grubmuller H. Generalized correlation for biomolecular dynamics. Proteins. 2006;62:1053–1061. doi: 10.1002/prot.20784. [DOI] [PubMed] [Google Scholar]
- 32.Meyer T, Ferrer-Costa C, Perez A, Rueda M, Bidon-Chanal A, et al. Essential dynamics: A tool for efficient trajectory compression and management. J Chem Theory Comput. 2006;2:251–258. doi: 10.1021/ct050285b. [DOI] [PubMed] [Google Scholar]
- 33.Kong Y, Karplus M. The signaling pathway of rhodopsin. Structure. 2007;15:611–623. doi: 10.1016/j.str.2007.04.002. [DOI] [PubMed] [Google Scholar]
- 34.Orengo CA, Michie AD, Jones S, Jones DT, Swindells MB, et al. CATH - a hierarchic classification of protein domain structures. Structure. 1997;5:1093–1108. doi: 10.1016/s0969-2126(97)00260-8. [DOI] [PubMed] [Google Scholar]
- 35.Noel JP, Hamm HE, Sigler PB. The 2.2 Å crystal structure of transducin-α complexed with GTPγS. Nature. 1993;366:654–663. doi: 10.1038/366654a0. [DOI] [PubMed] [Google Scholar]
- 36.Shatsky M, Nussinov R, Wolfson HJ. A method for simultaneous alignment of multiple protein structures. Proteins. 2004;56:143–156. doi: 10.1002/prot.10628. [DOI] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.