Skip to main content
NIHPA Author Manuscripts logoLink to NIHPA Author Manuscripts
. Author manuscript; available in PMC: 2013 Aug 14.
Published in final edited form as: J Chem Theory Comput. 2012 Jul 5;8(8):2949–2961. doi: 10.1021/ct300377a

Exploring residue component contributions to dynamical network models of allostery

Adam T VanWart 1, John Eargle 2, Zaida Luthey-Schulten 2, Rommie E Amaro 1,*
PMCID: PMC3489502  NIHMSID: NIHMS393419  PMID: 23139645

Abstract

Allosteric regulation in biological systems is of considerable interest given the vast number of proteins that exhibit such behavior. Network models obtained from molecular dynamics simulations have been shown to be powerful tools for the analysis of allostery. In this work, different coarse-grain residue representations (nodes) are used together with a dynamical network model to investigate models of allosteric regulation. This model assumes that allosteric signals are dependent on positional correlations of protein substituents, as determined through molecular dynamics simulations, and uses correlated motion to generate a signaling weight between two given nodes. We examine four types of network models using different node representations in Cartesian coordinates: the (i) residue alpha-carbons, (ii) sidechain center of mass, (iii) backbone center of mass, and the entire (iv) residue center of mass. All correlations are filtered by a dynamic contact map that defines the allowable interactions between nodes based on physical proximity. We apply the four models to imidazole glycerol phosphate synthase (IGPS), which provides a well-studied experimental framework in which allosteric communication is known to persist across disparate protein domains (e.g. a protein dimer interface). IGPS is modeled as a network of nodes and weighted edges. Optimal allosteric pathways are traced using the Floyd Warshall algorithm for weighted networks, and community analysis (a form of hierarchical clustering) is performed using the Girvan-Newman algorithm. Our results show that dynamical information encoded in the residue center of mass must be included in order to detect residues that are experimentally known to play a role in allosteric communication for IGPS. More broadly, this new method may be useful for predicting pathways of allosteric communication for any biomolecular system in atomic detail.

INTRODUCTION

Proteins and enzymes act as catalytic waypoints along a natural products assembly line. They are the molecular machinery of the cell designed to assist complex biological synthesis and mediate biosynthetic regulation. Allostery is broadly categorized as the mechanism, or set of mechanisms, for which an event at a distinct site affects the activity, function, conformational ensemble, or general attributes at another site within an enzyme or protein 1. As an example, in the case of allosteric modulation, the catalytic efficiency of an allosterically active enzyme or protein is affected through the binding of a metabolite to an allosteric site. The metabolite is said to be an effector molecule that may propagate an activation or inhibition signal to a distant catalytic site that in turn regulates the catalytic activity of the enzyme or protein. Furthermore, a metabolite may take part in the catalytic reaction itself, or instead, merely serve to induce allosteric modulation 2, 3. In this way, allostery mechanizes regulation within metabolic pathways as an intrinsic characteristic of a variety of proteins and enzymes. Further understanding of allostery may lead to advances in practical applications, such as rational drug design, protein engineering, and development of synthetic catalysts.

Allosteric regulation usually involves at least two different stereospecific receptor sites 24. As metabolic pathways are critical to living organisms, it follows that communication between both receptor sites must be a protected feature of allosteric modulation. Consequently, homologous proteins often preserve structural elements that conserve communication between receptor sites. The central theme being that structural elements important to protein function, and thus allosteric signaling, have been evolutionarily conserved 58. A notable proponent of this theme can be found in Statistical Coupling Analysis first introduced by Lockless and Ranganathan 9.

Much effort has been put forth to develop rigorous theories of allosteric modulation. Some of the early foundations of allostery are proposed in the concerted MWC model (Monod, Wyman and Changeux 3) and sequential KNF model (Koshland, Nemethy, and Filmer 10). These models conceptualize the mechanism within a protein modulated by an allosteric effector as a conformational transition that is either concerted or induced sequentially 11. Since then, mechanistic details have become rooted in an ensemble view of the protein where cooperativity between sites is governed by a density of states that is intimately connected to entropy and enthalpy 12, 13. The accessibility of an allosteric state may be through an effector substrate inducing a large conformational change within the protein complex as well as small local conformational changes 14. However, allosteric regulation can also occur in the absence of large conformational changes, as found in works by Cooper and Dryden 15. In such a situation, an enzyme or protein would have the propensity to facilitate allosteric signals due to small vibrations inherent to the proteins state and not due to a change in shape or fold.

Experimental constructs to explore allostery have included x-ray crystallography with site-directed mutagenesis, 1H-15N heteronuclear single quantum correlation (HSQC) NMR, kinetic assays with site directed mutagenesis, inhibition assays, and others 16. These tests probe a range of questions associated with small molecule effectors, protein-protein interfaces (hot spots), allosteric pathways, and general protein functionality. NMR provides increasing potential for a direct view into the mechanics of allostery. The 1H-15N amide bond vector is an inherent characteristic of polypeptides and thus has been used to characterize fast backbone and side-chain dynamics. In NMR experiments, thermal motions are present in all proteins above the glass transition temperature and there is a direct link between molecular motions and local magnetic field fluctuations. These dynamics are commonly associated with the Lipari-Szabo order parameter S2 17, 18. It is generally accepted that ps-ns timescales capture allostery in the absence of large conformational shifts for a majority of proteins (see review 19). The parameter S2 is related to the Gibbs free energy through the canonical partition function and provides a means of deriving conformational entropy from bond vector fluctuations 20.

Molecular dynamics seeks to give accurate virtual results that compete with experiment at a fraction of the cost. With the use of molecular dynamics simulations 21, 22, details of conformational changes are sampled to clarify and categorize mechanisms of allostery that may otherwise be elusive. Quasi harmonic analysis is a popular analytical tool that has been used in molecular dynamics to quantify motions in the protein's trajectory ensemble 2325. This approach aligns trajectories to a reference frame, often the average of the protein, and can be used to deduce correlations among variables (e.g. dihedral angles, position) that account for the configurational space. However, harmonic motions may be an oversimplified assumption for protein dynamics and the need for an anharmonic metric arises. Mutual information methods outlined by Lange and Grubmuller provide a more generalized framework that captures anharmonic motions in Cartesian coordinates 26. Recent works by Killian, Kravitz, and Gilson derive configurational entropies using internal coordinates for a trajectory ensemble 27. The mutual information values are the second-order terms derived from the entropy expansion and directly link correlated conformations within the protein to molecular configurational entropy values. This work inspired McClendon, Jacobson, and coworkers to develop an entropy, based approach to harmonic and anharmonic correlations via mutual information (MutInf) among dihedral angles 28; this method includes Bayesian filtering as well as sampling penalties to generate more statistically robust values.

Dynamical network analysis is another method used to explore putative allosteric communication pathways 29, 30. Within the realm of biological computation, dynamical network analysis samples correlations among time dependent variables for predefined components of a biological system (i.e. protein). These variables are associated with the trajectory ensemble and provide a distribution at equilibrium. As examples, a variable could be the position of an atom, center of mass (c.o.m.) of a residue, or a vector modeling the orientation of a residue. These variables serve to model the configuration of the protein at a given time. As developed by Luthey-Schulten and colleagues 29, 30, the variable was the position of each alpha-carbon atom within an aminoacyl-tRNA synthetase system. MD simulations sample the system dynamics, which are used to generate correlation values among all variables (nodes) of the biological system. The absolute value of a given correlation is then incorporated as a function that models the signaling between nodes as a distance. The behavior of nodes that are highly correlated and within close physical proximity (aggregates) can be analyzed in terms of community behavior and structure. Allosteric signaling between community aggregates occurs through particular node pairs that exhibit a high degree of network degeneracy (i.e. community cross talk). These “critical nodes” transfer a relatively large degree of information across their edge. As critical node pairs carry out inter-community cross talk, they may provide favorable sites for signal disruption.

In this work we extend the dynamical network analysis method and propose allosteric signaling pathways in imidazole glycerol phosphate synthase (IGPS) (HisH-HisF), which belongs to the glutamine amidotransferase (GAT) family of enzymes (figure 1). This multi-domain globular protein regulates the fifth step of the histidine biosynthetic pathway in plants, fungi, and microbes. Histidine is considered an essential amino acid, as humans do not have a histidine biosynthetic pathway. In bacteria, HisH-HisF is a protein dimer, whereas in eukaryotes it exists as one single polypeptide chain, with a ~40 amino-acid-residue tether connecting the two domains. The HisH-HisF dimer has two receptor sites; one receptor site (effector site) is positioned at the C-terminal cyclase domain of HisF, and the other (allosteric site) is located in HisH (the glutaminase domain) roughly 30 Å away 31. The triad GAT subfamily strictly conserves these catalytic residues across all members of the set, although the allosteric effector, remote binding site, upregulation, and allosteric site vary between different enzymes 3235. Another important conserved subfamily structure in GATs is a (β/α)8 barrel which is located in HisF. The (β/α)8 barrel is believed to assist in transport of the ammonia moiety to the effector or substrate molecule and may also play a role in allostery 3638.

Figure 1. The structure and reaction of hisH-hisF.

Figure 1

(A) The two domains, HisH (glutaminase domain) and HisF (cyclase domain) are shown colored by secondary structural elements. (B) The catalytic triad, residues tested experimentally for allostery, and the substrate PRFAR are presented again in ball and stick representation for clarity with the same orientation as (A), showing the allosteric and effector sites, which are over 30 Å apart. (C) The overall reaction catalyzed by HisH and HisF produces AICAR and ImGP.

Attachment of the primary metabolite N1-[(5'-phosphoribulosyl)-formimino]-5-aminoimidazole-4-carboxamide ribonucleotide (PRFAR, which is also the allosteric effector) to the cyclase domain induces several global structural changes in the protein dimer. A flexible loop on the C-terminal end of the (β/α)8 barrel appears to modulate access to the PRFAR substrate, and a hinge motion at the dimer interface changes the HisF and HisH domains by roughly 7.1° to form a tighter interface 8. These large scale conformational changes indicate that the PRFAR-bound (holo) and PRFAR-unbound (apo) states are distinct. Other induced motions consist of smaller conformational changes in the sidechain orientations 37. The catalytic turnover of glutamine to glutamate is increased ~4900 fold when PRFAR is bound, however the details of the allosteric mechanism are not completely understood 8. All members of the GAT family of enzymes exhibit similar allosteric regulation to control the production and utilization of ammonia in the subsequent catalytic reaction 39. After the catalytic turnover of glutamine, nascent ammonia is shuttled across the dimer interface and through the center of the (β/α)8 barrel along the barrel's principal axis to the cyclase domain where PRFAR is broken into 5'-(5-aminoimidazole-4-carbox- amide) ribonucleotide (AICAR) and ImGP (figure 1) 40. Previous studies have also found the AICAR and ImGP products stimulate the glutaminase half reaction, although at a reduction of 45 to 188 times less, respectively, than the PRFAR substrate molecule 37. Moreover, the two independent products also stimulate the glutaminase half reaction 15 times less than PRFAR. These results suggest that AICAR and ImGP both contribute to allostery, but ImGP may be more involved in the allosteric signaling network. Furthermore, the chemical bond between the AICAR and ImGP moieties is necessary for enhancement of the allosteric state.

We explore extensions of the original dynamical network analysis to ascertain new models for allosteric communication, which are benchmarked against experimental site-directed mutagenesis studies in HisH-HisF. The impact of utilizing different residue contributions (i.e. nodal methods) within the dynamical network analysis framework is investigated using HisH-HisF as a model system. The Floyd Warshall algorithm determines the shortest (optimal) path between two nodes in a network 41, 42, and the Girvan-Newman algorithm 43, a method without free parameters, generates aggregate nodal clusters, or communities, that optimize intra-communication pathways within the protein. For each nodal method, we determine the optimal signaling pathways. We then compare and investigate the corresponding community structures for the alpha-carbon and residue c.o.m. methods. Ultimately, we propose novel allosteric communication pathways for HisH-HisF and show that using the residue c.o.m. provides advantages over other nodal methods within the dynamical network analysis framework.

METHODS

System setup

The initial coordinates for the (holo) HisH-HisF were obtained as PDB 1GPW (from Thermotoga maritima) from the Protein Data Bank and included resolved phosphate groups still bound to the active site. Furthermore, the PDB 1GPW included three states of HisH-HisF categorized by chains AB, C–D, E–F. In order to prepare an accurate residue environment, PROPKA was used to determine protonation states of titratable residues based on an environment at pH 7 44. The δ-nitrogen was protonated for the HisF domain for residues His84, His209, His244 and His73, His120, and His141 in HisH. An ε-nitrogen protonation state was assigned for HisF:His228, and for HisH:His53. Both the δ-nitrogen and ε-nitrogen of the residue HisF:His151 were protonated.

The crystal structure 1OX5 (from Saccharomyces cerevisiae) included all PRFAR heavy atoms and was used to build in the PRFAR substrate for 1GPW. Using the VMD plugin MultiSeq 45, the crystal structure chain A 1OX5 was aligned to 1GPW chains C and D via secondary structures in the HisF domain using the STAMP structural alignment algorithm 46, 47. The result was that the PRFAR substrate aligned correctly to the 1GPW:HisF domain for the holo-state of HisH-HisF. This alignment is consistent with other works 38, 48. The 1GPW:crystal-waters, 1OX5:PRFAR metabolite, 1GPW:HisF, and 1GPW:HisH segments were processed by PSFGEN to generate psf and pdb files. The parameterization of the PRFAR substrate followed standards established in the CHARMM protocol 22, and used results from previous works 49. The formal charge for the protein dimer was −8.0 e, and the formal charge for the primary metabolite, PRFAR, was −5.0 e. The formal charge of the system was −13.0 e in total. The system was solvated using SOLVATE 50 to add explicit TIP3 water molecules to a box with a water pad of 10 Å around the protein. Thirteen sodium counterions were added with the VMD plugin Autoionize, bringing the overall formal charge of the entire system to zero.

The (apo) HisH-HisF was also derived from 1GPW using chains A and B. The system setup for the (apo) HisH-HisF followed the exact same procedure as (holo) HisH-HisF without the addition of the PRFAR ligand.

Molecular Dynamics Simulations

A four-step energy minimization was employed for both apo and holo structures to ensure maximal retention of experimental structural information. The four steps constrained heavy atoms for 1000 fs, heavy atoms excluding water for 1000 fs, backbone atoms excluding all other molecular groups for 2500 fs, and finally a constraint-free motion of all atoms for 5000 fs. Harmonic constraints were induced to allow minimal backbone motion to achieve water equilibration for five time frames of 25000 fs each and different backbone constraint constants of 1.0, 0.75, 0.50, 0.25, and 0.0 kcal mol−1 Å.

Molecular dynamics for the holo structure were run for 30 ns (including an initial 10 ns to ensure the mutual equilibration of PRAFR and HisH-HisF) with an integration time step of 1 fs using NAMD2.7 21 and the CHARMM27 force field 21, 51. Production molecular dynamics for the (apo) HisH-HisF were run for 20 ns with an initial 1 ns harmonic constraints equilibration phase. All other simulation parameters and benchmarks were the same as the (holo) HisH-HisF simulation. Periodic boundary conditions were used with the NPT ensemble, a pressure of 1 atmosphere using a Langevin piston, and a temperature set to 310 K. A particle mesh Ewald summation was used for the electrostatic approximations 52. The long range van der Waals interactions were given a cut off distance of 12 Å and a switching distance at 10 Å. Simulations were performed on the TACC Ranger cluster using 256 processors. The benchmark for the ~45000 atom systems were 0.0136156 s/step and 0.157588 days/ns.

Protein snapshots were extracted at 500 fs intervals over the 20 ns production phases of both the holo-state and apo-state, resulting in two sets of 40,000 structures for subsequent analysis. Each snapshot was aligned via RMSD to the average position of all atoms within the protein for both systems over the entire 20 ns simulation of production dynamics, in order to remove any rotational or translational motions.

Dynamical Network Model

We explore the use of different sources of correlation information, a.k.a. nodal methods, including: (i) alpha carbon, and the c.o.m. of (ii) the backbone, (iii) sidechain, and (iv) whole-residue representations (residue c.o.m.). In each case, the covariance matrix is normalized, and as a consequence, the position, and not the mass of the node, is important. In order to model PRFAR as a set of nodes, the PRFAR substrate is broken into two moieties, which represent the reaction end products AICAR and ImGP (figure 1). The centers of mass for both the AICAR and ImGP moieties are defined as separate nodes for the analysis.

Methods are carried out using a similar protocol to Ref. 29, 30. A contact map is used to determine Boolean values for node-node interactions. If a residue is within 4.5 Å of another residue for 75% of the 40,000 frame (20 ns) trajectory, then the two residues are considered to be in contact with one another (SI figure 1). Nodes that do not fit these requirements are filtered out. The correlation values are obtained by trajectory data using the program CARMA 53 . The correlation is defined by equations (1) and (2), where Cij is the correlation value between two nodes over a given simulation time (figure 2). The value of Cij is unity when i = j and satisfies the condition that a node correlates completely with itself. In equation (2), the position of a given node at time t is subtracted from the average position of the node over the entire production dynamics (20 ns) and requires that the average position of the node sampled is approximately stationary over time. When comparing the motion of one nodal method to another, equation (1) is also used (figure 3); these are correlations among different nodal methods within the same residue.

Figure 2. Correlation maps.

Figure 2

Entries in the lower right triangular show pairwise correlation values using the alpha-carbon atoms; the upper left triangular shows the correlation values using the c.o.m. of the entire residue.

Figure 3. Correlation of different residue components.

Figure 3

(A) Here correlation between the alpha-carbon and the backbone c.o.m. is shown for each residue. The alpha-carbon is strongly correlated to the backbone c.o.m. and not well correlated to the sidechain c.o.m. suggesting that information given by the alpha carbon is highly biased toward backbone motion. (B) Next, correlation between the alpha-carbon and the c.o.m. for the entire residue is shown. (C) Lastly, comparisons between the alpha-carbon and sidechain c.o.m. show the lowest correlation values and greatest spread, which indicates that information from sidechain motion may be lost in an alpha-carbon representation.

Cij=Δri(t)·Δrj(t)(Δri(t)2Δrj(t)2)1/2 eq. (1)

and

Δri(t)=ri(t)-ri(t) eq. (2)

The use of equation (1) and equation (2) as well as interaction criteria determined through the contact maps generates a filtered correlation matrix.

The edge between a pair of nodes is then given a length in network space by functionalizing the correlation values. A positive or negative correlation is viewed equally in that residue interactions are coherent whether they are correlated or anti-correlated and so the edge lengths take the value wij equation (3) between node i and node j 29, 30:

wij=-log(|Cij|) eq. (3)

Although the values of each length have meaning in network diagrams, they can be mapped to the physical protein space by weighting each edge with a thickness or cylindrical diameter. Therefore, thick edges between nodes are indicative of edges close to one another in network space while thin edges and edges that do not exist are far or infinitely separated respectively (figures 47). Such edges represent the signaling strength between nodes.

Figure 4. Optimal allosteric pathways.

Figure 4

(A) and (B): Here we show the optimal pathway generated from the Floyd-Warshall algorithm using (A) the alpha-carbon method (red spheres) and (B) the residue c.o.m. (orange spheres) correlation method. In both methods, the source is the ImGP moiety and the sink is catalytic residue HisH:Glu180. The ellipsoids (blue dots) are aligned along the principal axes of a given residue to illustrate a residues orientation and spatial displacement. Edges are drawn between nodes and the edge thickness is radially proportional to the weight given by eq. (3).

Figure 7. Comparison of Apo/Holo HisH-HisF for community structure.

Figure 7

(A) and (B): The optimal communities for communication among residues are depicted for both the apo-state and holo-state using the residue c.o.m. method. There are seven communities in the protein dimers for both the apo-state and holo-state: three largely associated with HisH (silver, ochre, and blue) and four associated with HisF (black, orange, green, and ice-blue). However, these communities do extend across the dimer interface. The edges that attach critical nodes between communities are shown as red lines. The black community in the apo-state (A) is larger than the holo-state (B). The ImGP moiety is a member of the black community for the holo-state and this may be the direct cause of the community shift from the larger black community associated with the apo-state to the smaller black community in the holo-state.

One aim of our analysis is to identify putative allosteric communication pathways in proteins, using HisH-HisF as a model system. Evidence of cooperation between the substrate PRFAR (i.e. the source of allosteric signal) and the catalytic triad (i.e. where the allosteric signal ends) is captured within this framework. In this model we assume that the most likely or biologically relevant pathways are those that minimize the network distance. Finding a singular optimal path is equivalent to minimizing the distance traveled between nodes in the network. The length of such a path is the sum of the individual path lengths between each node set i, j equation (4), starting with the source nodes in PRFAR and ending with the three sink nodes in the catalytic triad:

Dij=k,lwk,l eq. (4)

All other paths are deemed suboptimal if they fall within an acceptable deviation from the optimal path. Such pathways are likely to present highly degenerate signaling along the optimal path. Although it is possible to determine numerous suboptimal pathways within the dynamical network analysis framework, we do not investigate suboptimal pathways in this work. Yet, we acknowledge that their analysis may likely lead to further insight into the complexities of allosteric regulation.

Entropy Calculations

The entropy for both holo and apo-states assuming a quasiharmonic approximation was calculated using CARMA 53, which follows the derivation outlined by Andricioaei 54.

Sho=ki3n-6ωi/kTeωi/kT-1-ln(1-e-ωi/kT) eq. (5)

The mass weighted covariance matrix was used for the alpha-carbons and each mass was 12.0107 g mol−1. A mass weighted covariance matrix was also used for the residue c.o.m. method and each mass varied depending on the selected residue. The PRFAR molecule was omitted when comparing entropies of the apo and holo-states in order to maintain identical degrees of freedom between the ligand bound and ligand free systems.

Results

MD Simulations

The MD simulations for HisH-HisF must be sampled over equilibrium conditions in order to use correlations since the average position of a node is required to be approximately stationary over time. To assess each nodal method, 30 ns of MD were run for the holo-state. The RMSD for all 30 ns of molecular dynamics was calculated at each frame for the holo-state and indicates system equilibration after the first 10 ns (SI figure 2); the remaining 20 ns is considered production dynamics and was utilized in the dynamical network analyses. The RMSD for the 20 ns molecular dynamics of the apostate also showed convergence. Trajectories were assessed at 5 ns, 10 ns, 15 ns, and 20 ns to determine the time dependence and convergence of the optimal paths.

Correlation and contact maps

The full correlation matrix, representing both the residue c.o.m. and alpha carbon methods, is presented (figure 2). In parallel, a contact map indicating which residues were closely interacting was generated over the production MD phase for the holo-state. If residues were within 4.5 Å of another residue for 75% of the 40,000 frame (20 ns) trajectory, the correlation value in the correlation matrix was kept; otherwise the correlation value was set to zero. This contact-map filtering prevents edges from being drawn between residues that are physically far apart. The same contact map was used for the holo-state across each time assessment at 5 ns, 10 ns, 15 ns, and 20 ns (SI figure 1).

The contact map for the apo-state was generated from a separate 20 ns trajectory and followed the same criteria as the holo-state. This contact map was used to assess the apo-state over the entire 20 ns trajectory for comparison to the holo-state.

Correlation of nodal-methods in holo HisH-HisF

The correlation of the different nodal methods was analyzed for the holo-state over 20 ns of production dynamics by determining the positional correlation between the alpha-carbon and the c.o.m. of the: (a) backbone, (b) sidechain, and (c) entire residue. This analysis indicates that the motion exhibited by the alpha-carbon is most highly correlated to the backbone motion as a general trend for same residue selections (figure 3). The position of the alpha carbon correlates strongest with the position of the backbone c.o.m., and weakest with the position of the c.o.m. of the sidechain. Between these two extremes, the position of the residue c.o.m., which represents both the backbone and sidechain atoms, exhibits moderate correlation to the alpha carbon position.

Optimal pathways

Comparison of different nodal methods within the holo-state

The optimal communication path is the shortest path between the source of the allosteric signal, in this case PRFAR, and the end-point of the allosteric signal, or sink, which is the catalytic triad (figure 4). The catalytic triad is comprised of three residues HisH:Glu180, HisH:His178, and HisH:Cys84 and therefore three potential sink candidates. Each residue within the triad was thus investigated in the search for the optimal paths. For simplicity, we focus on the sink residue HisH:Glu180 (table 1). However, we note differences in the optimal pathways among all combinations of sources and sinks and provide this additional information (SI table 1).

Table 1. Optimal pathways for Holo ImGP-HisH:Glu180 for each nodal method.

The optimal path from source ImGP to sink HisH:Glu180 is calculated based on 20 ns of production dynamics after an initial 10 ns to allow PRFAR and HisH-HisF to equilibrate. Each entry in the table represents a residue or moiety of PRFAR. The residue name and dimer superset of each residue is indicated above the table. Number entries are associated with the crystal structure 1GPW from the protein data bank. The resulting correlation matrix for each nodal method is calculated at 5 ns, 10 ns, 15, ns and 20 ns. However, the same contact map is used for each and is based on the 20 ns trajectory. For each nodal method shown, the optimal paths appear to converge to a consistent set of residues. Evolutionarily conserved (100% sequence identity) residues are both underlined and bold where partially conserved (greater than 50% sequence identity) residues are just underlined in reference to previous homology studies 8.

graphic file with name nihms393419f8.jpg

The optimal path between source node ImGP and sink node Glu180 captured the same residues at 15 ns and 20 ns for the different nodal methods and the optimal path for the residue c.o.m. method was surprisingly the same for all time assessments (table 1). The pathway between the ImGP moiety and residue Glu180 yielded similar results to the pathway between the AICAR moiety and residue Glu180 when using the alpha-carbon nodal method (SI table 1).

The sidechain c.o.m. method gives an optimal path of ImGP, HisF:Leu50, HisF:Val48, HisF:Thr78, HisF:Lys99, HisF:Asp98, HisH:Lys181, and HisH:Glu180. The residue c.o.m. method gives an optimal path of ImGP, HisF:Leu50, HisF:Gly80, HisF:Val79, HisF:Lys99, HisF:Asp98, HisH:Lys181, and HisH:Glu180. In contrast, the pathway predicted by the alpha-carbon method suggests an optimal pathway connecting the ImGP source through HisF:Leu50, HisF:Phe49, HisF:Phe77, HisF:Pro76, HisF:Lys181, and HisH:Glu180 (sink). The optimal pathway predicted by the backbone c.o.m. method traces through ImGP (source), HisF:Leu50, HisF:Phe49, HisF:Thr78, HisF:Phe77, HisF:Lys181, and HisH:Glu180 (sink).

Comparison of holo- and apo-states using the residue center of mass method

Both the holo-state, with allosteric effector PRFAR bound, and the apo-state simulations sample equilibrium conformations of the protein and provide a comparison between the ligand activated holo-state and the basal (inactive) apo-state. The optimal path for the holo-state is compared to the optimal path of the apostate from ImGP (source) to HisH:Glu180 (sink) using the residue c.o.m. as the preferred nodal method (table 2). The residue c.o.m. method traces an optimal path for the holo-state as: ImGP, HisF:Leu50, HisF:Gly80, HisF:Val79, HisF:Lys99, HisF:Asp98, HisH:Lys181, and HisH:Glu180. In contrast the apo-state gives an optimal path of HisF:Leu50, HisF:Phe49, HisF:Phe77, HisF:Pro76, HisH:Lys181, and HisH:Glu180. A noteworthy switch between the two states is the HisF:Leu50-HisF:Gly80 in the holo-state versus the HisF:Leu50-HisF:Phe49. The apo-state HisF:Leu50-HisF:Phe49 is clearly a backbone dominated correlation where the holo-state HisF:Leu50-HisF:Gly80 suggests a correlation that may depend on sidechain interaction. Another important difference is seen at the interface of the protein dimer, where the apo-state HisH:181Lys-HisF:Pro76 communication path switches to HisH:181Lys-HisF:Asp98 in the holo-state. The HisF:Asp98 residue is a component of the salt bridge verified by experiment to be a contributing residue in the allosteric mechanism of HisH-HisF 8, 55. Although the optimal apo-state path has fewer nodes than the holo-state path, in part due to the absence of ImGP, it has a longer overall distance. This indicates that the coupled residue-residue motions involved in molecular signaling between the effector site and the active site are stronger when PRFAR is bound.

Table 2. Optimal pathways Apo/Holo HisH-HisF using residue center of mass method.

Here we show the optimal pathway between ImGP and HisH:Glu180 for both the apo-state and holo-state simulation with only the c.o.m. method being used (the preferred method for this paper). The apo-state and holo-state of HisH-HisF is compared here and shows a clear switch between both states. The holo-state traces out the slat bridge HisF:Asp98-HisH:Lys181 that is verified experimentally to be part of the allosteric mechanism 8. The apo-state does not provide this bridge. The distance values on the right are calculated according to eq. 4.

graphic file with name nihms393419f9.jpg

Community structure analysis

Comparison between different nodal methods (holo-state)

Community structures depict high regional correlations among nodes within the protein, and these regions are modeled as node aggregates or communities. The community subsets may give insights to allosteric signaling, and, in particular, node pairs that perform inter-community signaling, so-called “critical nodes”, provide possible sites for signal disruption. Although similarities between the alpha-carbon method and residue c.o.m. were exhibited in the community structures, the residue c.o.m. method indicated a significant difference regarding the community ImGP was associated with. Furthermore, certain communities were drastically altered (blue in figure 5) between the two methods.

Figure 5. Community network analysis.

Figure 5

(A) and (B): The optimal communities for communication among residues are depicted, grouped via the Girvan-Newman algorithm. Note the significant change in community structure between the two methods: residue alpha-carbon (A) and residue c.o.m. (B). There are seven communities in the protein dimers: three for HisH (silver, ochre, and blue) and four for HisF (black, orange, green, and ice-blue). The edges that attach critical nodes between communities are shown as red lines and depict high levels of degenerate signaling between community structures. Edge thickness is radially proportional to the weight given by eq. (3).

The alpha-carbon method indicated an inter-community connection between AICAR and ImGP, and the AICAR moiety was associated with a portion of the (β/α)8 barrel (green in figure 6A). The alpha carbon method suggested AICAR and ImGP to be constituents of two separate communities (orange and green in figure 6A) and composed of proportional parts of the (β/α)8 barrel. In the alpha-carbon method, an intra-community connection was exhibited between ImGP and HisF:Cys9, HisF:Gly202, HisF:Gly203, HisF:Leu222, HisF:Ala223, HisF:Ala224, and HisF:Ser225. Intra-community connections were also detected between AICAR and HisF:Gly82, and HisF:Gly81, which was similar to the c.o.m. method.

Figure 6. Inter/Intra-community connections of PRFAR.

Figure 6

(A) and (B): The change in community structure for connections between PRFAR and other immediate residues is depicted. The alpha-carbon method (A) shows ImGP (blue) as a member of the orange community and AICAR (blue) as a member of the green community. Both the green and orange communities largely include two halves of the (β/α)8 barrel in HisF. The residue c.o.m. method (B) shifts the ImGP moiety (blue) into the black community comprising the loop at the C-terminal end of the (β/α)8 barrel.

In the residue c.o.m. method, the ImGP moiety is predicted to belong to the nodal community comprised of a loop near the C-terminus (black in figure 6B) with intra-community connections to HisF:Lys19 and HisF:Asp11. The AICAR moiety for the c.o.m. method belonged to the nodal community composed of a portion of the (β/α)8 barrel (green in figure 6B) with intra-community connections to HisF:Gly82, HisF:Gly81, and HisF:Leu50. Although the alpha carbon and residue c.o.m. methods differ in important details of the community partitioning, both show a split between the ImGP and AICAR moieties of PRFAR in agreement with the catalytic mechanism of HisF.

Community structure comparison between the apo- and holo-states

The residue c.o.m. method was used to derive community structures for both the apo and holo conformational ensembles. Considerable shifts in community aggregates are seen in the HisH domain (figure 7A and 7B, blue regions). The set of community aggregates also rearranged themselves in the HisF domain between apo and holo states (figure 7A and 7B, black regions). Both the apo- and holo- states provided seven community aggregates; three community aggregates in the HisH domain and four in HisF. However, differences in community crossover at the dimer interface are present.

The black community in both the apo- and holo- states contains the flexible loop at the C-terminal end of the (β/α)8 barrel. The black community is clearly altered between the apo- and holo- states (figure 7). The residues that both states have in common are HisF:Asp11 thru HisF:Glu34 and residue HisF:Ile52. Root mean squared deviation (RMSD) analysis was used to quantify the structural differences for this set of common residues between both states over the course of the trajectories (SI figures 3 and 4). The average RMSD for the holo-state is 2.166 Å with a standard deviation of 0.412 Å, and for the apo-state is 1.416 Å with a standard deviation of 0.322 Å. Addition of PRFAR at the effector site (holo-state) increased the RMSD as well as the fluctuations about the average protein structure of the loop. This corresponds to a decrease in correlations within the black community and provides a distinct shift to the holo-state as seen by the extension of the green community into regions occupied by the black community in the apo-state (figure 7A and 7B).

Critical node comparison between the apo- and holo-states

Critical nodes represent strong signaling conduits between communities. Critical nodes were analyzed for both the apo-state and holo-state simulations (table 3). Four critical nodes were the same across both the apo-state and holo-state in HisF. These nodes were HisF:Arg5, HisF:Glu46, HisF:Lys99 and HisF:Val127. All of these nodes are evolutionarily conserved except for HisF:Val127 which is partially conserved 8. For the HisH domain, six critical nodes were the same. The evolutionarily conserved node is HisH:Tyr138. The partially conserved nodes are HisH:Val51, HisH:Tyr143, HisH:Val193 8. The two nodes that did not show conservation are HisH:Arg18 and HisH:Leu189.

Table 3. Critical nodes in Apo/Holo HisH-HisF using residue center of mass method.

The critical nodes in a community structure present high signal degeneracy between community aggregates. Here we compare the critical nodes calculated for the apo-state and holo-state. The preservation of critical nodes across both states (grey columns) may be important to protein structure and function. Residues that are evolutionarily conserved are both bold and underlined while partial conservation is indicated by just an underline. Residues that are not evolutionarily conserved are written in the column headers as plain text.

HisF Critical Nodes
Arg lle Val Gly Pro Leu Ser Asp Glu Val Ala Glu lle Pro Asp Lys lle lle Ala Gly
Holo 5 17 30 32 35 46 69 71 99 113 116 117 121
Apo 5 7 40 45 46 70 75 76 98 99
Gin Val Val Trp Glu Asp Thr Ser Leu Ala Leu Val Asn Arg Leu Glu Gly
Holo 123 126 127 167 176 249
Apo 127 156 178 180 211 213 222 246 247 250 251 252
HisH Critical Nodes
Arg Val Gly Met Leu lle Ser Glu Arg Trp Tyr Tyr Tyr His Lys Lys lle Leu Val ImGP/AICAR
Holo 18 51 58 67 94 117 123 138 143 158 189 193 ImGP/AICAR
Apo 18 51 57 61 96 138 143 178 181 184 185 189 193 none

Entropy calculations between apo and holo-states

The entropy of the apo and holo-states were calculated for both the alpha-carbon and c.o.m. methods using a quasiharmonic approximation 54. The alpha-carbon method provided entropy values of (13192.54 J mol−1 K−1) and (13664.46 J mol−1 K−1) for the respective apo and holo-states. Whereas the residue c.o.m. method provided entropy values of (27540.41 J mol−1 K−1) and (27905.33 J mol−1 K−1). These results show a positive entropy gain, upon PRFAR binding, with values (ΔSαc = 471.92 J mol−1 K−1) and (ΔScom = 364.92 J mol−1 K−1) for the alpha-carbon and c.o.m. methods, respectively.

DISCUSSION

In this work, we have examined four types of networks by varying node representations in Cartesian coordinates: the (i) residue alpha-carbons, (ii) sidechain c.o.m., (iii) backbone c.o.m., and the entire (iv) residue c.o.m.. We applied these to imidazole glycerol phosphate synthase (IGPS) in order to test for allosteric signaling pathways within the framework of dynamical network analysis. The analysis was carried out at equilibrium conditions and the resulting pathways were sensitive to which node representation was used.

In our results, the alpha-carbon method failed to indicate interplay between HisH:Lys181 and HisF:Asp98 for the holo-state optimal path ImGP to HisH:Glu180, whereas the residue c.o.m. and sidechain c.o.m. methods succeeded in showing this particular interaction at the interface (table 1). Detecting HisH:Lys181 and HisF:Asp98 is a promising feature of the residue c.o.m. methodology because this salt bridge has been experimentally validated to participate in allostery for the cyclase half reaction 8, 55. Furthermore, HisH:Lys181 and HisF:Asp98 are coupled to the active site of the catalytic triad through close physical proximity and strong correlations. The holo-state of HisH:Lys181 and HisF:Asp98 forms a long-lasting contact across the dimer interface, confirmed by our simulations (data not shown), and both residues are directly adjacent to the glutaminase active site.

Mutations of HisF:Thr104 and HisF:Lys19, assessed in previous works, 8, 55 affected the glutaminase half reaction and may be attributed to destroying the functionality of the receptor site where PRFAR binds. In our simulation, the residue HisF:Lys19 is adjacent to the ImGP moiety of PRFAR and the Girvan-Newman algorithm clusters both into the same community when based on the residue c.o.m. methodology (black, figure 7). Trajectories are distinct between the apo-state, where HisF:Lys19 is not linked to PRFAR, and the holo-state, where HisF:Lys19 is directly linked to the ImGP moiety of PRFAR and indirectly to the (β/α)8 barrel (green and orange, figure 7). When analyzing the RMSD for both the apo-state and holo-state (SI figures 3 and 4), the variations are more stationary for the apo-state and less so for the holo-state over the 20 ns trajectory. The holo-state may comprise specific harmonic, and anharmonic motions between the loop structure and components of the (β/α)8 barrel that is mediated by the ImGP moiety of PRAFR. This would imply that HisF:Lys19 plays a role, for the holo-state, via signaling attributes of the loop structure by channeling motion through a direct connection into the optimal path from ImGP to HisH:Glu180. The apo- and holo-state RMSD time series are consistent with previous NMR studies for this loop structure and may indicate signaling attributes enveloped on the millisecond timescale 56.

Comparison of the different nodal methods indicates that the alpha-carbon and backbone c.o.m. methods are strongly correlated (figure 3). This analysis suggests a clear bias of the alpha-carbon to detect the same signals as the residue backbone.

Moreover, using the covariance matrix to calculate quasiharmonic entropy values gives notably different results when using the alpha-carbon versus the residue c.o.m. method. Collectively, these results suggest that biologically relevant information may be lost if sidechain substituents are not taken into account, and emphasizes the importance of accounting for sidechain dynamics in allosteric signaling captured through the use of the residue c.o.m.. Including sidechain information in dynamical network models may be essential to providing an accurate analysis of allosteric pathways as well as hierarchical clustering of dynamical networks.

Evolved interdependence of the two domains

Interdependence between HisH and HisF exists as a result of protein function. Elegant biochemical kinetic experiments for HisH-HisF have decoupled the two enzyme reactions in vitro. Constructs in bacteria, where the two domains are encoded on separate polypeptide chains, have shown that the cyclase reaction can occur via HisF in non-biological conditions (50 mM Tris-acetate buffer, pH 8.5, at 25 °C), whereas the glutaminase reaction of isolated HisH has no detectable activity 57. Consequently, residues that form a foundation for the dependence of the two domains may be highly conserved due to evolutionary pressures. Some of these residues are likely along the signaling pathway.

Every residue along the optimal path determined with our residue c.o.m. analysis, from ImGP to HisH:Glu180, is known to be evolutionarily conserved 8. This result is in contrast to the alpha-carbon method, which also included a non-conserved residue HisF:Thr78 in its optimal path (table 1). Furthermore, many residues along the (β/α)8 barrel were highly selected for in both the apo-state and holo-state along the optimal paths for all methods. This result is consistent with the conserved nature of the barrel structure across the sub family of GAT enzymes, and is a quantitative argument against possible circuitous routes via other secondary structural elements of the protein. The pathway comparison between apo- and holo-state (table 2) may also provide insights to basal turnover as HisH-HisF produces low levels of glutamate in the half reaction when PRFAR is not bound.

Community structure as an allosteric descriptor

Overall community structure is a function of intra-communication among a set of residues, and consequently provides a coarse grained intercommunication between a set of communities. The community results are derived from different nodal methods and indicate that the choice used to determine the correlation matrices is important (figure 57). When using the residue c.o.m. method, a clear shift among community structures is seen between the apo- and holo-states. The loop structure at the base of the (β/α)8 barrel directly influences the ImGP moiety of PRFAR as previously mentioned. However, other shifts in community structure may be more relevant. At the dimer interface, clear changes exist between the apo- and holo-state (figure 7).

Critical nodes at the interface of the communities may be adequate descriptors of a protein's state. Even though critical nodes may not be part of the optimal pathway, they are assumed to be important to the configurational ensemble. For example, a critical node may represent a pharmaceutically relevant site for allosteric inhibition. The critical nodes between apo- and holo-states also shift (table 3). However, some residues preserve this critical node quality between both apo- and holo-states. For instance, HisF:Lys99 is evolutionarily conserved and quantified as critical for both states. Whereas, HisF:Asp98 and HisH:Lys181 are also conserved but are only critical nodes for the apostate. The allosteric importace of HisF:Asp98 and HisH:Lys181 has already been established, and their split into separate communities is a feature of the apo-state that quantifies the separation and extent of influence between the two residues.

From the community analysis, ImGP is also considered a critical node and carries information from the black community to both the green and orange communities which are commensurate halves of the (β/α)8 barrel. Furthermore, AICAR is also a critical node and this reflects the flexible characteristics of the PRFAR metabolite as both moieties correlate with different communities (figure 6B).

Limitations of the model

The optimal pathways for allosteric signaling were based on a contact-map-dependent methodology defined previously 29, 30. Residues within a particular distance of another residue for some percentage of the simulation are assumed to influence the communication pathway directly and residues that do not satisfy these constraints are removed from consideration. However, other methods for generating a contact map may improve definitive connections within the protein network. For example, a reasonable alternative contact map may use values that are not Boolean, instead relying on a continuous function that varies depending on the distance between nodes. Alternate methods for defining contacts among residues is beyond the scope of this paper, though we note such efforts may impact results and should be considered for future dynamical network models.

The available high-resolution crystal structures of the glutaminase domain have shown that the oxyanion hole required for glutaminase activity is not properly formed in the absence of the cyclase domain or the effector molecule 31, 5860. In fact, the rearrangement of the active site to a catalytically competent conformation is a hallmark feature of the regulation of nitrogen metabolism across the entire family of glutamine amidotransferases. The energy barrier required to overcome this oxyanion strand rearrangement in HisH-HisF has been estimated at over 20 kcal/mol 56. Notably, these crystal structures, as well as the simulations presented here, did not contain the glutamine substrate at the active site of HisH. Our work assesses the effects of PRFAR alone. Further studies are warranted in order to determine whether the inclusion of glutamine within the active site of HisH-HisF, either covalently bound to the active site, cysteine, or simply present locally, would effect the networks and community structures determined from the PRFAR-bound state.

Using a network approach to allostery is both intuitive and instructive for garnering information as to what residues may play crucial roles in an allosteric mechanism. However, there are limitations on how our correlation metrics may provide insight. Not all correlations are taken into account, as there may be correlations that exist but are computationally orthogonal to one another due to bond constraints. Furthermore, the covariance matrix assumes fluctuations about an average structure, but the structural average may not be adequate for anharmonic type motions.

Conclusions

In this work, two of five residues (HisH:Lys181 and HisF:Asp98), experimentally found to facilitate allostery in hisH-hisF, were determined to be along the optimal pathway of our network model when using correlation information determined from the residue c.o.m. method. The other three residues appear to be related to the disruption of the allosteric state, for which we find strong qualitative and quantitative arguments that HisF:Lys19 is a crucial link between ImGP and the loop structure at the base of the (β/α)8 barrel. This loop structure is distinctly different for the apo-state and holo-state and is arguably a feature of the allostery exhibited in HisH-HisF. The alpha-carbon method was clearly similar to the backbone c.o.m. method and traced out an optimal pathway that did not emphasize the dimer sidechain interactions for ImGP (source) through HisH:Glu180 (sink). Using the covariance matrix to calculate quasiharmonic entropy values gives notably different results when using the alpha-carbon versus the residue c.o.m. method. In principal, quasiharmonic vibrational energies are also dependent on this choice and may lead to inadequate values for properties of the system, such as heat capacity. As the residue c.o.m. is a composite set of the backbone and sidechain groups, our works suggest that this method provides superior information in contrast to the popular alpha-carbon approach.

Supplementary Material

1_si_001

Acknowledgments

This work was funded in part by the National Institutes of Health through the NIH Director's New Innovator Award Program 1-DP2-OD007237 and through the NSF XSEDE Supercomputer resources grant LRAC CHE060073N to R.E.A. Simulations were run at the Texas Advanced Computing Center. This work was also supported by the NIH Resource for Macromolecular Modeling and Bioinformatics P41-RR05969 and NSF grant MCB-0844670 to Z.L.S. We would also like to thank Dr. Robert Swift for his insight and consultation pertaining to general topics of molecular dynamics.

Footnotes

Supporting Information Available

Four figures and two tables. This information is available free of charge at http://pubs.acs.org.

References

  • 1.Changeux JP. 50th anniversary of the word “allosteric”. Protein Sci. 2011;20:1119–24. doi: 10.1002/pro.658. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 2.Monod J, Changeux JP, Jacob F. Allosteric proteins and cellular control systems. J Mol Biol. 1963;6:306–29. doi: 10.1016/s0022-2836(63)80091-1. [DOI] [PubMed] [Google Scholar]
  • 3.Monod J, Wyman J, Changeux JP. On the Nature of Allosteric Transitions: A Plausible Model. J Mol Biol. 1965;12:88–118. doi: 10.1016/s0022-2836(65)80285-6. [DOI] [PubMed] [Google Scholar]
  • 4.Rubin MM, Changeux JP. On the nature of allosteric transitions: implications of non-exclusive ligand binding. J Mol Biol. 1966;21:265–74. doi: 10.1016/0022-2836(66)90097-0. [DOI] [PubMed] [Google Scholar]
  • 5.Suel GM, Lockless SW, Wall MA, Ranganathan R. Evolutionarily conserved networks of residues mediate allosteric communication in proteins. Nat Struct Biol. 2003;10:59–69. doi: 10.1038/nsb881. [DOI] [PubMed] [Google Scholar]
  • 6.Chen Y, Reilly K, Chang Y. Evolutionarily conserved allosteric network in the Cys loop family of ligand-gated ion channels revealed by statistical covariance analyses. J Biol Chem. 2006;281:18184–92. doi: 10.1074/jbc.M600349200. [DOI] [PubMed] [Google Scholar]
  • 7.del Sol A, Fujihashi H, Amoros D, Nussinov R. Residues crucial for maintaining short paths in network communication mediate signaling in proteins. Mol Syst Biol. 2006;2:1–12. doi: 10.1038/msb4100063. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8.Amaro RE, Sethi A, Myers RS, Davisson VJ, Luthey-Schulten ZA. A network of conserved interactions regulates the allosteric signal in a glutamine amidotransferase. Biochemistry. 2007;46:2156–73. doi: 10.1021/bi061708e. [DOI] [PubMed] [Google Scholar]
  • 9.Lockless SW, Ranganathan R. Evolutionarily conserved pathways of energetic connectivity in protein families. Science. 1999;286:295–9. doi: 10.1126/science.286.5438.295. [DOI] [PubMed] [Google Scholar]
  • 10.Koshland DE, Jr, Nemethy G, Filmer D. Comparison of experimental binding data and theoretical models in proteins containing subunits. Biochemistry. 1966;5:365–85. doi: 10.1021/bi00865a047. [DOI] [PubMed] [Google Scholar]
  • 11.Changeux JP, Edelstein SJ. Allosteric mechanisms of signal transduction. Science. 2005;308:1424–8. doi: 10.1126/science.1108595. [DOI] [PubMed] [Google Scholar]
  • 12.Tsai CJ, del Sol A, Nussinov R. Allostery: absence of a change in shape does not imply that allostery is not at play. J Mol Biol. 2008;378:1–11. doi: 10.1016/j.jmb.2008.02.034. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13.Goodey NM, Benkovic SJ. Allosteric regulation and catalysis emerge via a common route. Nat Chem Biol. 2008;4:474–82. doi: 10.1038/nchembio.98. [DOI] [PubMed] [Google Scholar]
  • 14.Chennubhotla C, Yang Z, Bahar I. Coupling between global dynamics and signal transduction pathways: a mechanism of allostery for chaperonin GroEL. Mol Biosyst. 2008;4:287–92. doi: 10.1039/b717819k. [DOI] [PubMed] [Google Scholar]
  • 15.Cooper A, Dryden DTF. Allostery without Conformational Change - a Plausible Model. Eur Biophys J Biophys Lett. 1984;11:103–109. doi: 10.1007/BF00276625. [DOI] [PubMed] [Google Scholar]
  • 16.Arkin MR, Wells JA. Small-molecule inhibitors of protein-protein interactions: progressing towards the dream. Nat Rev Drug Discovery. 2004;3:301–17. doi: 10.1038/nrd1343. [DOI] [PubMed] [Google Scholar]
  • 17.Lipari G, Szabo A. Model-Free Approach to the Interpretation of Nuclear Magnetic-Resonance Relaxation in Macromolecules .1. Theory and Range of Validity. J Am Chem Soc. 1982;104:4546–4559. [Google Scholar]
  • 18.Lipari G, Szabo A. Model-Free Approach to the Interpretation of Nuclear Magnetic-Resonance Relaxation in Macromolecules .2. Analysis of Experimental Results. J Am Chem Soc. 1982;104:4559–4570. [Google Scholar]
  • 19.Sapienza PJ, Lee AL. Using NMR to study fast dynamics in proteins: methods and applications. Curr Opin Pharmacol. 2010;10:723–30. doi: 10.1016/j.coph.2010.09.006. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 20.Yang D, Kay LE. Contributions to conformational entropy arising from bond vector fluctuations measured from NMR-derived order parameters: application to protein folding. J Mol Biol. 1996;263:369–82. doi: 10.1006/jmbi.1996.0581. [DOI] [PubMed] [Google Scholar]
  • 21.Phillips JC, Braun R, Wang W, Gumbart J, Tajkhorshid E, Villa E, Chipot C, Skeel RD, Kale L, Schulten K. Scalable molecular dynamics with NAMD. J Comput Chem. 2005;26:1781–802. doi: 10.1002/jcc.20289. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 22.Brooks BR, Brooks CL, 3rd, Mackerell AD, Jr, Nilsson L, Petrella RJ, Roux B, Won Y, Archontis G, Bartels C, Boresch S, Caflisch A, Caves L, Cui Q, Dinner AR, Feig M, Fischer S, Gao J, Hodoscek M, Im W, Kuczera K, Lazaridis T, Ma J, Ovchinnikov V, Paci E, Pastor RW, Post CB, Pu JZ, Schaefer M, Tidor B, Venable RM, Woodcock HL, Wu X, Yang W, York DM, Karplus M. CHARMM: the biomolecular simulation program. J Comput Chem. 2009;30:1545–614. doi: 10.1002/jcc.21287. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 23.Horstink LM, Abseher R, Nilges M, Hilbers CW. Functionally important correlated motions in the single-stranded DNA-binding protein encoded by filamentous phage Pf3. J Mol Biol. 1999;287:569–77. doi: 10.1006/jmbi.1999.2629. [DOI] [PubMed] [Google Scholar]
  • 24.Watney JB, Hammes-Schiffer S. Comparison of coupled motions in Escherichia coli and Bacillus subtilis dihydrofolate reductase. J Phys Chem B. 2006;110:10130–8. doi: 10.1021/jp0605956. [DOI] [PubMed] [Google Scholar]
  • 25.Bradley MJ, Chivers PT, Baker NA. Molecular dynamics simulation of the Escherichia coli NikR protein: equilibrium conformational fluctuations reveal interdomain allosteric communication pathways. J Mol Biol. 2008;378:1155–73. doi: 10.1016/j.jmb.2008.03.010. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 26.Lange OF, Grubmuller H. Generalized correlation for biomolecular dynamics. Proteins. 2006;62:1053–61. doi: 10.1002/prot.20784. [DOI] [PubMed] [Google Scholar]
  • 27.Killian BJ, Yundenfreund Kravitz J, Gilson MK. Extraction of configurational entropy from molecular simulations via an expansion approximation. J Chem Phys. 2007;127:024107. doi: 10.1063/1.2746329. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 28.McClendon CL, Friedland G, Mobley DL, Amirkhani H, Jacobson MP. Quantifying Correlations Between Allosteric Sites in Thermodynamic Ensembles. J Chem Theory Comput. 2009;5:2486–2502. doi: 10.1021/ct9001812. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 29.Sethi A, Eargle J, Black AA, Luthey-Schulten Z. Dynamical networks in tRNA:protein complexes. Proc Natl Acad Sci U S A. 2009;106:6620–5. doi: 10.1073/pnas.0810961106. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 30.Eargle J, Sethi A, Li L, Luthey-Schulten Z. Dynamical Network Analysis Plugin for V.M.D. Tutorial. 2012 [Google Scholar]
  • 31.Chaudhuri BN, Lange SC, Myers RS, Chittur SV, Davisson VJ, Smith JL. Crystal structure of imidazole glycerol phosphate synthase: a tunnel through a (β/α)8 barrel joins two active sites. Structure. 2001;9:987–97. [PubMed] [Google Scholar]
  • 32.Chaparian MG, Evans DR. The catalytic mechanism of the amidotransferase domain of the Syrian hamster multifunctional protein CAD. Evidence for a CAD-glutamyl covalent intermediate in the formation of carbamyl phosphate. J Biol Chem. 1991;266:3387–95. [PubMed] [Google Scholar]
  • 33.Roux B, Walsh CT. p-aminobenzoate synthesis in Escherichia coli: kinetic and mechanistic characterization of the amidotransferase PabA. Biochemistry. 1992;31:6904–10. doi: 10.1021/bi00145a006. [DOI] [PubMed] [Google Scholar]
  • 34.Nakamura J, Straub K, Wu J, Lou L. The glutamine hydrolysis function of human GMP synthetase. Identification of an essential active site cysteine. J Biol Chem. 1995;270:23450–5. doi: 10.1074/jbc.270.40.23450. [DOI] [PubMed] [Google Scholar]
  • 35.Willemoes M. Thr-431 and Arg-433 are part of a conserved sequence motif of the glutamine amidotransferase domain of CTP synthases and are involved in GTP activation of the Lactococcus lactis enzyme. J Biol Chem. 2003;278:9407–11. doi: 10.1074/jbc.M212995200. [DOI] [PubMed] [Google Scholar]
  • 36.Lang D, Thoma R, Henn-Sax M, Sterner R, Wilmanns M. Structural evidence for evolution of the β/α barrel scaffold by gene duplication and fusion. Science. 2000;289:1546–50. doi: 10.1126/science.289.5484.1546. [DOI] [PubMed] [Google Scholar]
  • 37.Myers RS, Jensen JR, Deras IL, Smith JL, Davisson VJ. Substrate-induced changes in the ammonia channel for imidazole glycerol phosphate synthase. Biochemistry. 2003;42:7013–22. doi: 10.1021/bi034314l. [DOI] [PubMed] [Google Scholar]
  • 38.Amaro R, Tajkhorshid E, Luthey-Schulten Z. Developing an energy landscape for the novel function of a (β/α)8 barrel: ammonia conduction through HisF. Proc Natl Acad Sci U S A. 2003;100:7599–604. doi: 10.1073/pnas.1331150100. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 39.Alifano P, Fani R, Lio P, Lazcano A, Bazzicalupo M, Carlomagno MS, Bruni CB. Histidine biosynthetic pathway and genes: structure, regulation, and evolution. Microbiol Rev. 1996;60:44–69. doi: 10.1128/mr.60.1.44-69.1996. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 40.Chaudhuri BN, Lange SC, Myers RS, Davisson VJ, Smith JL. Toward understanding the mechanism of the complex cyclization reaction catalyzed by imidazole glycerolphosphate synthase: crystal structures of a ternary complex and the free enzyme. Biochemistry. 2003;42:7003–12. doi: 10.1021/bi034320h. [DOI] [PubMed] [Google Scholar]
  • 41.Warshall S. A Theorem on Boolean Matrices. J Assoc Comput Mach. 1962;9:11. [Google Scholar]
  • 42.Floyd RW. Algorithm-97 - Shortest Path. Commun ACM. 1962;5:345–345. [Google Scholar]
  • 43.Girvan M, Newman ME. Community structure in social and biological networks. Proc Natl Acad Sci U S A. 2002;99:7821–6. doi: 10.1073/pnas.122653799. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 44.Michał R, Mats3 O. Graphical analysis of pH-dependent properties of proteins predicted using PROPKA. BMC Structural Biology. doi: 10.1186/1472-6807-11-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 45.Roberts E, Eargle J, Wright D, Luthey-Schulten Z. MultiSeq: unifying sequence and structure data for evolutionary analysis. BMC Bioinf. 2006;7:382. doi: 10.1186/1471-2105-7-382. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 46.Russell RB, Barton GJ. Multiple Protein-Sequence Alignment from Tertiary Structure Comparison - Assignment of Global and Residue Confidence Levels. Proteins: Struct, Funct, Genet. 1992;14:309–323. doi: 10.1002/prot.340140216. [DOI] [PubMed] [Google Scholar]
  • 47.Russell RB, Barton GJ. The Limits of Protein Secondary Structure Prediction Accuracy from Multiple Sequence Alignment. J Mol Biol. 1993;234:951–957. doi: 10.1006/jmbi.1993.1649. [DOI] [PubMed] [Google Scholar]
  • 48.Amaro RE, Myers RS, Davisson VJ, Luthey-Schulten ZA. Structural elements in IGP synthase exclude water to optimize ammonia transfer. Biophys J. 2005;89:475–87. doi: 10.1529/biophysj.104.058651. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 49.Amaro R, Luthey-Schulten Z. Molecular dynamics simulations of substrate channeling through an α–β barrel protein. Chem Phys. 2004;307:147–155. [Google Scholar]
  • 50.Grubmuller H. Solvate V 1.0. Theoretical Biophysics Group, Institute for Medical Optics, Ludwig-Maximilians University; Munich: 1996. [Google Scholar]
  • 51.Bjelkmar Pr, Larsson P, Cuendet MA, Hess B, Lindahl E. Implementation of the CHARMM Force Field in GROMACS: Analysis of Protein Stability Effects from Correction Maps, Virtual Interaction Sites, and Water Models. J Chem Theory Comput. 2010;6:459–466. doi: 10.1021/ct900549r. [DOI] [PubMed] [Google Scholar]
  • 52.Darden T, York D, Pedersen L. Particle Mesh Ewald - an N.Log(N) Method for Ewald Sums in Large Systems. J Chem Phys. 1993;98:10089–10092. [Google Scholar]
  • 53.Glykos NM. Software news and updates. Carma: a molecular dynamics analysis program. J Comput Chem. 2006;27:1765–8. doi: 10.1002/jcc.20482. [DOI] [PubMed] [Google Scholar]
  • 54.Andricioaei I, Karplus M. On the calculation of entropy from covariance matrices of the atomic fluctuations. J Chem Phys. 2001;115:6289–6292. [Google Scholar]
  • 55.Myers RS, Amaro RE, Luthey-Schulten ZA, Davisson VJ. Reaction coupling through interdomain contacts in imidazole glycerol phosphate synthase. Biochemistry. 2005;44:11974–85. doi: 10.1021/bi050706b. [DOI] [PubMed] [Google Scholar]
  • 56.Lipchock JM, Loria JP. Nanometer Propagation of Millisecond Motions in V-Type Allostery. Structure. 2010;18:1596–1607. doi: 10.1016/j.str.2010.09.020. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 57.Beismann-Driemeyer S, Sterner R. Imidazole glycerol phosphate synthase from Thermotoga maritima. Quaternary structure, steady-state kinetics, and reaction mechanism of the bienzyme complex. J Biol Chem. 2001;276:20387–96. doi: 10.1074/jbc.M102012200. [DOI] [PubMed] [Google Scholar]
  • 58.Douangamath A, Walker M, Beismann-Driemeyer S, Vega-Fernandez MC, Sterner R, Wilmanns M. Structural evidence for ammonia tunneling across the (β/α)8 barrel of the imidazole glycerol phosphate synthase bienzyme complex. Structure. 2002;10:185–193. doi: 10.1016/s0969-2126(02)00702-5. [DOI] [PubMed] [Google Scholar]
  • 59.Omi R, Mizuguchi H, Goto M, Miyahara I, Hayashi H, Kagamiyama H, Hirotsu K. Structure of imidazole glycerol phosphate synthase from Thermus thermophilus HB8: Open-closed conformational change and ammonia tunneling. J Biochem. 2002;132:759–765. doi: 10.1093/oxfordjournals.jbchem.a003284. [DOI] [PubMed] [Google Scholar]
  • 60.Korolev S, Skarina T, Evdokimova E, Beasley S, Edwards A, Joachimiak A, Savchenko A. Crystal structure of glutamine amidotransferase from Thermotoga maratama. Proteins: Struct, Funct, Genet. 2002;49:420–422. doi: 10.1002/prot.10161. [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

1_si_001

RESOURCES