Abstract
Transcriptional activation domains (ADs) are generally thought to be intrinsically unstructured, but capable of adopting limited secondary structure upon interaction with a coactivator surface. The indeterminate nature of this interface made it hitherto difficult to study structure/function relationships of such contacts. Here we used atomistic accelerated molecular dynamics (aMD) simulations to study the conformational changes of the GCN4 AD and variants thereof, either free in solution, or bound to the GAL11 coactivator surface. We show that the AD-coactivator interactions are highly dynamic while obeying distinct rules. The data provide insights into the constant and variable aspects of orientation of ADs relative to the coactivator, changes in secondary structure and energetic contributions stabilizing the various conformers at different time points. We also demonstrate that a prediction of α-helical propensity correlates directly with the experimentally measured transactivation potential of a large set of mutagenized ADs. The link between α-helical propensity and the stimulatory activity of ADs has fundamental practical and theoretical implications concerning the recruitment of ADs to coactivators.
Author Summary
The regulated transcription of eukaryotic genes is governed by gene-specific transcription factors that contain activation domains to stimulate the expression of nearby genes. Activation domains are unable to take up a defined three-dimensional conformation. Nevertheless, as we demonstrate in our study, molecular dynamics simulations reveal that the key docking point of such domains (centered around several large hydrophobic amino acid sidechains) folds into fluctuating α-helical conformations. Analysis of published data shows that this tendency of adopting such local structures correlates directly with stimulation activity. We also investigate the interaction of these structurally unstable domains with a coactivator interaction partner. Computational simulations are ideally suited for analysing the rapidly changing, "fuzzy" interactions occurring between these protein partners. We gained new insights into the competitive nature of the key hydrophobic sidechains in binding to a pocket on the coactivator surface and documented for the first time the rapidly changing movements of an activation domain during these interactions.
Introduction
Control of gene expression plays a crucial role throughout all three evolutionary domains of life, allowing cells to establish cellular identity, adapt to environmental challenges and prevent diseases caused by misregulation of transcription [1]. The expression of the genome is controlled predominantly by a network of gene-specific transcription factors (GSTFs) that, after binding to target sites on DNA, regulate the rate of expression of nearby genes. GSTFs performing as transcriptional activators usually contain one or multiple activation domains (ADs; [2]) that orchestrate localized remodelling of the chromatin structure, enhanced recruitment of components of the basal transcriptional machinery on the core promoter and/or stimulate promoter escape and subsequent elongation events [3–6]. These activities typically require binding of the ADs to coactivators that integrate and convey activation signals to other components of the transcriptional machinery [6,7]. The Mediator complex surrounding the basal transcriptional machinery during transcription initiation [8–11] contains coactivators that have been shown experimentally to interact with ADs to regulate gene-specific transcription (Fig 1; [11–13]).
While more than 50 common structural motifs have been described for the DNA-binding domains, the available knowledge concerning the structure and function of ADs is comparatively limited [14]. The first ADs described almost three decades ago were shown to be both necessary and sufficient to confer the transcriptional stimulatory properties [2,15]. From a structural perspective, ADs are often characterized by their unusual primary amino acid sequence abundant in acidic amino acids, glutamine or proline residues [14–17]. The enrichment for such amino acids is thought to discourage the formation of higher order structures and thus results in an intrinsically disordered structure ("acid blobs and negative noodles" or "polypeptide lasso" structures [18–20]). In turn, the intrinsic disorder allows ADs to interact in a highly adaptable manner with a range of coactivators, culminating in a synergistic regulation of the basal transcriptional machinery by one or multiple activators (Fig 1; [21,22]). The affinity of AD-coactivator binding is reasonably high (low micro- to high nanomolar range [12,21,23]) and results in interactions lasting for several milliseconds. NMR-studies provided structural insights into a various aspects of AD-coactivator complexes (TFIID/Taf40-VP16 [24]; TFIIH/Tfb1-VP16 (PDB#2K2U [23]); NcoA1-STAT6 (PDB#1OJ5 [25]); MDM2-p53 (PDB#1YCQ [26]); CBP-CREB (PDB#1KDX [27]; MED25/VP16 (PDB#2XNF [12] and 2KY6 [13]; GAL11-GCN4 (PDB#2LPB [11]). Site-directed mutagenesis and structural studies have shown that evolutionarily highly conserved bulky hydrophobic residues within ADs play a key structural role in mediating interactions with coactivators (Fig 2A and S1 Text, [16,23,24,26,28]). When bound to coactivators, ADs form a "fuzzy" family of stochastically related structures (Fig 2D, [29–31]).
Many of the yet unanswered questions regarding AD-coactivator interactions are challenging to address experimentally, especially those concerning the dynamic range of AD conformations over time, key interaction points on coactivator surfaces, the energetics of such interactions and the structures of ADs prior to binding coactivators. Computational approaches are highly effective to model such systems on the atomic level, to study their behavior and gain new mechanistic insights that consolidate present knowledge and guide future experimental work. Here we describe the results obtained from a series of long, fully atomistic molecular dynamics simulations focusing on the experimentally well-characterized GCN4-GAL11 system from Saccharomyces cerevisiae. Accelerated molecular dynamics (aMD) methods [33,34] provide powerful tools for investigating the binding of the ADs to their coactivator targets, as well as for studying the structural properties of ADs in isolation. We describe the structural interplay of AD-coactivator complexes and explore an extensive experimental data set based on synthetic AD variants to demonstrate a high degree of correlation between the α-helix propensity, degree of "fuzziness" and the transactivation potential.
Results
Microsecond simulations of the GAL11-GCN4 complex: Structural and energetic aspects
The yeast transcriptional activator GCN4 contains two tandemly arranged ADs (Fig 2A) that stimulate the expression of more than 70 "downstream" genes. The GCN4 ADs achieve this task by targeting a variety of components of the basal transcriptional machinery, including the coactivator GAL11 (also known as MED15) within the mediator complex [21]). GAL11 contains three structurally independent AD-binding domains ("Activator-Binding Domains" ["ABDs"]; Fig 2A). For one of these, ABD-1, a high-resolution structure shows a stable α-helical structure that includes a groove for interactions with ADs (PDB#2LPB; Fig 2B–2D; [11]). NOE and spin-labeling data of GAL11/ABD-1 complexed with the central AD of GCN4 (GCN4-cAD) were used to create several models illustrating the diversity of interaction between this coactivator and the AD. The bound cAD models contain a short helical stretch (encompassing GCN4 residues 116 to 124) that includes three large hydrophobic residues (W120, L123 and F124) highly conserved during evolution (Fig 2A and S1 Text). The coactivator GAL11/ABD-1 interaction surface displays three computationally detectable "hot spots" ("Pocket #1", "Pocket #2" and "Pocket #3") [32] that are distinguished by their concave topology and potentially become occupied by these particular GCN4 hydrophobic residues (Fig 2C).
We subjected PDB#2LPB-model 1 to extensive aMD simulations to gain deeper insight into various structural aspects, such as variation in AD secondary structure, orientation relative to the coactivator surface and energetic changes underpinning the conformational changes. Simulations were carried out as four independent replica runs with different initial Boltzmann distributions of particle velocities. Each simulation lasted for one microsecond, but the results reflect a period around two or three orders of magnitude longer due to the acceleration protocol used (that is, hundreds of microsecond- to millisecond-range; Table 1).
Table 1. Summary of MD simulations.
Simulated Structure | Simulation Name | Duration | #atoms |
---|---|---|---|
GAL11-ABD1/GCN4-cAD | GAL11-ABD1/GCN4-cAD _aMD_no1 | 1 μs | 45,489 |
(PDB#2LPB-model 1) | GAL11-ABD1/GCN4-cAD _aMD_no2 | 1 μs | 45,489 |
GAL11-ABD1/GCN4-cAD _aMD_no3 | 1 μs | 45,489 | |
GAL11-ABD1/GCN4-cAD _aMD_no4 | 1 μs | 45,489 | |
GCN4-cAD | GCN4_aMD_no1 | 1 μs | 32,551 |
(de novo starting structure) | GCN4_aMD_no2 | 1 μs | 32,551 |
GCN4_aMD_no3 | 1 μs | 32,551 | |
GCN4_aMD_no4 | 1 μs | 32,551 | |
GAL11-ABD1 | GAL11-ABD1 _aMD_no1 | 1 μs | 42,355 |
(PDB#2LPB; without GCN4) | GAL11-ABD1 _aMD_no2 | 1 μs | 42,355 |
GAL11-ABD1 _aMD_no3 | 1 μs | 42,355 | |
GAL11-ABD1 _aMD_no4 | 1 μs | 42,355 | |
cAD-like07 | cAD-like07_aMD_no1 | 1 μs | 28,131 |
(de novo starting structure) | |||
cAD-like96 | cAD-like96_aMD_no1 | 1 μs | 28,424 |
(de novo starting structure) | |||
GAL11-ABD1/GCN4-cADlike96 | GAL11-ABD1/cAD-like96_aMD_no1 | 1 μs | 46,862 |
(PDB#2LPB; in silico | GAL11-ABD1/cAD-like96_aMD_no2 | 1 μs | 46,862 |
mutagenized GCN4) | GAL11-ABD1/cAD-like96_aMD_no3 | 1 μs | 46,862 |
GAL11-ABD1/cAD-like96_aMD_no4 | 1 μs | 46,862 |
We were initially curious to see whether the aMD simulations would recreate the different binding states previously proposed by Brzovic et al. [11]. We used distance measurements between GCN4-W120 or GCN4-F124 relative to GAL11/ABD1-A126 (which forms the floor of Pocket #1; Fig 2B) to monitor pocket occupancy. The measurements show that the two key hydrophobic residues, in full agreement with the NMR-based models [11], behave in a switch-like manner and bind to GAL11/ABD1 in the three major binding states via a series of intermediate conformations (Fig 3). At various stages, the GAL11-ABD1 pocket is occupied by the sidechains of either GCN-4/W120 (Fig 3A) or F124 (Fig 3C), respectively. On several occasions, we observe a double occupancy (Fig 3B). A molecular movie illustrates a full time course of the dynamic change, including the changeover between W120 and F124 (S1 Movie). In addition to pocket occupancy state, the NMR-based models also postulate that the GCN4-cAD helical portion takes up several different orientations relative to GAL11-ABD1. Angular measurements of vectors characterizing the GCN4-cAD helix relative to GAL11-ABD1 α-helix 4 (Fig 4) correspond to orientations directly comparable to the previously described ones, but also suggest the presence of additional states representing transitional conformations. Because W120 and F124 act as pivot points in a comparable manner, the various pocket occupancy states and helix orientations observed do not appear to show any significant correlation.
Because we started the aMD simulations from just one of the 13 different models proposed previously, we wondered to what extent he observed motions of the GCN4-cAD on GAL11-ABD1 reflected the conformational space defined by the twelve remaining models. In principle, any extensive simulation of a single member of a family of structural conformers should reveal conformations that encompass the conformations of the majority of the other family members, as these structures are expected to interconvert freely during simulation. Plots of the phase space of the combined trajectories along three coordinates (helical angle; distances of the two key hydrophobic residues (W120 and F124) relative to Pocket#1) demonstrate that approximately 87% of the model coordinates are within highly populated regions (Fig 5). We conclude that the choice of 2LPB-model#1 as the starting structure for all four aMD simulations did not result in an unusually biased sampling of conformational space.
Flexibility and structural adaptability of the cAD thus enables a highly dynamic interplay that accommodates several different combinations of pocket occupancy and helical orientation. This raises intriguing questions regarding the energetics of such a variable interaction. We calculated the molecular mechanics per-residue decomposition of free binding energy (ΔGBinding) measurements along the trajectories in one-nanosecond intervals using the Molecular Mechanics Generalized Bourne Surface Area (MM-GBSA) method [35]. The van der Waals decomposition data of the GCN4-cAD confirms the dominating contribution of GCN4-W120 and F124 in binding to GAL11-ABD1 (Fig 6A; electrostatic interactions play a mostly invariant role in the GAL11-ABD1/GCN4 cAD interaction: S1 Fig). Despite the major conformational changes of the GCN4 cAD relative to the coactivator surface, the energetic contributions of GCN4-W120 and F124 interactions remain relatively steady throughout all four independent simulations. A more detailed study of these interactions reveal the varying role of at least five residues within the GAL11-ABD1 Pocket #1 in mediating these contacts (Fig 7). Two hydrophobic residues, GAL11-M173 or Y220, interact with GCN4-F124 alternatively, depending on whether the F124 sidechain is located within Pocket #1 (Figs 3A and 7A), or has moved out of it and is replaced by GCN4-W120 (Figs 3C, 7B and 7C). While GCN4-F124 occupies Pocket #1, W120 makes favorable hydrophobic contacts with the sidechains of GAL11-K217 and K221 (Fig 7A–7C). The formation of alternative—but energetically equivalent—contacts thus underpins several alternative modes of binding that are conformationally quite different from each other. Another conserved residue, GAL4-L123 (Fig 2A), provides notable van der Waals contributions, mostly in conjunction with F124, and occasionally substitutes for F124 in a reversible manner (particularly obvious in simulation GAL11-ABD1/GCN4-cAD _aMD_no1; Fig 6A).
This analysis also identifies several additional residues (GCN4-M107, F108, Y110, L113, I128, and V130) as making significant additions to ΔGBinding, but in a distinctly non-systematic manner. The residues are nodes in a structurally highly flexible network that facilitates short-lived interactions, but do not a follow recurrent pattern due to substantial and unstable conformational changes in the GCN4-cAD. To exemplify the role of these residues, we investigated the structural interactions of GCN4-M107 in more detail. In simulation GAL11-ABD1/GCN4-cAD _aMD_no2, this residue is seen as providing a substantial van der Waals contribution lasting throughout most of the second half of the simulation (timeframe 1,400–2000 ns aMD; Fig 6A). During this time, GCN4-M107 interacts predominantly with two leucine residues, L169 and L227, which are located on two different α-helices of GAL11 (helix 1 and 4, respectively), but are spatially close to each other and interact with each other via hydrophobic interactions in the folded GAL11 structure. GCN4-M107 interacts with either residue on its own, or even with both leucines by bridging them (Fig 6B and S2 Movie).
Altogether, we interpret these findings the following way: GCN4-W120 /GCN4-F124 anchor GCN4-cAD to the GAL 11 surface, but provide no preferential stabilization of conformation and/or orientation of the cAD relative to the coactivator surface. Additional hydrophobic residues located on GCN4 contribute notable, but temporary contacts (estimated by molecular mechanics measurements to contribute only between -3 to -6 kcal.mol-1 each towards ΔGBinding). The fuzzy interaction thus results from a variable combination between two relatively strong contributors (-8 to -10 kcal.mol-1 each) that are regularly supported by a host of additional minor contributors subjected to continuous change. This molecular interaction pattern predicts that the binding free energy of the GCN4 cAD to GAL11-ABD1 is far from constant and subject to constant fluctuations. The MM-GBSA estimates support the idea of substantial variation in binding affinity (over a three-fold range on the micro/millisecond time scale [S2 Fig]). The half-life of the GCN4-GAL11 interaction is estimated to be in the low millisecond range [11], which supports this interpretation. Although the constant change in affinity may result in a reasonable average affinity (and may include very high affinity states), it will also drop in affinity with statistical regularity to levels that facilitate immediate dissociation. The aMD simulations, by allowing coactivator-AD domains to be monitored over longer timer frames, convey this message much clearer than the limited number of currently existing static snap-shots of models demonstrating alternative conformations [11].
Secondary structures of ADs in presence and absence of coactivator
Having examined the molecular dynamics of the GAL11-ABD1/GCN4-cAD fuzzy complex, we turned our attention to the intrinsic structural properties of these two interaction partners. Specifically, we wanted to investigate to what extent binding of GCN4 affects the structure of GAL11 and, more importantly, how extensively the GCN4-cAD is structured on its own. We started by monitoring the formation/maintenance of secondary structure of the GAL11-ABD1/GCN4-cAD complex simulations described above. The analysis showed formation of a stable helical structure ("Helix #1") typically encompassing GCN4 residues S117 to D125 (Fig 8A). Simulation GAL11-ABD1/GCN4-cAD _aMD_no1 exceptionally shows a mixture of 310-helix and α-helix during the first 750 ns of simulation before settling into a α-helical pattern, whereas all other three simulations (including the final stages of GAL11-ABD1/GCN4-cAD _aMD_no1) display extensive and stable α-helices throughout the entire time course. These results are essentially in agreement with the 13 different models presented in PDB#2LPB [11], although the simulations suggest that the C-terminal border of Helix#1 routinely extends one residue further than previously proposed to include position D125. In addition to Helix #1, the occasional presence of another N-terminally located structure ("Helix #2") is evident. Helix #2 is less stable in GAL11-ABD1/GCN4-cAD _aMD_no1, no2 and no3 and either takes up a partial 310-helix conformation (GAL11-ABD1/GCN4-cAD _aMD_no1 and no3), or disappears eventually. In GAL11-ABD1/GCN4-cAD _aMD_no4, Helix #1 and #2 fuse into a single contiguous α-helix (spanning from M107 to N126 at its borders) that remains intact until the end of the simulation. Helix #2 includes the hydrophobic residues (GCN4-M107, F108, Y110, L113, I128, and V130) identified above as making occasional energetically favorable contributions to ΔGBinding.
We next asked to what extent the observed α-helical propensity of the GCN4-cAD was encoded within its primary structure. ADs are intrinsically disordered and are often thought to only adopt significant secondary structure upon binding to a coactivator target [11,29]. This model is, however, controversial. Whereas some NMR and circular dichroism studies of several isolated ADs claim an absence of significant secondary structure elements [11,36], other investigations suggest the presence of a significant fraction of transient α-helices [37–39] or β-sheets [40] in the unbound state of various ADs. In order to eliminate any structural "memory" from the starting structure, we constructed a model of the GCN4-cAD as a completely unfolded polypeptide from its primary amino acid sequence. After aMD simulation, any conformational changes—including the formation of secondary structure elements -will therefore solely be determined by the intrinsic properties of the polypeptide sequence itself. After a short implicit solvation minimization step to fold up the structure in a more compact random coil, we set up four independent microsecond aMD simulations under identical conditions as used previously for the aMD simulations of the GAL11-GCN4 complex (Table 1). Such simulations sample folding pathways and, especially relevant for disordered structures, reveal shifts in equilibria between short-lived conformations. The formation of α-helices occurs on the nanosecond-microsecond time scale [41] and is therefore well within the scope of the chosen simulation parameters.
An investigation of secondary structure elements formed in the GCN4-cAD aMD simulations reveals an unexpectedly high degree of spontaneously formed α-helices (Fig 8B). The formation of α-helices is especially favored in the central portion of the GCN4-cAD that contains the bulky hydrophobic residues that have been experimentally identified as critically important for the transactivation function, as well as making significant contributions to the free energy of binding to coactivators (Fig 6A). Although traces of β-sheet can be seen in GCN4_aMD_no4 (Fig 8B), these structures appear short-lived and do not support the conclusions reached by a previous study [40]. We conclude that the GCN4-cAD has intrinsic potential to form α-helical elements, even in absence of a coactivator, making it likely that these spontaneously preformed secondary structure elements represent key structural features required for coactivator interaction and binding specificity. The absence of substantial random coil elements in the region surrounding GCN4-W120, L123 and F124 allows us to postulate further that the cAD engages most likely with the coactivator with the necessary α-helices already locally preformed prior to first contact.
Although expected to have a less substantial effect, we also attempted to quantitate the effect of AD binding on the conformation of a coactivator. Consequently, we set up four independent one-microsecond aMD simulations of the GAL11-ABD1 in the absence of the GCN4-cAD (Table 1). A comparison of root mean square fluctuation (RMSF) measurements in simulations GAL11-ABD1 _aMD_no1 to no3 in the bound and unbound state shows that ABD1 becomes structurally more restricted upon cAD binding. Especially ABD1 residues involved in either pocket formation or binding of the cAD helix become less mobile (S3 Fig). GAL11-ABD1 _aMD_no4 undergoes a more substantial conformational change that includes a concerted movement of helices 1 and 2 and alters the ABD1 interaction surface. The original pocket for binding GCN4-W120 or F124 is no longer present, suggesting that this conformation of ABD1 may not be able to bind GCN4. The altered surface, however, develops new pockets, that may potentially offer alternative binding sites for other activators.
Construction principles of a highly potent AD
Up to now, we have focused our attention on a naturally occurring AD/coactivator complex that has been shown to be physiologically relevant [11,21]. Extensive mutagenesis experiments have revealed the existence of a cryptic AD within the primary amino acid sequence of GCN4. This "cAD-like" activation domain, encompassing GCN4 residues 81–100, partially matches the structural criteria for an AD, but does not display a detectable transactivation potential (Table 2; [42]).
Table 2. Sequences of cAD-like motifs used in MD simulations.
GCN481–100 (original sequence) | MKTVLPIPELDDAVVESFFSSGSGSGS |
cAD-like07 | aceMKTVLPIPELDDAVWESLFSSGSGSGSnme |
cAD-like96 | aceMKTVLPIPELDDAWWWWLFWSGSGSGSnme |
The sequences correspond to the motifs tested experimentally under in vivo conditions [42]. The key hydrophobic residues required for AD function are shown in bold, and the linker sequence used to link the AD to the GCN4 DNA-binding domain in italics. For MD simulations, these structures were created as unfolded polypeptide chains with charge-neutralized termini (ace and nme for N- and C-termini, respectively).
Substitutions of hydrophobic residues within the cAD-like motif improve its activity and make it as potent as the GCN4-cAD. This modified cAD-like domain has proven an excellent testing ground for studying the transactivation potential of an array of directly comparable structures created by high-throughput site-directed mutagenesis [42]. We included two examples in our analyses and will refer from here onwards to the members of this collection as "cAD-like xx" (where xx stands for the transactivation potential that the sequence confers). For example, cAD-like07 refers to a 'weak' cAD-like variant that is capable of stimulating ARG3 induction ~7-fold (which is equivalent to the activation potential of the GCN4-cAD). On the other hand, cAD-like96 identifies a strong transactivator motif capable of stimulating ARG3 induction ~96-fold [42]. The system thus offers ideal conditions for further elucidation of the functional necessities of an exceptionally potent AD and its interactions with coactivators.
We set up in silico folding aMD simulations for the cAD-like07 and cAD-like96 motifs under identical conditions used earlier for the GCN4-cAD (Table 1). Taking into account that we previously observed significant α-helical propensity in the isolated and de novo folded GCN4-cAD (Fig 8B), one of the first questions we asked was whether such a propensity could also be detected in the cAD-like variants (while embedded within the same primary sequence context as used in the experimental work). The only ordered secondary structures formed under these conditions are α-helices, albeit with a noticeable difference in effectiveness. In the case of cAD-like07, contiguous α-helical regions are present fleetingly throughout most of the simulation period, but these fluctuate considerably in length and position relative to the underlying primary amino acid sequence (Fig 9A). In some instances all α-helical structures were lost, but restored in a fully reversible manner shortly afterwards. We conclude that cAD-like07 displays a notable tendency towards α-helical conformations, but these structures undergo a constant equilibrium between conformations of different α-helical content and are consequently unable to adopt a higher-order structure with a degree of stability exceeding the nanosecond range. In contrast, within the first 200 ns of aMD simulation the cAD-like96 variant adopts an extensive α-helical conformation that stably propagates afterwards and encompasses the three key hydrophobic residues (W94, L97 and F98) that mediate coactivator contact. The substitutions distinguishing cAD-like96 from cAD-like07 are four tryptophan residues (Table 2; W93, 95, 96 and 99; Fig 9B). Tryptophan is the strongest known helix conformer in short helices [43] and therefore the extensive helicity in cAD-like96 observed in the aMD simulations is in excellent agreement with expectations.
The detected differences in secondary structure content and stability between cAD-like07 and cAD-like96 strongly suggest that pronounced α-helical propensity constitutes a key factor in determining the transactivation potential of an AD, even in absence of additional conformational changes induced by binding to the coactivator surface. We tested this concept further by investigating whether there was a correlation between α-helical propensity predicted by standard bioinformatics tools and the observed effectiveness in mediating transactivation in vivo. A plot of predicted α-helical propensity [44] of 24 different cAD-like variants [42] against experimentally measured transcriptional simulation provides previously undocumented evidence for a strong correlation between these two variables (Fig 9C). The results show that this approach allows a direct prediction of transactivation potentials of cAD-like variants with 95% confidence using only primary amino acid sequence information.
The extensive, stable α-helicity, combined with the presence of additional bulky hydrophobic residues next and between residues W94, L97 and F98 raises some intriguing questions regarding the interaction of cAD-like96 with the GAL11-ABD1 coactivator module. As there is no structural data available for this system, we created a starting structure by in silico substitutions of the orthologous cAD residues in the GAL11-ABD1/GCN4-cAD NMR model (PDB#2LPB-model 1). Subsequently, four independent aMD simulations were carried out using the conditions described earlier. Just as expected from the results of the simulations of cAD-like96 on its own (Fig 9B), the cAD-like96 adopts a continuously stable α-helical conformation that includes positions W94, L97 and F98 throughout all four aMD simulations (Fig 9D).
Because the cAD motif is surrounded by several additional tryptophan residues, Warfield et al. suggested that these tryptophans might be able to occupy the pocket in a similar manner to the original cAD motif key residues and contribute to increased binding efficiency [42]. A molecular mechanics decomposition of the van der Waals forces of the aMD trajectories of GAL11-ABD1/cAD-like96 (Fig 10) reveals interesting similarities and differences to the previously shown GAL11-ABD1/GCN4-cAD results (Fig 6A). First, the main ΔGBinding contributions are once again centered on two regions (W94 and L97/F98), in addition to N-terminal contacts (L84, P87, L89) that provide fleeting contributions reminiscent of the pattern found for the GCN4-cAD (Fig 6A and 6B; note that these additional contacts, compared to GCN4-cAD [Fig 8A], are not in an α-helical conformation [Fig 9D]). It is noticeable, however, that the main contributors in cAD-like96 play a less distinct, broader role; in GAL11-ABD1/cAD-like96_aMD_no1 and no4, L97 makes a major contribution, but is distinctly supported by the flanking residues W96 and F98. Such a more diffuse energetic contribution is also observable near the W94 position. In GAL11-ABD1/cAD-like96_aMD_no3, W95 makes the dominant van der Waals contribution instead of W94 (a state that is briefly and reversibly explored in aMD2 at ~1,900 nanoseconds; Fig 10). A situation where W94 and W95 simultaneously occupy the ABD-1 pocket is not observed. As the helix would have to be distorted for these two residues to gain access to the pocket, it is unlikely for this confirmation to occur. In GAL11-ABD1/cAD-like96_aMD_no4, both W93 and W94 contribute apparently equally and create a stable configuration that remains essentially unchanged throughout one microsecond of aMD simulation conditions.
After identification of the possible binding states, angular measurements of the helical domain of cAD-like 96 and ABD1 α-helix 4 were performed to analyse the relative orientations of these structures relative to each other. The measurements show that the main orientations observed for the cAD-like 96 helix range typically between ~60° and ~120° (Fig 11). In comparison, the GCN4-cAD helix adopts a significantly wider range of orientations (Fig 4). Consequently, even though rotations are observable for both GAL11-ABD1/GCN4-cAD and GAL11-ABD1/cAD-like96 simulations, the maximal rotation performed by the helical cAD-like 96 domain is only 60° compared to ~180° observed for the GCN4-cAD helix. The orientations also last significantly longer and do not follow the frequent and abrupt changes observed for GCN4-cAD. We conclude that overall the binding of cAD-like96 to GAL11-ABD1 is conformationally significantly more restricted and therefore reduced in "fuzziness". The increased degree of α-helicity, redundancy of hydrophobic contacts and reduced conformational freedom documented in the aMD simulations provide a quantitative base for understanding the high transactivation potential displayed by cAD-like96.
Discussion
Synthetic Biology aims at a quantitative knowledge of molecular structure/function relationships in order to provide the tools required for reshaping the properties of living organisms in a preconceived manner. A high-level of understanding of the processes underlying gene expression mechanisms will, without doubt, be required to achieve such a goal [45]. While decades of laboratory-based investigations have identified the key players of the transcriptional machinery and revealed how they network with each other, our insights are still mostly limited to a qualitative understanding at this stage.
Molecular dynamics simulations offer the ability to model accurately the dynamic interplay of proteins with high precision. While complex systems consisting of tens- to hundreds of thousands of atoms can be studied effectively with currently available high-performance computing hardware, simulations lasting for microseconds and beyond are still challenging. Enhanced sampling methods, such as accelerated molecular dynamics (aMD) effectively extend the range by two- or three orders of magnitude into the millisecond range [33,34]. This state-of-the art technology opens the door towards a better understanding of functional interactions involving the formation of "fuzzy" complex involving intrinsically disordered molecular partners. Computational approaches are of particular relevance in an area that defies conventional experimental approaches due to the high degree of structural flexibility, the rich diversity and the short duration (half-lives typically in the millisecond range) of such interactions. Enhanced sampling method MD simulations are ideally suited for such situations and offer genuine opportunities for gaining quantitative insights into this highly relevant, but still poorly understood field.
Activation domains (ADs) are intrinsically disordered structures that have mostly defied a detailed understanding of structure/function relationships. In this study, we employed molecular dynamics simulations to model an extensively studied experimental system, the interaction between the activator GCN4 with the coactivator GAL11. The experimental identification of in vivo coactivator targets [21], availability of high-quality structural models [11], combined with an extensive collection of functionally characterized mutants and artificial AD variants [11,42], makes GCN4 an ideal model system for a more thorough understanding of the fundamental aspects of transcriptional activation in eukaryotic systems. The GAL11-ABD1/GCN4-cAD complex was simulated stably in multiple aMD simulations. The molecular behavior observed likely represents motions lasting hundreds of microseconds due to the acceleration parameters employed [34]. None of these simulations has yet resulted in dissociation of GCN4-cAD from GAL11-ABD1. Estimates of ΔGBinding revealed, however, substantial fluctuations, that highlight the intrinsic instability of the complex and support the idea that the half-life of this complex is only in the millisecond range [11]. Longer aMD simulations under the conditions described, possibly employing reduced affinity mutants (such as GCN4-W120A or F124A; [11]), may eventually include a complete dissociation event. Similarly, long aMD simulations may reveal a real-time association of a free AD to a coactivator at a level of detail comparable to the binding of drugs to protein target sites [46].
Simulations of complexes between GAL11-ABD1 and GCN4-cAD or cADlike96 allowed us to observe a rich pattern of conformational changes that have thus far not been documented through any other experimental or theoretical approach. Using a single model as a starting structure (model 1) we confirmed essentially all aspects of the 13 different models included in the PDB structure (PDB#2LPB) and obtained an representative selection of intermediate structures (Fig 5) that can be viewed as a molecular movie (S1 Movie). The extensive collection of structures enabled us to gain insights into the constant and variable aspects of orientation of ADs relative to the GAL11-ABD1 (Figs 3, 4 and 11), changes in secondary structures (Figs 8 and 9), and key energetic contributions stabilizing the various conformers at different time points (Figs 6 and 11). Mutagenesis studies identified the cAD motif as W, L and F with spacing of i, i+3 and i+4. MM-GBSA calculations reinforced this result by demonstrating that the W and F residues are in both the cAD and cAD-like96 two major contributing residues towards binding ΔGBinding.
We also observed that the GCN4-cAD adopts α-helicity during aMD simulation in the complete absence of any interaction partner. The aMD simulations, starting from a polypeptide chain devoid of any secondary structure, demonstrate the formation of extensive α-helical conformations surrounding and including the conserved hydrophobic residues in a highly reproducible manner (Figs 8B, 9A and 9B). Although the presence and extent of these helices fluctuate on the hundreds of nanoseconds or microsecond time scale, the key hydrophobic residues are frequently (~65% for cAD-like07), often (~80% for GCN4-cAD) or essentially constantly (cAD-like96) arranged within an α-helical conformation, which could facilitate interactions with the coactivator surface, especially during the recruitment stage. The currently most widely—although not universally—accepted model ([37–39,47]) is based on the concept that ADs are extensively unstructured and only take up significant proportion of α-helical structures after interactions with the coactivator surface. These concepts are predominantly based on NMR-measurements showing only narrowly dispersed resonances in the 1H-N-dimension of various ADs, suggesting an absence of significant secondary structure elements. NMR-studies of poorly structured small domains remain, however, challenging. The energy barrier between disordered state and local α-helix conformations can be as low as 1.0–1.5 kcal/mol [48]. Differences between the intracellular environment and experimental conditions (low pH/low ionic concentrations, absence of divalent ions, effects of terminal flexibility, absence of local hydrophobic packing [49]) are likely to influence the formation and dynamics of small, unstable secondary structure elements. Apart from such experimental parameters, overlapping resonance effects in the spectra of flexible peptides, in conjunction with time-averaging phenomena can result in an underestimation of local structures in NMR experiments [50,51]. In the present case it is, however, likely that minor inaccuracies in the simulation parameters used in this study resulted in an overestimation of the stability of local α-helices. Although the Amber14SB force-field is generally not prone to overstabilize α-helical structures (in contrast to Amber ff03, CHARMM27 [52,53]), the results obtained here are in conflict with experimental data that show only 8–10% helical character in the free GCN4-cAD [11]. While the simulations correctly identify the regions displaying high α-helical propensity within the primary sequence of GCN4-cAD, it is evident that no conclusions should be drawn—as with aMD simulations in general—regarding the kinetic aspects of the data. Our conclusions regarding a recently published experimental data set based on the extensive mutagenesis of the cAD-like domain are, however, based on a different type of analysis and therefore not affected by such kinetic considerations [42]. Although not pointed out by the authors of this study, we observed a distinct correlation (r² = 0.89 for the first linear regression) between the published transactivation potential of 24 cAD-like variants and their theoretically predicted α-helicity (Fig 9C). This difference in α-helicity also emerges very clearly form the simulations of the three different ADs (GCN4-cAD, cAD-like07 and cAD-like-96) described here, both as free structures folded de novo, as well as complexed with GAL11-ABD1 (Figs 8 and 9). The α-helicity of a cAD-like variant can be determined using a quantitative helix-coil transition model [44], so that it should be relatively straightforward to design new cAD-like variants with predictable transactivation potentials.
The concept, that ADs either contain—or have a strong natural conformational bias towards taking up transient secondary structure elements—is not new. Experimental investigations of several other ADs has shown that they contain a significant fraction of transient α-helices in their unbound state (p53 [37]; pKID [38]; ACTR [39]; ERM [47]). We therefore conclude that eukaryotic ADs of widely different origin and specificity may contain pre-structured α-helical domains with low energy barriers of folding detectable by aMD de novo folding methods. Although sequence motifs with a high α-helical propensity can be clearly identified in unbound ADs in simulations, we note that there are distinct changes to the length and positions of these helices after binding to the coactivator. In the GCN4-cAD (compare Fig 8A with 8B), as well as in the cAD-like96 (compare Fig 9B and 9D), the boundaries tend to become more strictly defined (especially at the N-terminus) once bound to GAL11-ABD1, suggesting that coactivator-binding imposes a quantifiable degree of structural ordering on the AD. In addition, in the case of GCN4-cAD we detect reproducibly the formation of a previously unrecognized N-terminal secondary helix (Helix#2; Fig 8A) that spatially organizes up to four large hydrophobic residues (GCN4-M107, F108, Y110, and L113) into a structure that makes energetically significant van der Waals interactions contributions towards coactivator binding (Fig 6A and 6B and S2 Movie).
The degree of structural orderliness affects multiple key parameters [54]. A tight coupling between folding and binding may enhance equilibrium distinctions for interactions with different targets. A high degree of preformed structure may prove kinetically (dis)advantageous for binding to various target sites [55], so that the α-helical content of an unbound AD is likely to exert a significant influence on the rate of binding to various available interaction partners, even before more stable contacts are established through subsequent refolding/realigning events. The variable presence of α-helical modules (and location relative to the underlying primary sequence containing the conserved hydrophobic residues) may therefore encode a high degree of selectivity regarding to the binding of the various components of the transcriptional machinery [21].
Outlook
The molecular mechanisms that GSTFs employ to regulate the expression of their genes are still poorly understood. There is some evidence that the binding of activation domains to basal transcription factors and coactivator complexes induces major conformational changes that could allosterically transmit signals to other components of the transcriptional machinery [56–58]. This hypothesis suggests that transient interactions of activation domains with their targets could trigger the transition between long-lived alternative coactivator conformations. Such mechanisms are, however, exceedingly difficult to study using biochemical or computational tools. In the study reported here, we have found no evidence for any significant conformational change induced in the GAL11-ABD1 structure as a direct consequence of GCN4-AD binding. Even if such changes were observed, it would still be unclear whether (and how) such an alternative conformation could be allosterically transmitted to the remainder of the GAL11 subunit (and beyond) because currently our structural knowledge of GAL11 is restricted to the ABD1 domain. An alternative—and not necessarily conflicting—view is that ADs exert most of their functions through stabilizing the assembly or position of other functional components of the transcriptional machinery, such as the basal transcriptional initiation complex. Eukaryotic promoters are potentially regulated through dozens of GSTFs bound at nearby enhancer modules, so that a multitude of energetically weak, short-lived interactions between ADs and a variety of targets could provide a significant stabilization effect through synergistic action. The short interaction half-lives and multi-target specificity of the structurally disordered ADs may under such circumstances provide the flexibility to respond to rapidly changing regulatory requirements, or provide the possibility of some components, such as RNA polymerases to "break free" of these complexes after transcription initiation. Our work documents a positive correlation between α-helicity and transactivation potential, suggesting that the overall effectiveness of AD-binding to their targets can be directly controlled through changes in α-helical propensity during evolution. Such changes may, however, have to be counterbalanced with a need for a degree of intrinsic structural disorder to sustain the ability of ADs to interact with multiple target sites. Modelling approaches, including additional AD-coactivator targets, or studying the effect of AD-interactions with larger complexes, offer great opportunities to gain further insights into the dynamical processes of coactivator-activator interactions and open numerous theoretical and applied avenues for the future. Such strategies will most likely be part of synthetic biology approaches that aim at designing artificial transcription factors with a precisely controlled range of specificity and transactivation potential in eukaryotic systems.
Materials and Methods
System preparation
All structures were prepared to the same specifications to maximize comparability between the simulations. For the ABD1/cAD complex simulation both polypeptide chains of Model 1 of the GCN4-GAL11 complex (PDB#2LPB) were capped (acetyl and N-methylamide groups added to the N- and C-termini, respectively) using Yasara Structure [59]. For the ABD1/cAD-like96 simulation the GCN4-cAD structure (PDB 2LPB-Model 1) was mutagenized in silico with Yasara Structure [59] to create the cAD-like96 sequence. The coordinates were prepared for simulation in LEaP (AmberTools 14/15) with the Amber 14SB forcefield [60], neutralized and solvated in a TIP3P [61] solvent box with a minimum distance of 15 Å between solute and border. The final ionic concentration within the water box was adjusted to a final concentration of 150 mM NaCl. Capped structures of the GCN4-cAD, GCN4 cAD-like07 and cAD-like96 were built de novo from their primary amino acid in LEaP, and prefolded using 10 ns of GB implicit MD before solvating them under the same conditions described above.
Molecular dynamics simulations
The solvated models were minimized, heated to 300K and relaxed before performing a conventional MD (cMD) production run for 100 ns at a target pressure of one atmosphere to obtain values for the total potential and dihedral energy values (NPT). Simulations were carried out using the pmemd.cuda (Amber 14) applying the hybrid single/double/fixed precision model (SPFP) GPU support [62,63] using 2 fs time steps with a 10 Å cut off under control of a Langevin thermostat [64] and the SHAKE algorithm to restrain hydrogens [65]. Long-range electrostatic interactions were calculated using the Particle Mesh Ewald approximation [66]. The average total potential energies and the average dihedral energies were obtained from the cMD simulations and utilised to calculate the thresholds for dual boost aMD using an α-value of 0.2. All aMD simulations were performed with a target temperature of 300 Kelvin, and a target pressure of one atmosphere (101.325 kPa). Temperature was controlled by the Andersen temperature-coupling scheme and the pressure was controlled by the isotropic position scaling protocol applied in AMBER. Four independent 1000 ns aMD simulations were run for each structure. Details of simulations performed are summarized in Table 1. The structural models and trajectory data are available as supporting data (S1 and S2 Data Sets)
Structure and trajectory analysis
Mapping of interaction hotspots was performed using the FTMAP algorithm (http://ftmap.bu.edu [67]). Trajectory visualisation, secondary structure analysis (based on STRIDE; [68], imaging and file conversion was performed with VMD v.1.9.2 [69]. CPPTRAJ from AmberTools 15 was utilised for distance and angle measurement [70]. Bio3D was implemented for RMSD, RMSF and principal component analysis [71,72]. Visualisation of the analytical data was performed with CRAN [73]. The MM-GBSA estimation of binding free energies was performed employing the Amber forcefield ff99 [74] using the MMPBSA.py script [75]. Residue-specific decomposition was based on adding the 1–4 non-bonded interaction energies (1–4 EEL and 1–4 VDW) to the internal potential terms.
α-helical propensity predictions with Agadir
The cAD sequences were acetylated at the N-terminus and amidated at the C-terminus before predicting their α-helical properties at the residue level at pH 7.0, 150 mM NaCl and 300K [44].
Supporting Information
Acknowledgments
We would like to thank Gloria Rudenko and Finn Werner for feedback on the manuscript.
Data Availability
All relevant data are within the paper and its Supporting Information files.
Funding Statement
This work was supported by the Medical Research Council UK (www.mrc.ac.uk; grant number G1100057). The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
References
- 1.Lee TI, Young RA (2013) Transcriptional Regulation and Its Misregulation in Disease. Cell 152: 1237–1251. 10.1016/j.cell.2013.02.014 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2.Hope IA, Mahadevan S, Struhl K (1988) Structural and Functional-Characterization of the Short Acidic Transcriptional Activation Region of Yeast Gcn4-Protein. Nature 333: 635–640. [DOI] [PubMed] [Google Scholar]
- 3.Malik S, Roeder RG (2010) The metazoan Mediator co-activator complex as an integrative hub for transcriptional regulation. Nat Rev Genet 11: 761–772. 10.1038/nrg2901 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.Hahn S, Young ET (2011) Transcriptional regulation in Saccharomyces cerevisiae: transcription factor regulation and function, mechanisms of initiation, and roles of activators and coactivators. Genetics 189: 705–736. 10.1534/genetics.111.127019 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Taatjes DJ, Marr MT, Tjian R (2004) Regulatory diversity among metazoan co-activator complexes. Nat Rev Mol Cell Biol 5: 403–410. [DOI] [PubMed] [Google Scholar]
- 6.Tsai CJ, Nussinov R (2011) Gene-specific transcription activation via long-range allosteric shape-shifting. Biochem J 439: 15–25. 10.1042/BJ20110972 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Grunberg S, Hahn S (2013) Structural insights into transcription initiation by RNA polymerase II. Trends Biochem Sci 38: 603–611. 10.1016/j.tibs.2013.09.002 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Ansari SA, Morse RH (2013) Mechanisms of Mediator complex action in transcriptional activation. Cell Mol Life Sci 70: 2743–2756. 10.1007/s00018-013-1265-9 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Myers LC, Gustafsson CM, Bushnell DA, Lui M, Erdjument-Bromage H, et al. (1998) The Med proteins of yeast and their function through the RNA polymerase II carboxy-terminal domain. Genes & Development 12: 45–54. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Bjorklund S, Gustafsson CM (2005) The yeast Mediator complex and its regulation. Trends Biochem Sci 30: 240–244. [DOI] [PubMed] [Google Scholar]
- 11.Brzovic PS, Heikaus CC, Kisselev L, Vernon R, Herbig E, et al. (2011) The Acidic Transcription Activator Gcn4 Binds the Mediator Subunit Gal11/Med15 Using a Simple Protein Interface Forming a Fuzzy Complex. Molecular Cell 44: 942–953. 10.1016/j.molcel.2011.11.008 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Vojnic E, Mourao A, Seizl M, Simon B, Wenzeck L, et al. (2011) Structure and VP16 binding of the Mediator Med25 activator interaction domain. Nature Structural & Molecular Biology 18: 404–U429. [DOI] [PubMed] [Google Scholar]
- 13.Milbradt AG, Kulkarni M, Yi TF, Takeuchi K, Sun ZYJ, et al. (2011) Structure of the VP16 transactivator target in the Mediator. Nature Structural & Molecular Biology 18: 410–U436. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Titz B, Thomas S, Rajagopala SV, Chiba T, Ito T, et al. (2006) Transcriptional activators in yeast. Nucleic Acids Res 34: 955–967. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Ma J, Ptashne M (1987) A new class of yeast transcriptional activators. Cell 51: 113–119. [DOI] [PubMed] [Google Scholar]
- 16.Triezenberg SJ (1995) Structure and function of transcriptional activation domains. Curr Opin Genet Dev 5: 190–196. [DOI] [PubMed] [Google Scholar]
- 17.Mitchell PJ, Tjian R (1989) Transcriptional regulation in mammalian cells by sequence-specific DNA binding proteins. Science 245: 371–378. [DOI] [PubMed] [Google Scholar]
- 18.Sigler PB (1988) Transcriptional activation. Acid blobs and negative noodles. Nature 333: 210–212. [DOI] [PubMed] [Google Scholar]
- 19.Cho HS, Liu CW, Damberger FF, Pelton JG, Nelson HC, et al. (1996) Yeast heat shock transcription factor N-terminal activation domains are unstructured as probed by heteronuclear NMR spectroscopy. Protein Sci 5: 262–269. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20.Hahn S (1993) Structure(?) and function of acidic transcription activators. Cell 72: 481–483. [DOI] [PubMed] [Google Scholar]
- 21.Herbig E, Warfield L, Fish L, Fishburn J, Knutson BA, et al. (2010) Mechanism of Mediator recruitment by tandem Gcn4 activation domains and three Gal11 activator-binding domains. Mol Cell Biol 30: 2376–2390. 10.1128/MCB.01046-09 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.Green MR (2005) Eukaryotic transcription activation: right on target. Mol Cell 18: 399–402. [DOI] [PubMed] [Google Scholar]
- 23.Langlois C, Mas C, Di Lello P, Jenkins LMM, Legault P, et al. (2008) NMR structure of the complex between the Tfb1 subunit of TFIIH and the activation domain of VP16: Structural similarities between VP16 and p53. Journal of the American Chemical Society 130: 10596–10604. 10.1021/ja800975h [DOI] [PubMed] [Google Scholar]
- 24.Uesugi M, Nyanguile O, Lu H, Levine AJ, Verdine GL (1997) Induced alpha helix in the VP16 activation domain upon binding to a human TAF. Science 277: 1310–1313. [DOI] [PubMed] [Google Scholar]
- 25.Razeto A, Ramakrishnan V, Litterst CM, Giller K, Griesinger C, et al. (2004) Structure of the NCoA-1/SRC-1 PAS-B domain bound to the LXXLL motif of the STAT6 transactivation domain. Journal of Molecular Biology 336: 319–329. [DOI] [PubMed] [Google Scholar]
- 26.Kussie PH, Gorina S, Marechal V, Elenbaas B, Moreau J, et al. (1996) Structure of the MDM2 oncoprotein bound to the p53 tumor suppressor transactivation domain. Science 274: 948–953. [DOI] [PubMed] [Google Scholar]
- 27.Radhakrishnan I, PerezAlvarado GC, Parker D, Dyson HJ, Montminy MR, et al. (1997) Solution structure of the KIX domain of CBP bound to the transactivation domain of CREB: A model for activator:Coactivator interactions. Cell 91: 741–752. [DOI] [PubMed] [Google Scholar]
- 28.Cress WD, Triezenberg SJ (1991) Critical structural elements of the VP16 transcriptional activation domain. Science 251: 87–90. [DOI] [PubMed] [Google Scholar]
- 29.Dyson HJ, Wright PE (2005) Intrinsically unstructured proteins and their functions. Nature Reviews Molecular Cell Biology 6: 197–208. [DOI] [PubMed] [Google Scholar]
- 30.Tompa P, Fuxreiter M (2008) Fuzzy complexes: polymorphism and structural disorder in protein-protein interactions. Trends Biochem Sci 33: 2–8. [DOI] [PubMed] [Google Scholar]
- 31.Fuxreiter M, Tompa P (2012) Fuzzy complexes: a more stochastic view of protein function. Adv Exp Med Biol 725: 1–14. 10.1007/978-1-4614-0659-4_1 [DOI] [PubMed] [Google Scholar]
- 32.Kozakov D, Hall DR, Chuang GY, Cencic R, Brenke R, et al. (2011) Structural conservation of druggable hot spots in protein-protein interfaces. Proc Natl Acad Sci U S A 108: 13528–13533. 10.1073/pnas.1101835108 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33.Hamelberg D, Mongan J, McCammon JA (2004) Accelerated molecular dynamics: A promising and efficient simulation method for biomolecules. Journal of Chemical Physics 120: 11919–11929. [DOI] [PubMed] [Google Scholar]
- 34.Pierce LC, Salomon-Ferrer R, Augusto FdOC, McCammon JA, Walker RC (2012) Routine Access to Millisecond Time Scale Events with Accelerated Molecular Dynamics. J Chem Theory Comput 8: 2997–3002. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 35.Gohlke H, Kiel C, Case DA (2003) Insights into protein-protein binding by binding free energy calculation and free energy decomposition for the Ras-Raf and Ras-RaIGDS complexes. Journal of Molecular Biology 330: 891–913. [DOI] [PubMed] [Google Scholar]
- 36.Huth JR, Bewley CA, Jackson BM, Hinnebusch AG, Clore GM, et al. (1997) Design of an expression system for detecting folded protein domains and mapping macromolecular interactions by NMR. Protein Science 6: 2359–2364. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 37.Lee H, Mok KH, Muhandiram R, Park KH, Suk JE, et al. (2000) Local structural elements in the mostly unstructured transcriptional activation domain of human p53. J Biol Chem 275: 29426–29432. [DOI] [PubMed] [Google Scholar]
- 38.Hua QX, Jia WH, Bullock BP, Habener JF, Weiss MA (1998) Transcriptional activator-coactivator recognition: nascent folding of a kinase-inducible transactivation domain predicts its structure on coactivator binding. Biochemistry 37: 5858–5866. [DOI] [PubMed] [Google Scholar]
- 39.Kjaergaard M, Norholm AB, Hendus-Altenburger R, Pedersen SF, Poulsen FM, et al. (2010) Temperature-dependent structural changes in intrinsically disordered proteins: formation of alpha-helices or loss of polyproline II? Protein Sci 19: 1555–1564. 10.1002/pro.435 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 40.Van Hoy M, Leuther KK, Kodadek T, Johnston SA (1993) The acidic activation domains of the GCN4 and GAL4 proteins are not alpha helical but form beta sheets. Cell 72: 587–594. [DOI] [PubMed] [Google Scholar]
- 41.Fierz B, Reiner A, Kiefhaber T (2009) Local conformational dynamics in alpha-helices measured by fast triplet transfer. Proc Natl Acad Sci U S A 106: 1057–1062. 10.1073/pnas.0808581106 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 42.Warfield L, Tuttle LM, Pacheco D, Klevit RE, Hahn S (2014) A sequence-specific transcription activator motif and powerful synthetic variants that bind Mediator using a fuzzy protein interface. Proc Natl Acad Sci U S A 111: E3506–3513. 10.1073/pnas.1412088111 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 43.Wang J, Feng JA (2003) Exploring the sequence patterns in the alpha-helices of proteins. Protein Eng 16: 799–807. [DOI] [PubMed] [Google Scholar]
- 44.Lacroix E, Viguera AR, Serrano L (1998) Elucidating the folding problem of alpha-helices: local motifs, long-range electrostatics, ionic-strength dependence and prediction of NMR parameters. J Mol Biol 284: 173–191. [DOI] [PubMed] [Google Scholar]
- 45.Guido NJ, Wang X, Adalsteinsson D, McMillen D, Hasty J, et al. (2006) A bottom-up approach to gene regulation. Nature 439: 856–860. [DOI] [PubMed] [Google Scholar]
- 46.Shan YB, Kim ET, Eastwood MP, Dror RO, Seeliger MA, et al. (2011) How Does a Drug Molecule Find Its Target Binding Site? Journal of the American Chemical Society 133: 9181–9183. 10.1021/ja202726y [DOI] [PMC free article] [PubMed] [Google Scholar]
- 47.Landrieu I, Verger A, Baert JL, Rucktooa P, Cantrelle FX, et al. (2015) Characterization of ERM transactivation domain binding to the ACID/PTOV domain of the Mediator subunit MED25. Nucleic Acids Res 43: 7110–7121. 10.1093/nar/gkv650 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 48.Cino EA, Choy WY, Karttunen M (2016) Characterization of the Free State Ensemble of the CoRNR Box Motif by Molecular Dynamics Simulations. J Phys Chem B. [DOI] [PubMed] [Google Scholar]
- 49.Zhang H, Kaneko K, Nguyen JT, Livshits TL, Baldwin MA, et al. (1995) Conformational transitions in peptides containing two putative alpha-helices of the prion protein. J Mol Biol 250: 514–526. [DOI] [PubMed] [Google Scholar]
- 50.Kosol S, Contreras-Martos S, Cedeno C, Tompa P (2013) Structural characterization of intrinsically disordered proteins by NMR spectroscopy. Molecules 18: 10802–10828. 10.3390/molecules180910802 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 51.Szollosi D, Horvath T, Han KH, Dokholyan NV, Tompa P, et al. (2014) Discrete molecular dynamics can predict helical prestructured motifs in disordered proteins. PLoS One 9: e95795 10.1371/journal.pone.0095795 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 52.Lindorff-Larsen K, Maragakis P, Piana S, Eastwood MP, Dror RO, et al. (2012) Systematic validation of protein force fields against experimental data. PLoS One 7: e32131 10.1371/journal.pone.0032131 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 53.Cino EA, Choy WY, Karttunen M (2012) Comparison of Secondary Structure Formation Using 10 Different Force Fields in Microsecond Molecular Dynamics Simulations. J Chem Theory Comput 8: 2725–2740. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 54.Iesmantavicius V, Dogan J, Jemth P, Teilum K, Kjaergaard M (2014) Helical propensity in an intrinsically disordered protein accelerates ligand binding. Angew Chem Int Ed Engl 53: 1548–1551. 10.1002/anie.201307712 [DOI] [PubMed] [Google Scholar]
- 55.Ptashne M (1992) A Genetic Switch: Phage lambda and Higher Organisms. 2nd edition Blackwell Scientific, Cambridge (USA). [Google Scholar]
- 56.Roberts SG, Green MR (1994) Activator-induced conformational change in general transcription factor TFIIB. Nature 371: 717–720. [DOI] [PubMed] [Google Scholar]
- 57.Taatjes DJ, Naar AM, Andel F 3rd, Nogales E, Tjian R (2002) Structure, function, and activator-induced conformations of the CRSP coactivator. Science 295: 1058–1062. [DOI] [PubMed] [Google Scholar]
- 58.Chi T, Carey M (1996) Assembly of the isomerized TFIIA—TFIID—TATA ternary complex is necessary and sufficient for gene activation. Genes Dev 10: 2540–2550. [DOI] [PubMed] [Google Scholar]
- 59.Krieger E, Vriend G (2014) YASARA View—molecular graphics for all devices—from smartphones to workstations. Bioinformatics 30: 2981–2982. 10.1093/bioinformatics/btu426 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 60.Maier JA, Martinez C, Kasavajhala K, Wickstrom L, Hauser KE, et al. (2015) ff14SB: Improving the Accuracy of Protein Side Chain and Backbone Parameters from ff99SB. Journal of Chemical Theory and Computation 11: 3696–3713. 10.1021/acs.jctc.5b00255 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 61.Jorgensen WL, Chandrasekhar J, Madura JD, Impey RW, Klein ML (1983) Comparison of Simple Potential Functions for Simulating Liquid Water. Journal of Chemical Physics 79: 926–935. [Google Scholar]
- 62.Salomon-Ferrer R, Case DA, Walker RC (2013) An overview of the Amber biomolecular simulation package. WIREs Comput Mol Sci 3: 198–210. [Google Scholar]
- 63.Salomon-Ferrer R, Gotz AW, Poole D, Le Grand S, Walker RC (2013) Routine Microsecond Molecular Dynamics Simulations with AMBER on GPUs. 2. Explicit Solvent Particle Mesh Ewald. Journal of Chemical Theory and Computation 9: 3878–3888. 10.1021/ct400314y [DOI] [PubMed] [Google Scholar]
- 64.Zwanzig R (1973) Nonlinear generalized Langevin equations. Journal of Statistical Physics 9: 215–220. [Google Scholar]
- 65.Ryckaert JP, Ciccotti G, Berendsen HJC (1977) Numerical-Integration of Cartesian Equations of Motion of a System with Constraints—Molecular-Dynamics of N-Alkanes. Journal of Computational Physics 23: 327–341. [Google Scholar]
- 66.Essmann U, Perera L, Berkowitz ML, Darden T, Lee H, et al. (1995) A Smooth Particle Mesh Ewald Method. Journal of Chemical Physics 103: 8577–8593. [Google Scholar]
- 67.Brenke R, Kozakov D, Chuang GY, Beglov D, Hall D, et al. (2009) Fragment-based identification of druggable 'hot spots' of proteins using Fourier domain correlation techniques. Bioinformatics 25: 621–627. 10.1093/bioinformatics/btp036 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 68.Frishman D, Argos P (1995) Knowledge-based protein secondary structure assignment. Proteins 23: 566–579. [DOI] [PubMed] [Google Scholar]
- 69.Humphrey W, Dalke A, Schulten K (1996) VMD: visual molecular dynamics. J Mol Graph 14: 33–38, 27–38. [DOI] [PubMed] [Google Scholar]
- 70.Roe DR, Cheatham Iii TE (2013) PTRAJ and CPPTRAJ: software for processing and analysis of molecular dynamics trajectory data. Journal of Chemical Theory and Computation 9: 3084–3095. 10.1021/ct400341p [DOI] [PubMed] [Google Scholar]
- 71.Grant BJ, Rodrigues AP, ElSawy KM, McCammon JA, Caves LS (2006) Bio3d: an R package for the comparative analysis of protein structures. Bioinformatics 22: 2695–2696. [DOI] [PubMed] [Google Scholar]
- 72.Skjrven L, Yao XQ, Scarabelli G, Grant BJ (2014) Integrating protein structural dynamics and evolutionary analysis with Bio3D. Bmc Bioinformatics 15. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 73.Team RDC (2010) R: a language and environment for statistical computing. [Google Scholar]
- 74.Kollman PA, Massova I, Reyes C, Kuhn B, Huo S, et al. (2000) Calculating structures and free energies of complex molecules: combining molecular mechanics and continuum models. Acc Chem Res 33: 889–897. [DOI] [PubMed] [Google Scholar]
- 75.Miller BR, McGee TD, Swails JM, Homeyer N, Gohlke H, et al. (2012) MMPBSA.py: An Efficient Program for End-State Free Energy Calculations. Journal of Chemical Theory and Computation 8: 3314–3321. 10.1021/ct300418h [DOI] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
All relevant data are within the paper and its Supporting Information files.