Abstract
Eukaryotic transcription factors are versatile mediators of specificity in gene regulation. This versatility is achieved through mutual specification by context-specific DNA binding on the one hand, and identity-specific protein-protein partnerships on the other. This interactivity, known as combinatorial control, enables a repertoire of complex transcriptional outputs that are qualitatively disjoint, or non-continuum, with respect to binding affinity. This feature contrasts starkly with prokaryotic gene regulators, whose activities in general vary quantitatively in step with binding affinity. Biophysical studies on prokaryotic model systems and more recent investigations on transcription factors highlight an important role for folded state dynamics and molecular hydration in protein/DNA recognition. Analysis of molecular models of combinatorial control and recent literature in low-affinity gene regulation suggest that transcription factors harbor unique conformational dynamics that are inaccessible or unused by prokaryotic DNA-binding proteins. Thus, understanding the intrinsic dynamics involved in DNA binding and co-regulator recruitment appears to be a key to understanding how transcription factors mediate non-continuum outcomes in eukaryotic gene expression, and how such capability might have evolved from ancient, structurally conserved counterparts.
Keywords: Combinatorial control, Conformational dynamics, Low-affinity binding, Molecular hydration, Transcription factors, Transcriptional regulation
1. Introduction
Complex multicellular organisms rely on a highly adaptive and finely tuned physiology to develop and survive. Many of the evolutionary innovations found in eukaryotes function at the transcriptional level and are physically mediated by DNA-binding transcription factor proteins. In single-celled prokaryotes such as bacteria, transcriptional repressors and activators directly interact with metabolites which modulate their binding affinity to target operators and regulate transcriptional output in an affinity-dependent manner. In eukaryotic organisms, gene expression must additionally be spatially coordinated with respect to cell type and body plan, as well as temporally coordinated with respect to homeostatic setpoints. In both prokaryotes and eukaryotes, the condensation of genomic DNA into chromatin represents a broad layer of regulation that operates over large clusters of genes at the same time (Szabo et al. 2019). The eukaryotic chromatin exhibits significant complexity in topology as well as DNA-modifying chemistry over their prokaryotic counterparts (Sikorska and Sexton 2020; Willenbrock and Ussery 2004; Li et al. 2011). However, transcriptional complexity in eukaryotes in the mesoscale is also accompanied by complexity at the molecular level. One needs but a glance at the array of transcription factors in the IFN-β enhanceosome (Panne et al. 2007; Panne 2008) to appreciate the complex machinery that regulates eukaryotic transcription (Fig. 1).
Fig. 1. The interferon-β enhanceosome exemplifies combinatorial control.
An atomic model of three transcription factors — interferon response factors NF-κB (green), IRF3/IRF7 (orange), and ATF-2/c-Jun (purple) — arrayed cooperatively to the IFN-β enhancer. Formation of this enhanceosome stimulates the transcription of the IFN-β gene. In this structure, virtually every base position is contacted by the bound factors
To meet the complex needs of multicellular physiology, eukaryotic proteins possess distinct molecular features that are absent in prokaryotes (Charoensawan et al. 2010). One is the prevalence of intrinsically disordered regions which are particularly associated with transcription factors (Liu et al. 2006; Singh and Dash 2007; Minezaki et al. 2006). Another eukaryotic feature that is receiving strong intense attention is the phase-separating properties of eukaryotic proteins that coalesce to form membrane-less nuclear organelles (Zhu and Brangwynne 2015; Wheeler and Hyman 2018). Liquid-liquid phase separation by globular proteins such as lysozyme has been known for some time (Ishimoto and Tanaka 1977), and this process is favored when disordered regions are involved (Uversky 2017). The third feature is the combinatorial control of eukaryotic transcription factors. Combinatorial control refers to the formation of functionally distinct factor/DNA complexes as directed by DNA sequence and protein-protein partnerships (Remenyi et al. 2004; Pan et al. 2009, 2010). These complexes, which vary discontinuously in structure and function with binding affinity, are postulated to arise from an allosteric relationship between DNA on the one hand and co-regulatory protein partners on the other. The re-purposing of eukaryotic factors through combinatorial control is central to their ability to regulate complex genomes using a restricted set of DNA-binding motifs (Vaquerizas et al. 2009; Charoensawan et al. 2010).
The profound differences between prokaryotic and eukaryotic gene regulation suggest that principles derived from bacterial and bacteriophage regulators are insufficient to understand functionally important properties of eukaryotic transcription factors. New experimental tools, notably high-throughput nucleic acid sequencing and array approaches, have revealed many novel properties of transcription factors. These results have not been always interpreted according to the combinatorial control paradigm in connection with the immediate research questions which they address. In this review, we will first review the basic tenets of combinatorial control. We will then assess recent literature on novel modes of genomic control by transcription factors, specifically low-affinity recognition. Integrating this new knowledge, we will propose a more general abstraction of combinatorial control that could help frame incisive new directions of research in this area.
2. Chapter 1. DNA Sequence and Protein-Protein Partnerships Mutually Direct Transcription Factor Interactions
In 1998, Lefstin and Yamamoto proposed that distinct DNA sequences direct conformational changes that specify interactions with other protein partners (Lefstin and Yamamoto 1998). In the context of the major transcription factor families that have been studied up to that time, such as the POU family, nuclear receptors, and AP-1, most of which recognize DNA as oligomers, Lefstin and Yamamoto drew attention to the ordering of quaternary structure by DNA sequence. Once formed, these DNA-directed complexes present alternate surfaces to recruit specific co-regulatory partners and elicit distinct transcriptional outputs. Respecting the adage that “structure equals function”, the disjoint nature of eukaryotic transcriptional responses implies significant differences in structure as they are tuned from eliciting one pattern of transcriptional behavior to another. A prescient observation by Lefstin and Yamamoto is that allosteric control by DNA is engrained in the DNA-binding domains (DBDs) and not strictly dependent on the transactivation or other domains of transcription factors.
Following the proposal by Lefstin and Yamamoto, many studies have found that subtle changes to factor-specific binding sites in regulatory regions of genes resulted in gross changes in gene expression patterns. For example, the cell type-specific regulation of growth hormone (GH) expression by the POU-family factor Pit-1 is determined by the sequence of a single Pit-1 recognition site in the GH promoter (Scully et al. 2000). Substitution of this sequence alone with an affinity-neutral counterpart abolishes the normal selectivity of GH expression in somatotropic cells. Similarly, single nucleotide changes in the cognate sequences for NF-κB, which binds DNA as dimers, alters its specificity for co-factors (Leung et al. 2004). Since the altered sequences do not differ significantly in terms the affinity of factor-specific binding in these examples, the divergence in gene expression is remarkable.
High-resolution details on how DNA sequence initiates divergent transcriptional effects come from co-crystallographic studies of several transcription factors that bind DNA in distinct modes as directed by sequence. One of the more dramatic examples is afforded by the POU-family transcription factors. Their DNA-binding (POU) domain is a bipartite combination of two helix-turn-helix motifs. POU consists of a globular POU-specific subdomain (POUS) connected via a flexible variable-length linker to a homeodomain (POUH) (Sturm and Herr 1988). Depending on the spacing of the POUS− and POUH-binding sequences in native POU cognate sites, which do not differ significantly in affinity, POU domains dimerize with the two sub-domains of each subunit arranged in different orientations along the response elements (Remenyi et al. 2001; Herr and Cleary 1995; Jacobson et al. 1997). These orientations, in which the intra-subunit sub-domains are roughly cis or trans with respect to the major groove, expose altogether different surfaces to each other as well as co-regulators (Fig. 2). Variation of the cognate binding site for the glucocorticoid receptor (GR), a nuclear hormone receptor, results in little correlation with binding affinity in vitro, but recruits different protein partners and different expression levels of GR-dependent genes (Meijsing et al. 2009; Thomas-Chollier et al. 2013). More recently, studies on the dimeric T-box transcription factor T-bet showed that subunits bound to core motifs located on different DNA duplexes can crosslink in a configuration that mobilizes more than five-fold more exposed surfaces than canonical dimers binding minimally spaced cores on the same duplex (Liu et al. 2016a). Thus, quaternary structure represents an efficient handle for allosteric control by DNA. Functionally, observation of these effects in factors belonging to unrelated structure classes (as exemplified by the IFN-β enhanceosome, Fig. 1) and in species ranging from yeast to humans suggested them as a paradigm of eukaryotes gene regulation. Not surprisingly, the factor-directing properties by DNA sequence (Jolma et al. 2015) have encouraged interest in deciphering the impact of spacing, density, and orientation of transcription factor binding sites, or “syntax”, on the cis-regulatory potential of promoters and enhancers (Baetu 2011; Barolo 2016; Witzany 2017).
Fig. 2. DNA-directed structures formed by POU-domain factor Oct-1.
Two characteristic cognate response elements, termed PORE and MORE, are bound with similar affinities. On the longer PORE site, POUS and POUH straddle the helix, while they sit on the same side on the shorter MORE site and induce van der Waals contacts on POUS (yellow). Arrows show the connectivity of the unresolved linker in the co-crystal structures (PDB IDs in brackets)
Allosteric control by DNA represents one lever of the combinatorial tug-of-war for transcription factor response. Co-regulating protein partners represent the other. To cite one dramatic example, a splice variant of GR harboring one additional Arg residue in the DBD, which does not alter binding affinity, differentially activates a subset of GR-dependent genes (Meijsing et al. 2009). The definition of combinatorial control under its current moniker formally acknowledges the reciprocal influences of DNA sequence and protein partner identity (Remenyi et al. 2004). A subsequent formulation by Nussinov and collaborators in 2009 sharpened this picture by framing transcription factor interactions in terms of the free energy landscape (Pan et al. 2009, 2010). Free energy landscapes are thermodynamic constructs that relate the relative free energies of a chemical species differing in terms of conformational change or binding. Thus, a species is energetically favored relative to all others and at equilibrium, the lowest-energy state is the most highly populated in the ensemble. The energy barriers between related states also provide information on the kinetics of interconversion. While quantitative resolution of free energy landscapes of interactions between macromolecules remains extraordinary in computational cost (Pan et al. 2019), and is not routinely affordable to most investigators, they are intuitive constructs for modeling the relative (thermodynamic) stabilities of non-covalently linked states.
Based on the Nussinov concept (Pan et al. 2009, 2010), Fig. 3 shows free energy landscapes for hypothetical transitions by an ensemble consisting of three distinct DNA sequences encoding binding sites for a (homodimeric) transcription factor, and two partner proteins that are recruited via strictly protein-protein interactions. Since DNA sequence represents the only variation among the ensembles, the scheme demonstrates how specific binding remodels the free energy landscapes such that different factor/DNA complexes become the thermodynamically favored species. To the extent that these species are sufficient in mobilizing the regulatory machinery, such schemes illustrate how combinatorial control may operate in thermodynamically consistent terms. Although the various bound states transition in Fig. 3 in a particular order (beginning with DNA binding), the relative stabilities of the states are not order-dependent as per thermodynamics. The barriers separating any two adjacent states would of course apply to that specifically ordered pair. In the sense that the free energy landscape provides for cooperative control equally from protein or DNA, it captures and generalizes the DNA-directed proposal by Lefstin and Yamamoto (Lefstin and Yamamoto 1998). It is also compatible with transcription regulation at the higher level of the nucleosome, namely the reciprocal effects of nucleosomal dynamics on transcription factor binding (He et al. 2010; Grossman et al. 2018; Ballare et al. 2013), particularly by so-called pioneer factors (Magnani et al. 2011; Soufi et al. 2015).
Fig. 3. Schematic free energy landscapes for combinatorial docking to three different cognate DNA sequences.
The black-and-white DNA cartoon is intended to represent any one of the three sequences. Sequence-specific binding by a homodimeric transcription factor with affinities which are not necessarily identical (as the case here) differentially remodels the free energy landscapes (presented in black, blue, and red) for the recruitment of other proteins. The resultant changes in the most favored functionally competent complex (denoted with correspondingly colored backgrounds) bias the transcriptional output towards the pattern regulated by that complex. Even in this illustrative example, many alternative landscapes featuring other possibilities, for example co-regulating partners that recognizes one complex but not another, are possible
The free energy framework has some limitations. First, while the DNA and protein partners are not treated differently in a simplified free energy landscape such as Fig. 3, features of the genomic environment (e.g., the embedding of the target DNA site between nonspecific DNA in topographically complex chromatin, other DNA-binding proteins) that are not included may introduce asymmetry into this tug-of-war. A more general limitation is that the free energy landscape assumes that conditions of thermodynamic equilibrium prevail, at least locally, such that the relative population among the possible states are determined by their relative free energies. This assumption also requires that all accessible states be interconvertible (a consequence of the principle of microscopic reversibility), which may be slow relative to physiologically relevant timescales (Xhani et al. 2020). Nevertheless, the free energy landscape provides a useful framework for abstracting molecular details provided that all the species are known and accurately included in thermodynamic terms. One may rationalize, for example, why the affinities of factor/DNA interactions have not evolved to be typically much closer to the upper limit for non-covalent interactions (~10−15 M in dissociation constant), as the subsequent landscape remodeling would render energetically unfavorable any high-order state with additional partners.
As is the case of all thermodynamic constructs, free energy landscapes provide no explicit structural information on the transition. The microscopic structures or dynamics responsible for allostery become thermodynamically manifest only through their effects on macroscopic experimental observables. Dissection of the free energy changes into more fundamental components can provide useful clues based on the changes in characteristic physicochemical properties of the transition. Changes in enthalpy, entropy, and heat capacity provide information (and in the best cases, yield semi-quantitative estimates) of the amount and type of surface area involved in a transition (Spolar and Record 1994). Perturbation by hydrostatic or osmotic pressure can be modeled to give complementary information about the change in dynamics and molecular hydration (Silva et al. 2001; Royer and Winter 2009). While the analytical methodology of these types of thermodynamic analyses are quite well characterized, their limitations understood, and have been widely reported in the literature for well-defined transitions involving model domains, their use in multi-component complexes that typify functionally interesting transcription factor/DNA species remain virtually unknown.
3. Chapter 2. Low-Affinity Transcription Regulation: New Perspectives to Combinatorial Control
Recent advances in high-throughput binding assays adapting next-generation sequencing and microarray technologies have significantly improved the definition of genomic occupancy by transcription factors in vivo (Rastogi et al. 2018). These technical refinements have provided access to the population of factor/DNA complexes in vivo that are significantly lower in affinity. These studies have revealed an unanticipated high level of low-affinity binding by eukaryotic factors (Wang et al. 2015; Crocker et al. 2016; Kribelbauer et al. 2019). More importantly, mounting evidence shows the absolute requirement of low-affinity binding for correct tissue development in cellular and animal models (Farley et al. 2015; Farley et al. 2016; Crocker et al. 2015; Cary et al. 2017), as well as the exclusivity of low-affinity complexes to specific cellular functions (Krishnamoorthy et al. 2017; Iwata et al. 2017). A major area of interest in low-affinity transcriptional control in recent years has been embryogenesis. Functional studies in model animals including Ciona sea squirts (Farley et al. 2015; Farley et al. 2016), Drosophila spp. (Jiang and Levine 1993; Crocker et al. 2015; Ramos and Barolo 2013), C. elegans (Gaudet and Mango 2002), sea stars and sea urchins (Cary et al. 2017), and the annelid P. dumerilii (Handberg-Thorsager et al. 2018) show that low-affinity binding by transcription factors is essential to correct spatiotemporal patterning of tissues. Low-affinity binding is thought to enable spatio-temporal control of genes by tuning the sensitivity of their enhancers to transcription factor levels as well as facilitating cooperative regulation by multiple factors (Crocker et al. 2016; Weingarten-Gabbay and Segal 2014). Substituting the wildtype low-affinity sites with high-affinity sequences results in ectopic and overly intense gene expression, demonstrating the functional significance of low-affinity signaling in regulating the level, location, timing, and specificity of developmentally sensitive genes.
The key question for low-affinity binding is how it can deliver functionally specific transcription factor complexes. Investigators have addressed this issue in their recent reviews on low-affinity site recognition (Crocker et al. 2016; Kribelbauer et al. 2019). Proposed mechanisms of specificities involving low-affinity binding fall broadly into two categories. The first consists of mechanisms that directly generate specificity through cooperative binding, either intramolecularly or intermolecularly, to unveil latent specificities. Intramolecular avidity can be derived from combining multiple DBDs from the same or different structural families within the same protein, with zinc finger proteins such as the CTCF insulator being dramatic examples (Lambert et al. 2018). Alternatively, multiple factors assemble as dictated by DNA sequence. Hematopoiesis, an established paradigm of transcriptional control (Orkin and Zon 2008), provides important examples. The hematopoietic master transcription factor PU.1, a member of the ETS transcription factor family, exhibits at least two distinct genomic binding modes in vivo. One type is cooperative binding with partners such as IRF4 at composite high-affinity sites (Mohaghegh et al. 2019; Hosokawa et al. 2018). In cell lineages in which PU.1 expression is silenced, such as most T cells, Ap-1 takes the place of PU.1 (Glasmacher et al. 2012), as exemplified by the IFN-β enhanceosome (Fig. 1). The second type of genomic PU.1 binding is collaborative binding with factors such as C/EBPα which bind at separate sites and is triggered at low-affinity PU.1 motifs. While high-resolution differences between high- and low-affinity bound PU.1 are not yet known, biophysical evidence supports significant differences in the disposition of solvent-exposed surface areas and their distinctiveness from other members of the ETS family (Wang et al. 2014; Xhani et al. 2017; Albrecht et al. 2018).
Alternatively, to overcome the intrinsic deficit in affinity, mechanisms are proposed to modulate the specificity of transcription factors indirectly through their local concentrations. Eukaryotic subcellular compartments, dubbed transcriptional hubs, are postulated to sequester transcription factors (Kribelbauer et al. 2019). Liquid-liquid phase separation (LLPS) into membrane-less organelles is thought to be one hub-generating mechanism (Zhu and Brangwynne 2015; Wheeler and Hyman 2018). Notwithstanding the interest in disordered proteins with LLPS, due to their abundance in nuclear proteins (Uversky 2017), structured globular proteins also form similar condensates under appropriate solution conditions (Cinar et al. 2019). It should be noted that, to be a useful pool, the sequestered transcription factor must be physically available for exchange with genomic targets. The DBD of GR, for example, binds with structured RNA encoded by different sequences with comparable affinity as canonical DNA response elements (Parsonnet et al. 2019). Since RNA-bound proteins are now widely known to undergo LLPS (Buchan 2014; Rhine et al. 2020), and GR could hardly be alone among transcription factors to bind RNA, sorting out such sequestration mechanisms in support of low-affinity transcriptional regulation represents a complicated proposition.
The widespread existence and functional importance of low-affinity binding motivate the need to interpret it in the context of the combinatorial control framework, which envisions sites selection among DNA sites of similar affinities in the free energy landscape (Pan et al. 2009, 2010). As long as some of the DNA-bound states are more stable than the unbound transcription factor, binding redistributes the bound and free states of the transcription factor with additional potential of remodeling the free energy landscape. Thus, allosteric control by low-affinity DNA does not necessarily present any contradiction. However, if one assumes stationary conditions at thermodynamic equilibrium, population of the various states is entirely dependent on concentration according to the law of mass action. Lower-affinity interactions therefore demand large concentrations of unbound components, particularly in the presence of higher-affinity competitors. This may mean concentration requirements for low-affinity targets that are nominally non-physiological. Living cells, of course, are not under thermodynamic equilibrium overall (though chemical potentials may be near their equilibrium values locally), and various sequestration mechanisms have been proposed that may significantly localize and boost effective concentrations (vide supra). Moreover, transcriptional effects only require sufficient kinetic persistence at the level of individual molecules, which reflects but is not dictated by equilibrium properties of macroscopic ensembles.
In addition to thermodynamic concerns, low-affinity transcriptional complexes pose structural challenges to the combinatorial control framework. A striking example is afforded by an alternative dimeric binding mode by GR to DNA motifs in which a high- and low-affinity core are oriented tail-to-tail (an “inverted repeat”) such that the subunits lie on opposite faces of the DNA (Weikum et al. 2017). Unlike the well-known canonical motif in which two high-affinity cores recruit the subunits avidly with positive cooperativity in a spaced head-to-head configuration (Luisi et al. 1991), inverted repeats are bound 50–100 times more poorly and with negative cooperativity (Hudson et al. 2013). The co-crystal structure shows a that the GR monomers engage each core as separated monomers in the inverted repeat, which characterizes repressive GR-responsive elements, in stark contrast to the cooperative dimers found in the canonical complex bound to activating response elements (Fig. 4). GR therefore demonstrates that low-affinity complexes can be structurally so different that their free energy landscapes would be altered quite beyond the fine-tuning that characterizes combinatorial control (c.f., Fig. 3) as originally envisioned for cognate sequences of comparable affinities.
Fig. 4. Alternate bound states of the glucocorticoid receptor (GR) DBD dimer.
The canonical GR/DNA complex (red; PDB ID: 1glu) and an alternate tail-to-tail orientation (blue; PDB ID: 4hn5) are structurally aligned at the middle monomer to highlight the inverted orientations (arrows). Zn2+ ions coordinated by the two zinc fingers in each monomer are shown as spheres. The hexamer core sequences recognized by the monomers are marked in grey. Unlike the canonical dimer, the latter orientation is characteristically low-affinity, exposes different surfaces to each monomer and co-regulators, and mediates trans-repressive functions of GR
4. Chapter 3. Rethinking Combinatorial Control: Concept of Non-continuum Response
In the combinatorial control framework, transcription factors are flexible species that can be allosterically repurposed by DNA and protein in an identity-dependent manner. The mechanisms of allosteric regulation are myriad. Investigations on the DNA side alone have revealed profound effect of short-range DNA sequence and longer-range cis-regulatory “syntax” (spacing, density, orientation) that extends well beyond the sequence of the transcription factor binding site itself. More dramatic though less common are locally non-canonical DNA structures, such as triplex and G-quadruplexes, which can directly bind (Cogoi et al. 2010; Gao et al. 2015) or modify recognition by transcription factors (Zhou et al. 2019). On the protein side, quaternary structure and heterotypic protein-protein partnerships can amplify permutations in binding both locally and with distal cis elements potentially hundreds of bp away in sequence. This flexibility in the structure as well as usage of transcription factors and their DNA binding sites has been evocatively termed “plasticity” and “pleiotropy” (e.g., Kyrmizi et al. 2006; Weikum et al. 2017; Preger-Ben Noon et al. 2018; Kalodimos et al. 2002). Readers may recognize these borrowed metaphors from the description of complex organismal and cellular phenotypes in the genetics and developmental biology literature.
To incorporate low-affinity regulation more seamlessly into combinatorial control, and secondarily to avoid the potentially contentious usage of domain-specific labels, we propose a conceptual framework in which transcription factors generate qualitatively disjoint, or non-continuum responses as specified by intermolecular interactions with DNA and protein partners. These partner-specific interactions direct the factors to broadcast the effects of specific transcriptional programs. Conceptually, the designation of “non-continuum” focuses attention on the effects of transcription factor action, rather than the manner in which they are achieved. As a result, it conveniently unites combinatorial control of transcription factors via at high- and low-affinity DNA sites, which have been alternately described as “suboptimal” or “submaximal” (Farley et al. 2015; Farley et al. 2016). Finally, it strikes an important contrast with the prokaryotic paradigm of transcriptional regulation.
Prokaryotic gene regulators represent some of the best understood systems in transcriptional regulation. Systems such as the cAMP-response protein (CRP, also known as catabolite activator protein or CAP) and the lac repressor are classic models in our understanding of the operon and protein/DNA allostery (Kalodimos et al. 2004b; Lawson et al. 2004). A general property in prokaryotic gene regulation is that specific regulators have narrow functional scopes, meaning that they mediate consistent functions at their DNA target sites, even though they may be combined in complex cis-regulatory logic (van Hijum et al. 2009). This description recognizes that control regions of prokaryotic operons may be expansive in organization, as exemplified by the multi-operator architecture of the lac and trp operons (Merino et al. 2008). Prokaryotic proteins also exhibit complex allostery that coordinates the mutually perturbative effects of binding to DNA and low-MW metabolites (Popovych et al. 2006). However, the activity and molecular interactions of these proteins track each other quantitatively along a gradient, generally as a function of their affinities for the DNA site or other ligands. Detailed NMR-based studies of CAP/DNA binding, for example, show a continuum transition in structure in step with affinity, as indicated by the chemical shifts of heteronuclear cross-peaks (Tzeng and Kalodimos 2012). Indeed, this continuum between structure and affinity makes possible the exhaustive characterization of DNA sequence by high-throughput screening of operator libraries (Zuo et al. 2015; Zuo and Stormo 2014).
With this backdrop, “non-continuum responses” denote the qualitatively distinct molecular behavior of eukaryotic transcription factors that are elicited upon binding different cognate DNA sequences and protein co-regulators (Fig. 5). Unlike gene regulation in prokaryotes, these outcomes in gene expression and interactions with partner proteins cannot be graded according to binding affinity with any one partner. As GR and others systems exemplify, eukaryotic transcriptional responses triggered by low-affinity binding are therefore not just fractional in magnitude relative to high-affinity binding. Instead, they are qualitatively non-equivalent and biologically irreplaceable as reviewed in Chapter 2. Thus, for both high- and low-affinity binding, “non-continuum responses” capture the essential molecular tug-of-war between DNA and protein co-regulators that remodels the conformations of transcription factors discontinuously in structure.
Fig. 5. Non-continuum nature of output marks eukaryotic transcription factors.
In prokaryotes, gene regulatory proteins elicit qualitatively consistent functions (e.g., repression of an operon) whose intensity quantitatively tracks the DNA binding affinity. Continuum behavior can be experimentally observed, for example, by solution NMR via heteronuclear chemical shifts e.g., (Tzeng and Kalodimos 2012). In stark contrast, eukaryotic transcription factors elicit qualitatively disjoint effects (e.g., activity in a particular cell type, collaboration with specific co-regulators) that do not correlate with binding affinities
5. Chapter 4. Dynamic Mechanisms in Non-continuum Responses
5.1. Section 4.1. Importance of Conformational Dynamics in Combinatorial Control
Biopolymers in general, including proteins and nucleic acids, are conformationally dynamic objects in solution. Dynamics arise most fundamentally from significant thermal motion at temperatures at which water is liquid. Globular protein domains and helical DNA (and folded RNA) are restrained dynamically relative to their disordered chains, as quantified thermodynamically by the configurational entropy. Nevertheless, significant dynamics remain in the folded states of proteins, as has been realized since at least the 1970s (Cooper 1976, 1984). In eukaryotic proteins, these dynamics can be concentrated locally in intrinsically disordered regions (IDRs). Compared with prokaryotic gene regulators, transcription factors are particularly enriched in intrinsic disorder, particularly near their structured DNA-binding domains (Guo et al. 2012). Among a host of other molecular functions, IDRs are also frequent targets of post-translational modifications (van der Lee et al. 2014). From yeasts to high-order eukaryotes, transcription factors therefore appear to accrue increasing dynamics in their natural molecular evolution, suggesting physiological significance.
Given the fundamental dynamics of proteins and nucleic acids, it is natural to consider how conformational dynamics of transcription factors mediate their responsiveness to DNA targets and partner proteins in an identity-dependent manner. These considerations dovetail to long-standing questions of conformational selection (capture of pre-existing conformations within the ensemble of the free state) versus induced fit (conformational adjustment), a classic debate with origin in the concerted and sequential models of allosteric transitions (Monod et al. 1965; Rubin and Changeux 1966). The abundance of intrinsic disorder in transcription factor adds to the functional relevance of these lines of inquiry. Following up on their work on combinatorial control, Nussinov and collaborators postulated that these dynamics primarily involve conformational selection by DNA or partner protein, potentially with induced fit along the binding pathway (Nussinov et al. 2014). Although this picture is in accord with contemporary consensus derived from hemoglobin, model enzymes and membrane receptors (Vogt et al. 2014; Vogt and Di Cera 2012, 2013; Changeux and Edelstein 2011), it is nevertheless inferred from static structures (Nussinov et al. 2014; Pan et al. 2010).
Commenting primarily on GR, Leftsin and Yamamoto had proposed that sequence-dependent signals are transduced from the protein/DNA interface to allosteric sites postulated to recruit protein co-regulators (Lefstin and Yamamoto 1998). To test such hypotheses, it would be necessary to define the dynamics of the unbound protein and the dynamic perturbations due to binding. The implied dynamic perturbations by high- and low-affinity DNA would be expected to differ substantially, with distinct structural and thermodynamic signatures. Unfortunately, direct investigations of transcription factor dynamics have received less attention relative to other classes of proteins. Moreover, this knowledge gap is not well addressed by the significant work done on prokaryotic regulators (Popovych et al. 2006; Villa et al. 2005; Kalodimos et al. 2004a), which act as continuum transcriptional modulators.
In the literature, the non-continuum dynamics of two transcription factor systems have been examined in some detail. The first is GR, whose DBD engages DNA via two Cys4 zinc fingers to cognate hexameric core sequences. The Cys4 zinc fingers are separated by a loop termed the “lever arm” which is not canonical to this common DNA-binding motif (Vandevyver et al. 2014). Comparison of co-crystal structures show that the lever arm is conformationally variable and modifies the relative orientation of the two zinc fingers discretely in a sequence-dependent manner (Meijsing et al. 2009). Lever arm dynamics have been examined directly by solution NMR spectroscopy (Thomas-Chollier et al. 2013) and in silico by explicit-solvent molecular dynamics (MD) simulations (Alvarez et al. 2017). The results show that lever arm dynamics are sensitive to clinically relevant point alterations in primary structure, and are propagated to distal interfaces that mediate GR dimer formation and interactions with co-regulators.
A second model system is represented by Ets-1, the archetypal member of the ETS family of transcription factors. ETS transcription factors comprise an ancient family of gene regulators and are conserved throughout the animal kingdom (Degnan et al. 1993; Wang and Zhang 2009). ETS proteins recognize diverse DNA sequences harboring a core 5′-GGA(A/T)-3′ consensus (Wei et al. 2010). Among many other roles in cellular housekeeping and hematopoiesis, Ets-1 is specifically implicated in correct development of the notochord in Ciona embryos (Farley et al. 2015; Farley et al. 2016). A low-affinity site in the Mnx enhancer is a specific and absolute requirement for correctly patterned Mnx expression in vivo (Farley et al. 2016). Ets-1 binds core DNA sites characteristically as monomers (Garvie et al. 2002). To resolve the structural and dynamic basis of low-affinity recognition, extensive explicit-solvent MD simulations were carried out for Ets-1 free and bound to cognate and nonspecific DNA (Huang et al. 2019). Principal component analysis (PCA) of the trajectories shows that high- (HA) and low-affinity (LA) cognate binding represent disjointed DNA recognition modes (Fig. 6a). While backbone dynamics separate cognate (HA, LA) from nonspecific (NS) DNA recognition, sidechain dynamics distinguish HA and LA binding. By both back-bone and sidechain dynamics, the HA complex is more similar to nonspecific than LA (black arrows, Fig. 6a). Thus, LA-bound Ets-1 is a non-continuum species residing outside the unbound-NS-HA coordinate.
Fig. 6. Non-continuum Ets-1/DNA recognition.
(a) Principal component analysis of MD trajectories of the ETS domain of Ets-1, free and in complex with DNA, reveals collective dynamics out of order with their binding affinities. Red arrows denote the collective sidechain motions along the first principal component, PC1. HA = high-, LA = low-affinity, NS = nonspecific, UB = unbound. (b) A triad of residues acts as a binary relay. LA binding breaks an allosteric salt bridge that is constitutively intact in the other states. (c) Disruption of this salt bridge floods an underlying hydrophobic core in helices H1 and H2. (d) DNA binding by a Q336L mutant, which renders the relay unresponsive in simulations, fails to trigger low-affinity binding. Data published in (Huang et al. 2019)
Scrutiny of the sidechain PCA and inter-residue contacts reveals a triad of residues that mediate the non-continuum LA response (Fig. 6b). Gln336 and Arg378 near the DNA-contact surface, together with a distal Glu343 residue, act as a binary relay that toggles between two sidechain configurations. In the default configuration (as present in unbound Ets-1 and preserved in nonspecific and HA-bound complexes), Glu343 and Arg378 form a salt bridge, while Gln336 contacts the DNA backbone. LA DNA uniquely triggers a phosphate contact with Arg378 that breaks the salt bridge with Glu343. Loss of this salt bridge destabilizes the local α-helical structure and exposes an underlying hydrophobic patch to solvent (Fig. 6c). Thus, cognate DNA recognition by Ets-1 involves an allosteric mechanism in which sequence variations are sensed by an interfacial Arg residue and transmitted to a distal salt bridge. Abrogation of this salt bridge exposes and hydrates the hydrophobic core, incurring a binding penalty. In simulations, a Q336L mutation desensitizes the relay to LA DNA. Experimentally, Q336L binds LA DNA with similarly high affinity as HA DNA, thus failing to trigger the LA-specific response (Fig. 6d). Q336L also fails to reproduce the site occupancy pattern of wildtype Ets-1 in a native enhancer context (Huang et al. 2019). Binding experiments with Ets-1 mutants therefore verified the functional effect of the triadic relay. Conservation of the triadic relay in ~50% of ETS members suggests that it may be a shared non-continuum mechanism in this family.
Though still in limited supply, these examples of transcription factors, from unrelated structural families and whose non-continuum transcriptional properties are functionally established, illustrate a diversity in their dynamic mechanisms of DNA recognition. These examples suggest a general theme in which DNA-binding motifs evolve novel mechanisms to overcome structural constraints to broaden their DNA sequence tolerance. The attendant evolutionary pressure offers one resolution to the general observation that “no simple code” exists in protein/DNA readout (Slattery et al. 2014) when average shape, but not dynamic properties, is explicitly taken into account.
5.2. Chapter 4.2. Molecular Hydration in Non-continuum Tuning
The nucleoplasm is a crowded microenvironment in which a complex mixture of solutes ranging over orders of magnitude in size, concentration, and physicochemical properties populate an aqueous milieu (Goodsell 1991). As the space-filling solvent, water remains the numerically dominant species, and its overwhelming demand for hydrogen bonding (Eaves et al. 2005) strongly influences biomolecular conformation and binding. Hydration is therefore an indispensable component in protein and nucleic acid dynamics in solution (Chalikian 2003; Chalikian and Macgregor 2007), including in vivo (Parsegian and Rau 1984; Rand et al. 2000). The close connection between hydration and dynamics is a classical topic in physical organic chemistry and continue to garner strong interest for their roles in biomolecular interactions. Their well-known thermodynamic coupling, enthalpy-entropy compensation, remains actively debated (Fox et al. 2018; Pan et al. 2015). In the literature, work has focused on the hydration dynamics of folded and unfolded states of proteins and nucleic acids (Duboue-Dijon et al. 2016; Laage et al. 2017; Persson et al. 2018). The results have updated our view of hydration water as highly heterogeneous and generally more mobile than previously considered. Work on biomolecular recognition has focused on protein binding by small ligands e.g., (Liu et al. 2016b; Breiten et al. 2013). The coupling of hydration and conformational dynamics in DNA recognition remains an uncultivated area despite obvious differences between DNA and low-MW substrates. As a polyanion, helical DNA has significant sequence-dependent dynamics and hydration that can impact binding. Addressing these fundamental concepts in sequence-directed tuning should expand our knowledge of protein/DNA recognition beyond time-averaged sequence-specific DNA structures and shape (Rohs et al. 2010; Joyce et al. 2015). The triadic relay in Ets-1, which works directly through hydration of a hydrophobic core in the low-affinity complex (Huang et al. 2019), represents one such example.
5.3. Chapter 4.3. Experimental and Computational Approaches to Molecular Dynamics and Hydration
X-ray crystallography is the classic high-resolution probe of macromolecular structure and provides dynamics information indirectly through the temperature factor and multiple occupancies of local electron densities (Wlodawer et al. 2008). Cryo-electron microscopy is now approaching, in the best cases, resolution of crystallography for large assemblages with dimensions in the mesoscopic length scale (10–1000 nm). The information content of Cryo-EM, including dynamics information, is extended still when structure fitting occurs in concert with crystallographic structures of components and MD methods. For true in-solution dynamics, high-field multi-dimensional NMR is a powerful technique for probing the dynamics of proteins and nucleic acids (Kleckner and Foster 2011; Purslow et al. 2020). As NMR measures the macroscopic ensemble, it benefits greatly from atomistic modeling of dynamical observables by MD approaches. MD is also highly complementary to single-molecule experiments provide direct observation of dynamics at the mesoscopic length scale (Deniz et al. 2008).
With respect to hydration, water molecules are frequently observed in crystal structures. However, discrete water observed in crystal structures constitutes the most stable, well-ordered sub-population of the water involved in hydration (Nakasako 2004). The more mobile water associated with dynamic structures do not produce observable electron densities. NMR is capable of probing hydration of macromolecular surfaces in solution, but reliably only if the macromolecules are encapsulated in reverse phase micelles to break the magnetic coupling between local and bulk water (Gallo et al. 2018). These complications have thus far limited the application of NMR in hydration studies. By far the most frequently used approach to hydration of nucleoprotein complexes is osmotic stress, using neutral compatible co-solutes such as betaine and sucrose to perturb hydration water via an increase in osmotic pressure in bulk solution (Parsegian et al. 1995). Another volumetric technique that has been applied in protein/DNA binding is hydrostatic pressure, though much more commonly for enzymes than for transcription factors (Macgregor 2002). A vast literature exists on the volumetric properties of the folded and unfolded states of model proteins, polypeptides, nucleic acids, and their interactions with low-MW ligands; reviewed e.g., (Chalikian 2003; Chalikian and Macgregor 2007; Royer and Winter 2009). As suggested by Sligar (Lynch and Sligar 2002), modeling of volumetric data with MD simulations have the potential to yield significant insight. In addition to conformational dynamics of macromolecules, MD simulations have also been used to analyze the dynamics of hydration water purely in silico (Laage and Hynes 2006; Duboue-Dijon et al. 2016; Persson et al. 2018).
6. Chapter 5. Evolutionary Origins of Non-continuum Tuning
How did non-continuum behavior arise in eukaryotic transcription factors? Some evolutionarily modern transcription factors harbor de novo DNA-binding motifs e.g., T-box proteins (Sebe-Pedros et al. 2013), or heavily modified classical motifs such as nuclear receptors (de Mendoza et al. 2013; Schmitz et al. 2016). However, many factors harbor DNA-binding motifs that are directly descended from prokaryotic gene regulators. Since the molecular properties that endow non-continuum transcriptional outputs are already engrained in the DBDs, this suggests an evolutionary relationship between continuum and non-continuum behavior in the most conserved motifs across the trees of life. The helix-turn-helix (HTH) domain is an ideal model in this regard, because it is one of the most ancient and conserved of the transcriptional apparatus in all organisms (Sauer et al. 1982) and was already present in the last universal common ancestor (LUCA) of extant life forms (Burton et al. 2016). HTH motifs are extraordinarily robust to sequence changes as evidenced by the myriad amino acid sequences capable of encoding them (Aravind et al. 2005). As examples, we discuss two lineages of HTH descendants: cAMP-response protein (CRP)-like motif and tetrahelical HTH bundles characterized by bacteriophage proteins.
6.1. Chapter 5.1. CRP-Like Motif
The CRP-like motif is a mixed α/β bundle in which a 4-stranded, anti-parallel β-sheet is interrupted by an HTH core in the primary structure (Aravind et al. 2005). The ETS, interferon regulatory factors (IRF), and heat shock factor (HSF) families represent the extant eukaryotic descendants of the CRP-like motif (Liang et al. 1994; Landsman and Wolffe 1995). In bacteria, the DBD of CRP is integrated with its N-terminal cAMP-sensor domain in a one-component system. Two CRP subunits bind DNA as a homodimer that is connected via the sensor domains only. The stimulatory effects of cAMP on DNA binding in CRP, as well as the cooperativity in cAMP binding across the two CRP subunits, represent a classic model of protein allostery (Lawson et al. 2004). The tertiary structure of the DBDs in CRP is strongly constrained by the encroaching sensor domain (Fig. 7a). Detailed NMR characterization has firmly established the continuum nature of the conformational CRP trajectory from unbound to the cognate complex (Tzeng and Kalodimos 2012). In contrast, all three eukaryotic families exhibit with non-continuum properties with low-affinity DNA (Santoro et al. 1998; Iwata et al. 2017; Krishnamoorthy et al. 2017). Beyond their divergent primary structures, the eukaryotic CRP-motifs are differentiated from CRP by their elaboration of structural elements, particularly loops (Fig. 7b). These elaborations inflate the eukaryotic CRP-motifs by up to ~40% in sequence length, suggesting that inflated domain structure may represent an enabling feature for non-continuum dynamics.
Fig. 7. Evolutionary relationships in two HTH-derived motifs.
(a) Average NMR structures of the three DBD families that descended from CRP. The non-DNA-binding sensor domains in homodimeric CRP are in grey. (b) Comparison of the secondary structure elements between the eukaryotic factors and CRP. Positive values indicate elaboration in the eukaryotic factors. Green bars represent loops adjoining neighboring structured elements. Preferential elaboration of loops is evident. (c) Structural alignment of the POUS domain with HTH bundles from the 434 bacteriophage (all NMR average structures). A conserved intramolecular salt bridge confirms and emphasizes this conservation (PDB IDs in brackets)
6.2. Chapter 5.2. POU Transcription Factors
The bipartite architecture of POU-domain factors, which has been discussed in Chapter 1, is a clear eukaryotic innovation and the linker’s diversity in length and composition has been characterized (van Leeuwen et al. 1997). The two POU sub-domains are not independent binding units, however, as the two complexes show distinct and non-overlapping POUS/POUH interfaces (Fig. 2). From an evolutionary perspective, homeodomains including POUH represent a phylogenetically distinct innovation in eukaryotes, with no phylogenetic counterparts among extant prokaryotes (Aravind et al. 2005). In contrast, the POUS domains, which are tetrahelical bundles, are highly conserved with the DBD of the bacteriophage 434 repressor and 434 CRO proteins (Fig. 7c), including a characteristic intramolecular Arg-Glu sidechain-mediated salt bridge (Assa-Munt et al. 1993). Bacteriophages are very ancient viruses, substantially predating the separation of archaea and eukaryotes from prokaryotes (Brüssow and Hendrix 2002). Moreover, binding studies show that POUS, not POUH, is the primary determinant for POU-specific binding (Ingraham et al. 1990). The evidence therefore indicates that the POUS domain is the major behavior-modifying component and the relevant species for cross-kingdom evolutionary analysis.
7. Concluding Remarks
Eukaryotic transcription factors are remarkably versatile mediators of transcriptional regulation. Recent biophysical, genomic, and biological studies have considerably advanced our understanding of the molecular basis and functional relevance of this flexibility. Integrating the existing literature on combinatorial control and low-affinity transcriptional regulation, we propose in this review the concept “non-continuum responses” to unite these ideas and contrast them with prokaryotic transcriptional regulation. In addition, we emphasize the importance of conformational dynamics of the folded state and their attendant effects on molecular hydration in sensitizing transcription factors to a large range of co-regulatory protein and DNA partners. Conformational dynamics and hydration are classic concepts in molecular biophysics whose theoretical underpinnings have been extensively studied. A frontier in these areas is now the application of computational approaches to domain-scale systems and their complexes, in explicit solvent, to model experimental observables by ensemble techniques, such as NMR and volumetric measurements, as well as single-molecule tracking. We expect these developments to meaningfully inform genome-scale data in protein binding and gene expression for an increasing number of physiologically and clinically interesting targets.
Acknowledgements
G.M.K.P is supported by NSF grant MCB 2028902 and NIH grants GM137160 and HL155178.
Footnotes
Conflicts of Interest The author declares no competing interests.
References
- Albrecht AV, Kim HM, Poon GMK (2018) Mapping interfacial hydration in ETS-family transcription factor complexes with DNA: a chimeric approach. Nucleic Acids Res 46(20):10577–10588. 10.1093/nar/gky894 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Alvarez LD, Presman DM, Pecci A (2017) Molecular dynamics simulations of the glucocorticoid receptor DNA-binding domain suggest a role of the lever-arm mobility in transcriptional output. PLoS One 12(12): e0189588. 10.1371/journal.pone.0189588 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Aravind L, Anantharaman V, Balaji S, Babu MM, Iyer LM (2005) The many faces of the helix-turn-helix domain: transcription regulation and beyond. FEMS Microbiol Rev 29(2):231–262. 10.1016/j.femsre.2004.12.008 [DOI] [PubMed] [Google Scholar]
- Assa-Munt N, Mortishire-Smith RJ, Aurora R, Herr W, Wright PE (1993) The solution structure of the Oct-1 POU-specific domain reveals a striking similarity to the bacteriophage lambda repressor DNA-binding domain. Cell 73(1):193–205. 10.1016/0092-8674(93)90171-l [DOI] [PubMed] [Google Scholar]
- Baetu TM (2011) A defense of syntax-based gene concepts in Postgenomics: genes as modular subroutines in the master genomic program. Philos Sci 78(5):712–723. 10.1086/662261 [DOI] [Google Scholar]
- Ballare C, Castellano G, Gaveglia L, Althammer S, Gonzalez-Vallinas J, Eyras E, Le Dily F, Zaurin R, Soronellas D, Vicent GP, Beato M (2013) Nucleosome-driven transcription factor binding and gene regulation. Mol Cell 49(1):67–79. 10.1016/j.molcel.2012.10.019 [DOI] [PubMed] [Google Scholar]
- Barolo S (2016) How to tune an enhancer. Proc Natl Acad Sci U S A 113(23):6330–6331. 10.1073/pnas.1606109113 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Breiten B, Lockett MR, Sherman W, Fujita S, Al-Sayah M, Lange H, Bowers CM, Heroux A, Krilov G, Whitesides GM (2013) Water networks contribute to enthalpy/entropy compensation in protein-ligand binding. J Am Chem Soc 135 (41):15579–15584. 10.1021/ja4075776 [DOI] [PubMed] [Google Scholar]
- Brüssow H, Hendrix RW (2002) Phage genomics. Cell 108(1):13–16. 10.1016/s0092-8674(01)00637-7 [DOI] [PubMed] [Google Scholar]
- Buchan JR (2014) mRNP granules. Assembly, function, and connections with disease. RNA Biol 11 (8):1019–1030. 10.4161/15476286.2014.972208 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Burton ZF, Opron K, Wei G, Geiger JH (2016) A model for genesis of transcription systems. Transcription 7 (1):1–13. 10.1080/21541264.2015.1128518 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Cary GA, Cheatle Jarvela AM, Francolini RD, Hinman VF (2017) Genome-wide use of high- and low-affinity Tbrain transcription factor binding sites during echinoderm development. Proc Natl Acad Sci U S A 114 (23):5854–5861. 10.1073/pnas.1610611114 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Chalikian TV (2003) Volumetric properties of proteins. Annu Rev Biophys Biomol Struct 32:207–235. 10.1146/annurev.biophys.32.110601.141709 [DOI] [PubMed] [Google Scholar]
- Chalikian TV, Macgregor RB (2007) Nucleic acid hydration: a volumetric perspective. Phys Life Rev 4 (2):91–115. 10.1016/j.plrev.2006.11.001 [DOI] [Google Scholar]
- Changeux JP, Edelstein S (2011) Conformational selection or induced fit? 50 years of debate resolved. F1000 Biol Rep 3:19. 10.3410/B3-19 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Charoensawan V, Wilson D, Teichmann SA (2010) Genomic repertoires of DNA-binding transcription factors across the tree of life. Nucleic Acids Res 38 (21):7364–7377. 10.1093/nar/gkq617 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Cinar H, Fetahaj Z, Cinar S, Vernon RM, Chan HS, Winter RHA (2019) Temperature, hydrostatic pressure, and Osmolyte effects on liquid-liquid phase separation in protein condensates: physical chemistry and biological implications. Chemistry 25(57):13049–13069. 10.1002/chem.201902210 [DOI] [PubMed] [Google Scholar]
- Cogoi S, Paramasivam M, Membrino A, Yokoyama KK, Xodo LE (2010) The KRAS promoter responds to Myc-associated zinc finger and poly(ADP-ribose) polymerase 1 proteins, which recognize a critical quadruplex-forming GA-element. J Biol Chem 285 (29):22003–22016. 10.1074/jbc.M110.101923 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Cooper A (1976) Thermodynamic fluctuations in protein molecules. Proc Natl Acad Sci U S A 73 (8):2740–2741. 10.1073/pnas.73.8.2740 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Cooper A (1984) Protein fluctuations and the thermodynamic uncertainty principle. Prog Biophys Mol Biol 44 (3): 181–214. 10.1016/0079-6107(84)90008-7 [DOI] [PubMed] [Google Scholar]
- Crocker J, Abe N, Rinaldi L, McGregor AP, Frankel N, Wang S, Alsawadi A, Valenti P, Plaza S, Payre F, Mann RS, Stern DL (2015) Low affinity binding site clusters confer hox specificity and regulatory robustness. Cell 160(1–2):191–203. 10.1016/j.cell.2014.11.041 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Crocker J, Noon EP, Stern DL (2016) The soft touch: low-affinity transcription factor binding sites in development and evolution. Curr Top Dev Biol 117:455–469. 10.1016/bs.ctdb.2015.11.018 [DOI] [PubMed] [Google Scholar]
- de Mendoza A, Sebe-Pedros A, Sestak MS, Matejcic M, Torruella G, Domazet-Loso T, Ruiz-Trillo I (2013) Transcription factor evolution in eukaryotes and the assembly of the regulatory toolkit in multicellular lineages. Proc Natl Acad Sci USA 110(50):E4858–E4866. 10.1073/pnas.1311818110 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Degnan BM, Degnan SM, Naganuma T, Morse DE (1993) The ets multigene family is conserved throughout the Metazoa. Nucleic Acids Res 21(15):3479–3484 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Deniz AA, Mukhopadhyay S, Lemke EA (2008) Single-molecule biophysics: at the interface of biology, physics and chemistry. J R Soc Interface 5(18):15–45. 10.1098/rsif.2007.1021 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Duboue-Dijon E, Fogarty AC, Hynes JT, Laage D (2016) Dynamical disorder in the DNA hydration Shell. J Am Chem Soc 138(24):7610–7620. 10.1021/jacs.6b02715 [DOI] [PubMed] [Google Scholar]
- Eaves JD, Loparo JJ, Fecko CJ, Roberts ST, Tokmakoff A, Geissler PL (2005) Hydrogen bonds in liquid water are broken only fleetingly. Proc Natl Acad Sci U S A 102 (37):13019–13022. 10.1073/pnas.0505125102 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Farley EK, Olson KM, Zhang W, Brandt AJ, Rokhsar DS, Levine MS (2015) Suboptimization of developmental enhancers. Science 350(6258):325–328. 10.1126/science.aac6948 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Farley EK, Olson KM, Zhang W, Rokhsar DS, Levine MS (2016) Syntax compensates for poor binding sites to encode tissue specificity of developmental enhancers. Proc Natl Acad Sci U S A 113(23):6508–6513. 10.1073/pnas.1605085113 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Fox JM, Zhao M, Fink MJ, Kang K, Whitesides GM (2018) The molecular origin of enthalpy/entropy compensation in biomolecular recognition. Annu Rev Biophys 47:223–250. 10.1146/annurev-biophys-070816-033743 [DOI] [PubMed] [Google Scholar]
- Gallo PN, Iovine JC, Nucci NV (2018) Toward comprehensive measurement of protein hydration dynamics: facilitation of NMR-based methods by reverse micelle encapsulation. Methods 148:146–153. 10.1016/j.ymeth.2018.07.008 [DOI] [PubMed] [Google Scholar]
- Gao J, Zybailov BL, Byrd AK, Griffin WC, Chib S, Mackintosh SG, Tackett AJ, Raney KD (2015) Yeast transcription co-activator Sub1 and its human homolog PC4 preferentially bind to G-quadruplex DNA. Chem Commun (Camb) 51(33):7242–7244. 10.1039/c5cc00742a [DOI] [PMC free article] [PubMed] [Google Scholar]
- Garvie CW, Pufall MA, Graves BJ, Wolberger C (2002) Structural analysis of the autoinhibition of Ets-1 and its role in protein partnerships. J Biol Chem 277 (47):45529–45536. 10.1074/jbc.M206327200 [DOI] [PubMed] [Google Scholar]
- Gaudet J, Mango SE (2002) Regulation of organogenesis by the Caenorhabditis elegans FoxA protein PHA-4. Science 295(5556):821–825. 10.1126/science.1065175 [DOI] [PubMed] [Google Scholar]
- Glasmacher E, Agrawal S, Chang AB, Murphy TL, Zeng W, Vander Lugt B, Khan AA, Ciofani M, Spooner CJ, Rutz S, Hackney J, Nurieva R, Escalante CR, Ouyang W, Littman DR, Murphy KM, Singh H (2012) A genomic regulatory element that directs assembly and function of immune-specific AP-1-IRF complexes. Science 338(6109):975–980. 10.1126/science.1228309 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Goodsell DS (1991) Inside a living cell. Trends Biochem Sci 16(6):203–206 [DOI] [PubMed] [Google Scholar]
- Grossman SR, Engreitz J, Ray JP, Nguyen TH, Hacohen N, Lander ES (2018) Positional specificity of different transcription factor classes within enhancers. Proc Natl Acad Sci U S A 115(30): E7222–E7230. 10.1073/pnas.1804663115 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Guo X, Bulyk ML, Hartemink AJ (2012) Intrinsic disorder within and flanking the DNA-binding domains of human transcription factors. Pac Symp Biocomput:104–115. 10.1142/9789814366496_0011 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Handberg-Thorsager M, Gutierrez-Mazariegos J, Arold ST, Kumar Nadendla E, Bertucci PY, Germain P, Tomancak P, Pierzchalski K, Jones JW, Albalat R, Kane MA, Bourguet W, Laudet V, Arendt D, Schubert M (2018) The ancestral retinoic acid receptor was a low-affinity sensor triggering neuronal differentiation. Sci Adv 4(2):eaao1261. 10.1126/sciadv.aao1261 [DOI] [PMC free article] [PubMed] [Google Scholar]
- He HH, Meyer CA, Shin H, Bailey ST, Wei G, Wang Q, Zhang Y, Xu K, Ni M, Lupien M, Mieczkowski P, Lieb JD, Zhao K, Brown M, Liu XS (2010) Nucleosome dynamics define transcriptional enhancers. Nat Genet 42(4):343–347. 10.1038/ng.545 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Herr W, Cleary MA (1995) The POU domain: versatility in transcriptional regulation by a flexible two-in-one DNA-binding domain. Genes Dev 9(14):1679–1693. 10.1101/gad.9.14.1679 [DOI] [PubMed] [Google Scholar]
- Hosokawa H, Ungerback J, Wang X, Matsumoto M, Nakayama KI, Cohen SM, Tanaka T, Rothenberg EV (2018) Transcription factor PU.1 represses and activates gene expression in early T cells by redirecting partner transcription factor binding. Immunity 49 (4):782. 10.1016/j.immuni.2018.09.019 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Huang K, Xhani S, Albrecht AV, Ha VLT, Esaki S, Poon GMK (2019) Mechanism of cognate sequence discrimination by the ETS-family transcription factor ETS-1. J Biol Chem 294(25):9666–9678. 10.1074/jbc.RA119.007866 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Hudson WH, Youn C, Ortlund EA (2013) The structural basis of direct glucocorticoid-mediated transrepression. Nat Struct Mol Biol 20(1):53–58. 10.1038/nsmb.2456 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Ingraham HA, Flynn SE, Voss JW, Albert VR, Kapiloff MS, Wilson L, Rosenfeld MG (1990) The POU-specific domain of Pit-1 is essential for sequence-specific, high affinity DNA binding and DNA-dependent Pit-1-Pit-1 interactions. Cell 61 (6):1021–1033. 10.1016/0092-8674(90)90067-o [DOI] [PubMed] [Google Scholar]
- Ishimoto C, Tanaka T (1977) Critical behavior of a binary mixture of protein and salt water. Phys Rev Lett 39 (8):474–477. 10.1103/PhysRevLett.39.474 [DOI] [PubMed] [Google Scholar]
- Iwata A, Durai V, Tussiwand R, Briseño CG, Wu X, Grajales-Reyes GE, Egawa T, Murphy TL, Murphy KM (2017) Quality of TCR signaling determined by differential affinities of enhancers for the composite BATF–IRF4 transcription factor complex. Nat Immunol 18(5):563–572. 10.1038/ni.3714 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Jacobson EM, Li P, Leon-del-Rio A, Rosenfeld MG, Aggarwal AK (1997) Structure of Pit-1 POU domain bound to DNA as a dimer: unexpected arrangement and flexibility. Genes Dev 11(2):198–212. 10.1101/gad.11.2.198 [DOI] [PubMed] [Google Scholar]
- Jiang J, Levine M (1993) Binding affinities and cooperative interactions with bHLH activators delimit threshold responses to the dorsal gradient morphogen. Cell 72(5):741–752 [DOI] [PubMed] [Google Scholar]
- Jolma A, Yin Y, Nitta KR, Dave K, Popov A, Taipale M, Enge M, Kivioja T, Morgunova E, Taipale J (2015) DNA-dependent formation of transcription factor pairs alters their binding specificity. Nature 527 (7578):384–388. 10.1038/nature15518 [DOI] [PubMed] [Google Scholar]
- Joyce AP, Zhang C, Bradley P, Havranek JJ (2015) Structure-based modeling of protein: DNA specificity. Brief Funct Genomics 14(1):39–49. 10.1093/bfgp/elu044 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Kalodimos CG, Bonvin AM, Salinas RK, Wechselberger R, Boelens R, Kaptein R (2002) Plasticity in protein-DNA recognition: lac repressor interacts with its natural operator 01 through alternative conformations of its DNA-binding domain. EMBO J 21(12):2866–2876. 10.1093/emboj/cdf318 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Kalodimos CG, Biris N, Bonvin AM, Levandoski MM, Guennuegues M, Boelens R, Kaptein R (2004a) Structure and flexibility adaptation in nonspecific and specific protein-DNA complexes. Science 305(5682):386–389. 10.1126/science.1097064 [DOI] [PubMed] [Google Scholar]
- Kalodimos CG, Boelens R, Kaptein R (2004b) Toward an integrated model of protein-DNA recognition as inferred from NMR studies on the Lac repressor system. Chem Rev 104(8):3567–3586. 10.1021/cr0304065 [DOI] [PubMed] [Google Scholar]
- Kleckner IR, Foster MP (2011) An introduction to NMR-based approaches for measuring protein dynamics. Biochim Biophys Acta 1814(8):942–968. 10.1016/j.bbapap.2010.10.012 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Kribelbauer JF, Rastogi C, Bussemaker HJ, Mann RS (2019) Low-affinity binding sites and the transcription factor specificity paradox in eukaryotes. Annu Rev Cell Dev Biol 35:357–379. 10.1146/annurev-cellbio-100617-062719 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Krishnamoorthy V, Kannanganat S, Maienschein-Cline M, Cook SL, Chen J, Bahroos N, Sievert E, Corse E, Chong A, Sciammas R (2017) The IRF4 gene regulatory module functions as a read-write integrator to dynamically coordinate T helper cell fate. Immunity 47 (3):481–497 e487. 10.1016/j.immuni.2017.09.001 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Kyrmizi I, Hatzis P, Katrakili N, Tronche F, Gonzalez FJ, Talianidis I (2006) Plasticity and expanding complexity of the hepatic transcription factor network during liver development. Genes Dev 20(16):2293–2305. 10.1101/gad.390906 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Laage D, Hynes JT (2006) A molecular jump mechanism of water reorientation. Science 311(5762):832–835. 10.1126/science.1122154 [DOI] [PubMed] [Google Scholar]
- Laage D, Elsaesser T, Hynes JT (2017) Water dynamics in the hydration shells of biomolecules. Chem Rev 117 (16):10694–10725. 10.1021/acs.chemrev.6b00765 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Lambert SA, Jolma A, Campitelli LF, Das PK, Yin Y, Albu M, Chen X, Taipale J, Hughes TR, Weirauch MT (2018) The human transcription factors. Cell 172 (4):650–665. 10.1016/j.cell.2018.01.029 [DOI] [PubMed] [Google Scholar]
- Landsman D, Wolffe AP (1995) Common sequence and structural features in the heat-shock factor and Ets families of DNA-binding domains. Trends Biochem Sci 20(6):225–226. 10.1016/S0968-0004(00)89021-0 [DOI] [PubMed] [Google Scholar]
- Lawson CL, Swigon D, Murakami KS, Darst SA, Berman HM, Ebright RH (2004) Catabolite activator protein: DNA binding and transcription activation. Curr Opin Struct Biol 14(1):10–20. 10.1016/j.sbi.2004.01.012 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Lefstin JA, Yamamoto KR (1998) Allosteric effects of DNA on transcriptional regulators. Nature 392 (6679):885–888. 10.1038/31860 [DOI] [PubMed] [Google Scholar]
- Leung TH, Hoffmann A, Baltimore D (2004) One nucleotide in a kappaB site can determine cofactor specificity for NF-kappaB dimers. Cell 118(4):453–464. 10.1016/j.cell.2004.08.007 [DOI] [PubMed] [Google Scholar]
- Li XY, Thomas S, Sabo PJ, Eisen MB, Stamatoyannopoulos JA, Biggin MD (2011) The role of chromatin accessibility in directing the widespread, overlapping patterns of Drosophila transcription factor binding. Genome Biol 12(4):ARTN R34671186/gb-2011–12-4-r34. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Liang H, Olejniczak ET, Mao X, Nettesheim DG, Yu L, Thompson CB, Fesik SW (1994) The secondary structure of the ets domain of human Fli-1 resembles that of the helix-turn-helix DNA-binding motif of the Escherichia coli catabolite gene activator protein. Proc Natl Acad Sci U S A 91(24):11655–11659 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Liu J, Perumal NB, Oldfield CJ, Su EW, Uversky VN, Dunker AK (2006) Intrinsic disorder in transcription factors. Biochemistry 45(22):6873–6888. 10.1021/bi0602718 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Liu CF, Brandt GS, Hoang QQ, Naumova N, Lazarevic V, Hwang ES, Dekker J, Glimcher LH, Ringe D, Petsko GA (2016a) Crystal structure of the DNA binding domain of the transcription factor T-bet suggests simultaneous recognition of distant genome sites. Proc Natl Acad Sci U S A 113(43):E6572–E6581. 10.1073/pnas.1613914113 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Liu X, Speckhard DC, Shepherd TR, Sun YJ, Hengel SR, Yu L, Fowler CA, Gakhar L, Fuentes EJ (2016b) Distinct roles for conformational dynamics in protein-ligand interactions. Structure 24(12):2053–2066. 10.1016/j.str.2016.08.019 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Luisi BF, Xu WX, Otwinowski Z, Freedman LP, Yamamoto KR, Sigler PB (1991) Crystallographic analysis of the interaction of the glucocorticoid receptor with DNA. Nature 352(6335):497–505. 10.1038/352497a0 [DOI] [PubMed] [Google Scholar]
- Lynch TW, Sligar SG (2002) Experimental and theoretical high pressure strategies for investigating protein-nucleic acid assemblies. Biochim Biophys Acta 1595 (1–2):277–282. 10.1016/s0167-4838(01)00350-8 [DOI] [PubMed] [Google Scholar]
- Macgregor RB (2002) The interactions of nucleic acids at elevated hydrostatic pressure. Biochim Biophys Acta Protein Struct Mol Enzymol 1595(1–2):266–276. 10.1016/s0167-4838(01)00349-1 [DOI] [PubMed] [Google Scholar]
- Magnani L, Eeckhoute J, Lupien M (2011) Pioneer factors: directing transcriptional regulators within the chromatin environment. Trends Genet 27 (11):465–474. 10.1016/j.tig.2011.07.002 [DOI] [PubMed] [Google Scholar]
- Meijsing SH, Pufall MA, So AY, Bates DL, Chen L, Yamamoto KR (2009) DNA binding site sequence directs glucocorticoid receptor structure and activity. Science 324(5925):407–410. 10.1126/science.1164265 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Merino E, Jensen RA, Yanofsky C (2008) Evolution of bacterial trp operons and their regulation. Curr Opin Microbiol 11(2):78–86. 10.1016/j.mib.2008.02.005 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Minezaki Y, Homma K, Kinjo AR, Nishikawa K (2006) Human transcription factors contain a high fraction of intrinsically disordered regions essential for transcriptional regulation. J Mol Biol 359(4):1137–1149. 10.1016/j.jmb.2006.04.016 [DOI] [PubMed] [Google Scholar]
- Mohaghegh N, Bray D, Keenan J, Penvose A, Andrilenas KK, Ramlall V, Siggers T (2019) NextPBM: a platform to study cell-specific transcription factor binding and cooperativity. Nucleic Acids Res 47(6):e31. 10.1093/nar/gkz020 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Monod J, Wyman J, Changeux JP (1965) On the nature of allosteric transitions: a plausible model. J Mol Biol 12:88–118. 10.1016/s0022-2836(65)80285-6 [DOI] [PubMed] [Google Scholar]
- Nakasako M (2004) Water-protein interactions from high-resolution protein crystallography. Philos Trans R Soc Lond Ser B Biol Sci 359(1448):1191–1204; discussion 1204–1196. 10.1098/rstb.2004.1498 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Nussinov R, Ma B, Tsai CJ (2014) Multiple conformational selection and induced fit events take place in allosteric propagation. Biophys Chem 186:22–30. 10.1016/j.bpc.2013.10.002 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Orkin SH, Zon LI (2008) Hematopoiesis: an evolving paradigm for stem cell biology. Cell 132(4):631–644. 10.1016/j.cell.2008.01.025 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Pan Y, Tsai CJ, Ma B, Nussinov R (2009) How do transcription factors select specific binding sites in the genome? Nat Struct Mol Biol 16(11):1118–1120. 10.1038/nsmb1109-1118 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Pan Y, Tsai CJ, Ma B, Nussinov R (2010) Mechanisms of transcription factor selectivity. Trends Genet 26 (2):75–83. 10.1016/j.tig.2009.12.003 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Pan A, Biswas T, Rakshit AK, Moulik SP (2015) Enthalpy-Entropy Compensation (EEC) effect: a revisit. J Phys Chem B 119(52):15876–15884. 10.1021/acs.jpcb.5b09925 [DOI] [PubMed] [Google Scholar]
- Pan AC, Jacobson D, Yatsenko K, Sritharan D, Weinreich TM, Shaw DE (2019) Atomic-level characterization of protein-protein association. Proc Natl Acad Sci U S A 116(10):4244–4249. 10.1073/pnas.1815431116 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Panne D (2008) The enhanceosome. Curr Opin Struct Biol 18(2):236–242. 10.1016/j.sbi.2007.12.002 [DOI] [PubMed] [Google Scholar]
- Panne D, Maniatis T, Harrison SC (2007) An atomic model of the interferon-beta enhanceosome. Cell 129 (6):1111–1123. 10.1016/j.cell.2007.05.019 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Parsegian VA, Rau DC (1984) Water near intracellular surfaces. J Cell Biol 99(1 Pt 2):196s–200s. 10.1083/jcb.99.1.196s [DOI] [PMC free article] [PubMed] [Google Scholar]
- Parsegian VA, Rand RP, Rau DC (1995) Macromolecules and water: probing with osmotic stress. Methods Enzymol 259:43–94 [DOI] [PubMed] [Google Scholar]
- Parsonnet NV, Lammer NC, Holmes ZE, Batey RT, Wuttke DS (2019) The glucocorticoid receptor DNA-binding domain recognizes RNA hairpin structures with high affinity. Nucleic Acids Res 47 (15):8180–8192. 10.1093/nar/gkz486 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Persson F, Soderhjelm P, Halle B (2018) The spatial range of protein hydration. J Chem Phys 148(21):215104. 10.1063/1.5031005 [DOI] [PubMed] [Google Scholar]
- Popovych N, Sun S, Ebright RH, Kalodimos CG (2006) Dynamically driven protein allostery. Nat Struct Mol Biol 13(9):831–838. 10.1038/nsmb1132 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Preger-Ben Noon E, Sabaris G, Ortiz DM, Sager J, Liebowitz A, Stern DL, Frankel N (2018) Comprehensive analysis of a cis-regulatory region reveals pleiotropy in enhancer function. Cell Rep 22(11):3021–3031. 10.1016/j.celrep.2018.02.073 [DOI] [PubMed] [Google Scholar]
- Purslow JA, Khatiwada B, Bayro MJ, Venditti V (2020) NMR methods for structural characterization of protein-protein complexes. Front Mol Biosci 7:9. 10.3389/fmolb.2020.00009 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Ramos AI, Barolo S (2013) Low-affinity transcription factor binding sites shape morphogen responses and enhancer evolution. Philos Trans R Soc Lond Ser B Biol Sci 368(1632):20130018. 10.1098/rstb.2013.0018 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Rand RP, Parsegian VA, Rau DC (2000) Intracellular osmotic action. Cell Mol Life Sci 57(7):1018–1032. 10.1007/PL00000742 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Rastogi C, Rube HT, Kribelbauer JF, Crocker J, Loker RE, Martini GD, Laptenko O, Freed-Pastor WA, Prives C, Stern DL, Mann RS, Bussemaker HJ (2018) Accurate and sensitive quantification of protein-DNA binding affinity. Proc Natl Acad Sci U S A 115(16):E3692–E3701. 10.1073/pnas.1714376115 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Remenyi A, Tomilin A, Pohl E, Lins K, Philippsen A, Reinbold R, Scholer HR, Wilmanns M (2001) Differential dimer activities of the transcription factor Oct-1 by DNA-induced interface swapping. Mol Cell 8 (3):569–580. 10.1016/s1097-2765(01)00336-7 [DOI] [PubMed] [Google Scholar]
- Remenyi A, Scholer HR, Wilmanns M (2004) Combinatorial control of gene expression. Nat Struct Mol Biol 11(9):812–815. 10.1038/nsmb820 [DOI] [PubMed] [Google Scholar]
- Rhine K, Vidaurre V, Myong S (2020) RNA droplets. Annu Rev Biophys 49:247–265. 10.1146/annurev-biophys-052118-115508 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Rohs R, Jin X, West SM, Joshi R, Honig B, Mann RS (2010) Origins of specificity in protein-DNA recognition. Annu Rev Biochem 79:233–269. 10.1146/annurev-biochem-060408-091030 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Royer CA, Winter R (2009) Volumetric properties of proteins and the role of solvent in conformational dynamics. In: Kuwajima K, Goto Y, Hirata F, Kataoka M, Terazima M (eds) Water and biomolecules. Biological and medical physics, biomedical engineering. Springer, Berlin/Heidelberg, pp 173–186. 10.1007/978-3-540-88787-4_9 [DOI] [Google Scholar]
- Rubin MM, Changeux JP (1966) On the nature of allosteric transitions: implications of non-exclusive ligand binding. J Mol Biol 21(2):265–274. 10.1016/0022-2836(66)90097-0 [DOI] [PubMed] [Google Scholar]
- Santoro N, Johansson N, Thiele DJ (1998) Heat shock element architecture is an important determinant in the temperature and transactivation domain requirements for heat shock transcription factor. Mol Cell Biol 18(11):6340–6352. 10.1128/mcb.18.11.6340 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Sauer RT, Yocum RR, Doolittle RF, Lewis M, Pabo CO (1982) Homology among DNA-binding proteins suggests use of a conserved super-secondary structure. Nature 298(5873):447–451. 10.1038/298447a0 [DOI] [PubMed] [Google Scholar]
- Schmitz JF, Zimmer F, Bornberg-Bauer E (2016) Mechanisms of transcription factor evolution in Metazoa. Nucleic Acids Res 44(13):6287–6297. 10.1093/nar/gkw492 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Scully KM, Jacobson EM, Jepsen K, Lunyak V, Viadiu H, Carriere C, Rose DW, Hooshmand F, Aggarwal AK, Rosenfeld MG (2000) Allosteric effects of Pit-1 DNA sites on long-term repression in cell type specification. Science 290(5494):1127–1131. 10.1126/science.290.5494.1127 [DOI] [PubMed] [Google Scholar]
- Sebe-Pedros A, Ariza-Cosano A, Weirauch MT, Leininger S, Yang A, Torruella G, Adamski M, Adamska M, Hughes TR, Gomez-Skarmeta JL, Ruiz-Trillo I (2013) Early evolution of the T-box transcription factor family. Proc Natl Acad Sci U S A 110 (40):16050–16055. 10.1073/pnas.1309748110 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Sikorska N, Sexton T (2020) Defining functionally relevant spatial chromatin domains: it is a TAD complicated. J Mol Biol 432(3):653–664. 10.1016/j.jmb.2019.12.006 [DOI] [PubMed] [Google Scholar]
- Silva JL, Foguel D, Royer CA (2001) Pressure provides new insights into protein folding, dynamics and structure. Trends Biochem Sci 26(10):612–618. 10.1016/s0968-0004(01)01949-1 [DOI] [PubMed] [Google Scholar]
- Singh GP, Dash D (2007) Intrinsic disorder in yeast transcriptional regulatory network. Proteins 68 (3):602–605. 10.1002/prot.21497 [DOI] [PubMed] [Google Scholar]
- Slattery M, Zhou T, Yang L, Dantas Machado AC, Gordan R, Rohs R (2014) Absence of a simple code: how transcription factors read the genome. Trends Biochem Sci 39(9):381–399. 10.1016/j.tibs.2014.07.002 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Soufi A, Garcia MF, Jaroszewicz A, Osman N, Pellegrini M, Zaret KS (2015) Pioneer transcription factors target partial DNA motifs on nucleosomes to initiate reprogramming. Cell 161(3):555–568. 10.1016/j.cell.2015.03.017 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Spolar RS, Record MT Jr (1994) Coupling of local folding to site-specific binding of proteins to DNA. Science 263(5148):777–784 [DOI] [PubMed] [Google Scholar]
- Sturm RA, Herr W (1988) The POU domain is a bipartite DNA-binding structure. Nature 336(6199):601–604. 10.1038/336601a0 [DOI] [PubMed] [Google Scholar]
- Szabo Q, Bantignies F, Cavalli G (2019) Principles of genome folding into topologically associating domains. Sci Adv 5(4):eaaw1668. 10.1126/sciadv.aaw1668 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Thomas-Chollier M, Watson LC, Cooper SB, Pufall MA, Liu JS, Borzym K, Vingron M, Yamamoto KR, Meijsing SH (2013) A naturally occurring insertion of a single amino acid rewires transcriptional regulation by glucocorticoid receptor isoforms. Proc Natl Acad Sci U S A 110(44):17826–17831. 10.1073/pnas.1316235110 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Tzeng SR, Kalodimos CG (2012) Protein activity regulation by conformational entropy. Nature 488 (7410):236–240. 10.1038/nature11271 [DOI] [PubMed] [Google Scholar]
- Uversky VN (2017) Intrinsically disordered proteins in overcrowded milieu: membrane-less organelles, phase separation, and intrinsic disorder. Curr Opin Struct Biol 44:18–30. 10.1016/j.sbi.2016.10.015 [DOI] [PubMed] [Google Scholar]
- van der Lee R, Buljan M, Lang B, Weatheritt RJ, Daughdrill GW, Dunker AK, Fuxreiter M, Gough J, Gsponer J, Jones DT, Kim PM, Kriwacki RW, Oldfield CJ, Pappu RV, Tompa P, Uversky VN, Wright PE, Babu MM (2014) Classification of intrinsically disordered regions and proteins. Chem Rev 114 (13):6589–6631. 10.1021/cr400525m [DOI] [PMC free article] [PubMed] [Google Scholar]
- van Hijum SA, Medema MH, Kuipers OP (2009) Mechanisms and evolution of control logic in prokaryotic transcriptional regulation. Microbiol Mol Biol Rev 73(3):481–509, Table of Contents. 10.1128/MMBR.00037-08 [DOI] [PMC free article] [PubMed] [Google Scholar]
- van Leeuwen HC, Strating MJ, Rensen M, de Laat W, van der Vliet PC (1997) Linker length and composition influence the flexibility of Oct-1 DNA binding. EMBO J 16(8):2043–2053. 10.1093/emboj/16.8.2043 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Vandevyver S, Dejager L, Libert C (2014) Comprehensive overview of the structure and regulation of the glucocorticoid receptor. Endocr Rev 35(4):671–693. 10.1210/er.2014-1010 [DOI] [PubMed] [Google Scholar]
- Vaquerizas JM, Kummerfeld SK, Teichmann SA, Luscombe NM (2009) A census of human transcription factors: function, expression and evolution. Nat Rev Genet 10(4):252–263. 10.1038/nrg2538 [DOI] [PubMed] [Google Scholar]
- Villa E, Balaeff A, Schulten K (2005) Structural dynamics of the lac repressor-DNA complex revealed by a multiscale simulation. Proc Natl Acad Sci U S A 102 (19):6783–6788. 10.1073/pnas.0409387102 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Vogt AD, Di Cera E (2012) Conformational selection or induced fit? A critical appraisal of the kinetic mechanism. Biochemistry 51(30):5894–5902. 10.1021/bi3006913 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Vogt AD, Di Cera E (2013) Conformational selection is a dominant mechanism of ligand binding. Biochemistry 52(34):5723–5729. 10.1021/bi400929b [DOI] [PMC free article] [PubMed] [Google Scholar]
- Vogt AD, Pozzi N, Chen Z, Di Cera E (2014) Essential role of conformational selection in ligand binding. Biophys Chem 186:13–21. 10.1016/j.bpc.2013.09.003 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Wang Z, Zhang Q (2009) Genome-wide identification and evolutionary analysis of the animal specific ETS transcription factor family. Evol Bioinformatics Online 5:119–131 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Wang S, Linde MH, Munde M, Carvalho VD, Wilson WD, Poon GM (2014) Mechanistic heterogeneity in site recognition by the structurally homologous DNA-binding domains of the ETS family transcription factors Ets-1 and PU.1. J Biol Chem 289 (31):21605–21616. 10.1074/jbc.M114.575340 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Wang J, Malecka A, Trøen G, Delabie J (2015) Comprehensive genome-wide transcription factor analysis reveals that a combination of high affinity and low affinity DNA binding is needed for human gene regulation. BMC Genomics 16(7):S12. 10.1186/1471-2164-16-S7-S12 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Wei GH, Badis G, Berger MF, Kivioja T, Palin K, Enge M, Bonke M, Jolma A, Varjosalo M, Gehrke AR, Yan J, Talukder S, Turunen M, Taipale M, Stunnenberg HG, Ukkonen E, Hughes TR, Bulyk ML, Taipale J (2010) Genome-wide analysis of ETS-family DNA-binding in vitro and in vivo. EMBO J 29(13):2147–2160. 10.1038/emboj.2010.106 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Weikum ER, Knuesel MT, Ortlund EA, Yamamoto KR (2017) Glucocorticoid receptor control of transcription: precision and plasticity via allostery. Nat Rev Mol Cell Biol 18(3):159–174. 10.1038/nrm.2016.152 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Weingarten-Gabbay S, Segal E (2014) The grammar of transcriptional regulation. Hum Genet 133 (6):701–711. 10.1007/s00439-013-1413-1 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Wheeler RJ, Hyman AA (2018) Controlling compartmentalization by non-membrane-bound organelles. Philos Trans R Soc Lond Ser B Biol Sci 373(1747). 10.1098/rstb.2017.0193 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Willenbrock H, Ussery DW (2004) Chromatin architecture and gene expression in Escherichia coli. Genome Biol 5(12):252. 10.1186/gb-2004-5-12-252 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Witzany G (2017) Two genetic codes: repetitive syntax for active non-coding RNAs; non-repetitive syntax for the DNA archives. Commun Integr Biol 10(2):e1297352. 10.1080/19420889.2017.1297352 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Wlodawer A, Minor W, Dauter Z, Jaskolski M (2008) Protein crystallography for non-crystallographers, or how to get the best (but not more) from published macromolecular structures. FEBS J 275(1):1–21. 10.1111/j.1742-4658.2007.06178.x [DOI] [PMC free article] [PubMed] [Google Scholar]
- Xhani S, Esaki S, Huang K, Erlitzki N, Poon GM (2017) Distinct roles for interfacial hydration in site-specific DNA recognition by ETS-family transcription factors. J Phys Chem B 121(13):2748–2758. 10.1021/acs.jpcb.7b00325 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Xhani S, Lee S, Kim HM, Wang S, Esaki S, Ha VLT, Khanezarrin M, Fernandez GL, Albrecht AV, Aramini JM, Germann MW, Poon GMK (2020) Intrinsic disorder controls two functionally distinct dimers of the master transcription factor PU.1. Sci Adv 6(8): eaay3178. 10.1126/sciadv.aay3178 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Zhou Z, Giles KE, Felsenfeld G (2019) DNA.RNA triple helix formation can function as a cis-acting regulatory mechanism at the human beta-globin locus. Proc Natl Acad Sci U S A 116(13):6130–6139. 10.1073/pnas.1900107116 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Zhu L, Brangwynne CP (2015) Nuclear bodies: the emerging biophysics of nucleoplasmic phases. Curr Opin Cell Biol 34:23–30. 10.1016/j.ceb.2015.04.003 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Zuo Z, Stormo GD (2014) High-resolution specificity from DNA sequencing highlights alternative modes of Lac repressor binding. Genetics 198 (3):1329–1343. 10.1534/genetics.114.170100 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Zuo Z, Chang Y, Stormo GD (2015) A quantitative understanding of lac repressor’s binding specificity and flexibility. Quant Biol 3(2):69–80. 10.1007/s40484-015-0044-z [DOI] [PMC free article] [PubMed] [Google Scholar]







