Skip to main content
Nucleic Acids Research logoLink to Nucleic Acids Research
. 2011 May 20;39(16):6854–6863. doi: 10.1093/nar/gkr272

Control of gene expression by modulated self-assembly

Jose M G Vilar 1,2, Leonor Saiz 3,*
PMCID: PMC3167614  PMID: 21602261

Abstract

Numerous transcription factors self-assemble into different order oligomeric species in a way that is actively regulated by the cell. Until now, no general functional role has been identified for this widespread process. Here, we capture the effects of modulated self-assembly in gene expression with a novel quantitative framework. We show that this mechanism provides precision and flexibility, two seemingly antagonistic properties, to the sensing of diverse cellular signals by systems that share common elements present in transcription factors like p53, NF-κB, STATs, Oct and RXR. Applied to the nuclear hormone receptor RXR, this framework accurately reproduces a broad range of classical, previously unexplained, sets of gene expression data and corroborates the existence of a precise functional regime with flexible properties that can be controlled both at a genome-wide scale and at the individual promoter level.

INTRODUCTION

A recurrent theme in gene regulation is the self-assembly of transcription factors (TF) into coexisting populations of dimers, tetramers and other higher order oligomers that can bind simultaneously single and multiple DNA sites. This behavior has been observed explicitly in the tumor suppressor p53 (1), the nuclear factor κB (NF-κB) (2,3), the signal transducers and activators of transcription (STATs) (4), the octamer-binding proteins (Oct) (5,6), and the retinoid nuclear hormone receptor (7) (Table 1). In these systems, the properties of self-assembly, and the partitioning into low and high order oligomeric species, are strongly regulated and modulated by several types of signals, such as ligand binding (8), protein binding (9,10), acetylation (11) and phosphorylation (6,12). The general implications of this modulation, however, are not clear.

Table 1.

MSA of TFs

TFa Self-assembly modulation Oligomerization statesb DNA binding
RXR Ligand-binding (8) Monomer, dimer*, tetramer* (7) 1 site, 4 consecutive half-sites (15), 2 separated sites (16)
p53 Protein-binding (9,10), acetylation (11) Monomer, dimer, tetramer*, stacked-tetramers* (1) 1 site, 2 separated half-sites, 2 separated sites (17,18)
NF-κB Protein-mediated Dimer*, tetramer* (2,3) 2 separated sites (19)
STAT Phosphorylation (12) Dimer*, tetramer* (4) Tandem sites (20,21)
Oct Phosphorylation (6) Monomer*, dimer* (5), tetramer* (6) 1 site, 2 separated sites (6)

aFor each TF, the table shows the experimentally observed mechanism of the self-assembly modulation process, the oligomerization states involved, and the corresponding arrangement of DNA binding sites at the promoter.

bThe symbol * indicates the oligomeric species that have been observed to substantially bind DNA.

At the level of single DNA sites, it is well established that the effects of TF are finely determined by their concentration and cognate DNA sequences (13). Processes based on interactions with different molecules and post-transcriptional modifications are assumed to affect mainly the DNA binding properties of the TFs or their ability to recruit coregulators. This idea is entrenched in the field of gene regulation and is systematically used as a guiding principle in the ongoing development of molecular therapies against diverse diseases (14). But TFs rarely act through just a single binding site (6,15–21) (Table 1). Modulated self-assembly (MSA) provides a key mechanism for controlling the ability of TFs to bind two or more DNA sites simultaneously.

To determine the common wide-ranging effects of MSA, we have developed a general quantitative framework that accurately links MSA with control of gene expression (Figure 1). It focuses on the general aspects of the core control mechanism shared by the wide variety of regulatory systems where MSA is present, which include TF self-assembly and its modulation, binding of the TF oligomers to DNA, and the resulting transcriptional responses. This quantitative framework allowed us to uncover modulation of the oligomeric states of TFs as a flexible mechanism for precise sensing of molecular signals in the presence of intracellular fluctuations. Precision ensures that the transcriptional response is consistently triggered at a given modulator signal strength irrespective of the TF concentration. Flexibility allows the precise triggering point to be changed, up to several orders of magnitude, both at the individual promoter level by changing its DNA sequence and at a genome-wide scale by changing the molecular self-assembly properties.

Figure 1.

Figure 1.

Quantitative modeling of control of gene expression by modulated self-assembly. Intracellular signals are processed through ‘Modulated self-assembly’ into populations of different oligomeric species that upon ‘DNA binding’ engage in ‘Transcriptional control’. Modulated self-assembly: the intensity of a self-assembly modulator signal Inline graphic, e.g. ligand or active kinase concentration, regulates the formation of high order oligomers by modifying (represented as a yellow spark) the low order oligomers and preventing their self-assembly into the high order species. DNA binding: the oligomeric species bound to DNA (in orange/red) are described by their free energies with the statistical weights (Inline graphic) shown for each binding state (expression in black). The parenthesized number, in blue, labels each of the 17 states and the molecular representations illustrate the binding combinations of the transcriptional regulator to the two DNA sites (site 1 and site 2). The top left box summarizes the notation. Transcriptional control: one state (state 2) can trigger response R1, in which an enhancer is positioned in the vicinity of the promoter region, and twelve states (states 6–17) can potentially trigger response R2, in which a coactivator is recruited to the promoter region. Dimers and tetramers have been drawn as compositions of the nuclear hormone receptor RXR structures from the PDB files 1BY4 (DNA binding domains bound to the two half-sites on DNA, or RXR response elements) and 1G1U (ligand binding domains).

This methodology identified a core set of features needed to implement control of transcription by MSA that are present in a wide variety of structurally different systems (Table 1). As an exemplar of these systems, we have considered explicitly the nuclear hormone receptor RXR. In this case, the quantitative framework accurately reproduced, in some instances even without free parameters, a broad range of classical, previously unexplained gene expression experimental data and demonstrated how flexible precise control of gene expression can be achieved directly at the molecular level through modulation of the oligomerization state of transcriptional regulators.

MATERIALS AND METHODS

The first step in the signaling cascade orchestrated by MSA is the regulation of the relative abundance of the oligomerization states of the TF (Figure 1). The self-assembly modulator, such as a ligand that binds to a TF or a kinase that phosphorylates the TF, affects the low-order oligomers to promote or prevent their self-assembly. We consider explicitly tetramers, Inline graphic, dimers, Inline graphic and non-tetramerizing dimers, Inline graphic, as relevant high and low order oligomeric species. Other oligomerization pairs, such as octamer–tetramers or dimer–monomers, are mathematically equivalent to tetramer–dimers.

We quantitate the effects of self-assembly modulation through the modulator function Inline graphic, which describes, in terms of concentrations, the partitioning into the tetramerizing and non-tetramerizing dimers by the self-assembly modulator, Inline graphic. This process affects dimer and tetramer concentrations, which are related to each other through Inline graphic, where Inline graphic is the tetramer–dimer dissociation constant.

The precise form of the modulator function is given by the specific mode of action of the modulator. An explicit example is Inline graphic for a ligand Inline graphic that upon binding to the dimer Inline graphic, with dissociation constant Inline graphic, renders it unable to tetramerize in the form Inline graphic. Another relevant, mechanistically different situation corresponds to Inline graphic for phosphorylation in the linear regime of the non-tetramerizing, Inline graphic, into the tetramerizing, Inline graphic, dimer species. In this case, Inline graphic is the concentration of active kinases and Inline graphic and Inline graphic are the phosphorylation and dephosphorylation rate constants, respectively. In general, several mechanisms can be involved at the same time in controlling the oligomerization properties. For instance, the case in which the two previous processes are combined so that the dimer has to be both free of ligand and phosphorylated to be able to tetramerize leads to a two-variable modulator function given by Inline graphicInline graphic, where Inline graphic is the ligand concentration and Inline graphic is the concentration of active kinases.

Binding of the different TF oligomers to the DNA sites mediates the transcriptional effects of the self-assembly modulator (Figure 1). Typically, tetramers and both types of dimers bind single DNA sites in a very similar way, with free energies Inline graphic and Inline graphic, for site 1 and 2, respectively. These quantities are related to the corresponding dissociation constants through Inline graphic and Inline graphic. Tetramers, in addition, can bind two sites simultaneously because they have two DNA-binding domains, one from each of its two constituent dimers, which contribute with Inline graphic and Inline graphic to the free energy. The simultaneous binding of two domains is typically accompanied by conformational changes, e.g. twisting and bending, in both the tetramer and DNA (22,23), which contributes with an additional conformational term, Inline graphic, to the free energy. Therefore, the standard free energy of the state with the tetramer bound to two sites is given by Inline graphic. This conformational contribution has been studied in detail in the case of DNA looping by prokaryotic TFs and is dependent, among others, on the TF and DNA flexibility, the relative position of the DNA-binding sites, and the DNA supercoiling state (22,23).

We use statistical thermodynamics to quantitatively describe binding to DNA in terms of free energies and concentrations of the different oligomeric species (24–26). The key quantity is the statistical weight, or Boltzmann factor, defined as Inline graphic, which relates the relative probability of the binding state Inline graphic with its standard free energy Inline graphic. The exponents Inline graphic, Inline graphic and Inline graphic correspond to the number of tetramers, dimers and non-tetramerizing dimers in the state Inline graphic, respectively. The factor Inline graphic is the gas constant, Inline graphic, times the absolute temperature, Inline graphic. The probability of a given group of binding states Inline graphic, Inline graphic, is obtained by adding the statistical weights of its states and normalizing by the sum for all the possible states.

For a system with two binding sites, there are 17 binding states (Figure 1). These states are those with both sites empty; one occupied by a dimer or a tetramer; two sites occupied by two dimers, by two tetramers or a dimer and tetramer; and two sites occupied simultaneously by a single tetramer. In the case of states with dimers, one has to take into account that a dimer can either be in the form that allows or prevents tetramerization. In general, each binding state includes a constellation of molecular substates with different DNA conformations. For instance, the state with both sites empty can include a bent DNA conformation, as in the case when the two sites are occupied simultaneously by a single tetramer, but the lack of a tetramer to stabilize the conformation makes this conformation highly unlikely. This type of effects has been described in detail for other TFs that bind two DNA sites simultaneously, such as the lac repressor (27).

There is also the possibility that oligomerization is so weak in solution that it is only observed on DNA. This effect can be put in quantitative terms with our framework by considering that the state with the tetramer bound simultaneously to two DNA sites (Figure 1) can also be described as two interacting dimers that bind cooperatively to DNA. The statistical weight of this state is given by Inline graphic in terms of tetramer concentration and by Inline graphic in terms of dimer concentration, which can be rewritten as Inline graphic with Inline graphicInline graphic. Thus, a very high dissociation constant that does not lead to significant tetramerization in solution is sufficient to promote tetramerization on DNA when the conformational free energy is sufficiently low. Intuitively, tetramerization is observed on DNA because binding to DNA brings the tetramerization domains close to each other and increases their local concentration.

Two differentiated types of transcriptional responses can be constructed from the binding states of the TF on DNA (Figure 1).

The first type, referred to as response R1, involves a high-order oligomer that simultaneously binds two non-adjacent DNA sites. Upon binding, the high-order oligomer loops out the intervening DNA and positions a distal enhancer in the vicinity of the promoter region to control transcription. The probability Inline graphic of the state with the tetramer bound to the two DNA sites simultaneously (Table 2) determines the effective transcription rate through the expression Inline graphic, which weights the transcription rates that the system has with, Inline graphic, and without, Inline graphic, the distal enhancer close to the promoter.

Table 2.

Probability, Pc, of the different groups of binding states

Inline graphic Statesa Full expressionb Simplified expressionc
Inline graphic 2 Inline graphic Inline graphic
Inline graphic 6, 7, 15, 16 Inline graphic Inline graphic
Inline graphic 9, 10, 12, 13 Inline graphic Inline graphic
Inline graphic 8, 11, 14, 17 Inline graphic Inline graphic

aStates involved in the group as described in Figure 1.

bThe expressions for the probabilities follow from the statistical thermodynamic approach with the free energies of each state as described in Figure 1.

cSimplified expressions for the probabilities in the functional regime.

The second type, denoted here response R2, takes advantage of the differentiated recruitment abilities of different oligomerization states. This mode of regulation applies to a coactivator that is recruited by a low-order oligomer by binding to a molecular surface that is occluded in the high-order oligomer. In this case, the effective transcription rate is given by Inline graphic. The subscripts do, od and dd of the transcription rates Inline graphic and probabilities Inline graphic refer to the group of states with dimers bound to just site 1, to just site 2 and to both sites, respectively (Table 2). Inline graphic is the transcription rate with no dimers bound, including empty sites and sites occupied by tetramers.

Responses R1 and R2 embrace the prototypical cases mediated by long and short range interactions between regulatory elements. They are controlled by the relative occupancy of DNA binding sites by the different oligomeric species. This mode of functioning differs from other systems with multiple binding sites, like the lac operon, which are controlled by the absolute occupancy of their sites by a single oligomeric species (28). For instance, IPTG, an inducer of the lac operon, does not affect the oligomerization state of the tetrameric lac repressor but prevents each of its two DNA binding domains from significantly binding their cognate sites (28,29).

RESULTS

To uncover the unique characteristics that emerge from the core structure of MSA in such a general wide variety of structurally different systems (Table 1), we focus on a functional regime that guarantees that there is response to changes in the self-assembly modulator concentration. This regime considers two properties. The first one is that the TF concentration is sufficiently high for it to significantly bind DNA. In mathematical terms, it implies Inline graphic and Inline graphic. The second one is that the tetramer concentration is sufficiently low, Inline graphic, so that they do not take over the binding completely. The reason is that for typical values of Inline graphic, tetramers bind more strongly to two DNA sites simultaneously than dimers do to a single DNA site (27,30,31).

The key implication of this regime is that the probabilities of the different groups of binding states simplify in such a way (see Table 2) that the transcriptional responses are governed by the reduced expressions

graphic file with name gkr272m1.jpg (1)

with

graphic file with name gkr272m2.jpg (2)

which show that responses R1 and R2, despite being mechanistically different, follow the same control logics. In both cases, the two-site binding of the tetramer, quantified by Inline graphic, determines the contributions of the reference and activated transcriptional states. The end result is even more remarkable because the particular form of Inline graphic imparts precision and flexibility to the transcriptional responses, two properties that are the cornerstone of natural gene expression systems but that have proved to be highly elusive because of their seemingly antagonistic character (13).

Precision ensures that the transcriptional response is consistently triggered at a given modulator signal strength irrespective of the particular TF concentration, which cancels out in the reduced equations that govern the system behavior. Flexibility, on the other hand, allows the precise triggering point to be altered, up to several orders of magnitude, both at the individual promoter level by changing its organization—Inline graphic depends on the distance between the two DNA binding sites (17,22)—and at a genome-wide scale by changing the molecular self-assembly properties—Inline graphic and Inline graphic affect the regulation of all genes in the same way.

All these results can be observed explicitly in the retinoid X receptor (RXR), an exemplar of the essential regulators that share the central features of MSA (Table 1). RXR controls a large number of genes by binding to DNA as homodimer, homotetramer or obligatory heterodimerization partner for other nuclear receptors. Nuclear retinoid receptors are highly significant because they mediate the pleiotropic effects of retinoic acid, which include cell proliferation, differentiation and embryonic development and affect the carcinogenic process in a number of organs (32).

The canonical self-assembly modulator of RXR is the hormone 9-cis-retinoic acid (9cRA), a derivative of Vitamin A, which binds each RXR subunit independently of its oligomerization state (33) and prevents dimers with their two subunits occupied from tetramerizing (8). This behavior is consistent with Inline graphic being an apo-dimer and with Inline graphic being a holo-dimer, as observed in the respective crystal structures of the dimers with no ligand bound (34) and with two ligands bound (35). The crystal structure of one tetramer with two ligands bound (36) shows that two dimers with just one ligand each can form tetramers with a structure similar to those of two apo-dimers. In addition to 9cRA, there are other ligands of RXR, as for instance, the oleic acid, docosahexaenoic acid, methoprene acid and phytanic acid (35).

These early steps in sensing 9cRA and other ligand concentrations are taken into account by the explicit form of the modulator function, which we obtain from the mass action law as

graphic file with name gkr272m3.jpg (3)

where Inline graphic and Inline graphic are the ligand–RXR dissociation constant and the ligand concentration, respectively (see Supplementary Data).

To compare with the experimental data, we normalize the fold induction, a measure of relative changes in transcriptional activity, so that its variation ranges from 0 to 1. This quantity, referred to as normalized fold induction (NFI), is defined explicitly as Inline graphic, where Inline graphic is the maximum value of the fold induction Inline graphic. In terms of the NFI, the results do not depend on parameters related to the baseline and maximum expression levels and it becomes possible to effectively compare experiments on different promoters and cell lines (see Supplementary Data). The only parameters needed to characterize the shape of the response in the functional regime are Inline graphic and Inline graphic, which have been measured experimentally, and Inline graphic, which can be inferred by adjusting its value to reproduce the experimental data.

This approach accurately describes the experimental observations (16) for the ligand 9cRA and a promoter with two non-adjacent DNA binding sites for RXR and a distal enhancer (Figure 2A). Simultaneous binding of an RXR tetramer to the two sites loops out the intervening DNA and brings the enhancer close to the promoter region (response R1). Increasing the concentration of 9cRA prevents the formation of RXR tetramers and leads to deactivation of transcription.

Figure 2.

Figure 2.

Prediction of RXR-mediated transcriptional responses to 9cRA and atRA ligands. The results of the model (lines) for the functional regime are compared to the normalized fold induction (NFI) from experimental data (symbols) for different promoters and ligands. The model uses the experimental values Inline graphic for 9cRA (52) or Inline graphic for atRA (53), and Inline graphic (7). The conformational free energy Inline graphic (shown in kcal/mol) is inferred from just the experimental data for 9cRA by minimizing the mean squared error between model and experiments and the resulting value is used subsequently for responses to atRA. (A) Response to 9cRA for a system with two separated DNA binding sites for RXR and a distal enhancer. Experimental gene expression data is taken from Figure 5b of Yasmin et al. (16), which used COS-7 cells transfected with the reporter, consisting of double RXRE and a UAS site 300-bp upstream, in a vector encoding GAL4-VP16. The NFI was computed as Inline graphic (see Supplementary Data) with Equations (2) and (3). In this case, the half-maximum response concentration, or EC50, is about 35 times higher than the 9cRA–RXR dissociation constant. (B) Responses to 9cRA for systems with contiguous DNA binding sites for RXR. The variability of the dose–response curves, including 10-fold changes in the EC50 and different slopes, is accurately captured by the model by just adjusting Inline graphic. The three different curves correspond to three different experimental systems, reported in Figure 5a of Heyman et al. (40) (top), which used S2 cells cotransfected with the expression plasmid A5C-hRXRα and the reporter plasmid ADH-CRBPII-LUC; Figure 4b of Levin et al. (41) (center), which used CV-1 cells cotransfected with the reporter CRBPII-RXRE-CAT construct and plasmid RXRα; and Figure 5b of Heyman et al. (40) (bottom), which used CV-1 cells cotransfected with the expression plasmid pRSh-RXRα and the reporter plasmid TK-CRBPII-LUC. The NFI was computed as Inline graphic (see Supplementary Data) with Equations (2) and (3). (C) Responses to atRA for systems with contiguous DNA binding sites for RXR. The three different dose–response curves correspond to the three systems of Figure 2B with the all-trans retinoic acid (atRA) as ligand of RXR instead of 9cRA. The highly variable dose–response curves are fully predicted without free parameters using the values of Inline graphic inferred in Figure 2B.

The very same approach also captures in detail the observed behavior when the two DNA binding sites are next to each other, as in the classic set of experiments that uncovered 9cRA as the cognate ligand of RXR, for different promoters and cell lines (Figure 2B). In these cases, only the dimeric forms of RXR with ligand bound can recruit a coactivator (response R2) and increasing the concentration of 9cRA results in the activation of transcription. The extent of activation is modulated by the RXR AF-1 domain and RXR phosphorylation (37–39).

This framework has the much-sought ability to fully predict, without free parameters, the responses to different ligands from the values of Inline graphic obtained in response to just a single ligand. Applied to the all-trans retinoic acid (atRA), which was tested early on as a potential cognate ligand of RXR (40,41), the approach closely recapitulates its effects on transcription for different cell types and promoters from the values of Inline graphic inferred in the responses to 9cRA (Figure 2C). This ability to fully predict responses without free parameters is especially important because it provides a direct avenue to transfer specific molecular information of the ligand–TF interaction, as described by the measured or computed parameters, across scales up to the transcriptional effects.

The high variability of the transcriptional responses, as observed in Figure 2, has been a long-standing recurrent issue in RXR gene regulation. In particular, the half-maximum response point, characterized by the EC50, ranges from just above the RXR–ligand dissociation constant up to values 30-fold higher (Table 3). Our results have identified MSA as a potential mechanism to control the EC50 at the single-gene level through the value of Inline graphic (Table 3). This promoter-dependent flexibility indicates that for these systems, the observed variability is not a random aspect of the experimental setup but the result of RXR precisely tailoring the response to each individual gene.

Table 3.

EC50 control by ligand binding strength and conformational free energy

EC50 (nM) Ligand Inline graphic (nM) Inline graphic (kcal/mol)
287.4 9cRA 8 8.03
77.8 9cRA 8 9.47
18.3 9cRA 8 10.76
14.3 9cRA 8 10.92
3403.2 atRA 350 9.47
798.7 atRA 350 10.76
626.0 atRA 350 10.92

The EC50 is defined as the ligand concentration that gives the half-maximum response.

The observed variability can be collapsed in the form of response landscapes (Figure 3), which represent the transcriptional activity as a function of the conformational free energy in addition to just the usual ligand concentration of dose–response curves. The landscapes explicitly show the ability of RXR to shape the molecular response to ligand binding in a promoter-dependent way. The response landscapes show how the EC50 increases as the conformational free energy decreases in a way that closely matches the experimental observations (Figure 3).

Figure 3.

Figure 3.

Local and global flexibility in the response landscapes. The normalized fold induction (NFI) for RXR from the model, computed as in Figure 2, is shown as a function of both the self-assembly modulator intensity (either 9cRA or atRA concentration) and the conformational free energy Inline graphic (in kcal/mol). The figures on the bottom are density-plot projections of the corresponding NFI on the top, with dark and light gray corresponding to low and high values of the NFI, respectively. The red line corresponds to NFI = 0.5 and shows the dependence of the half-maximum response concentration, or EC50, with the conformational free energy. The EC50 can be changed locally, at the single-promoter level, by changing the value of Inline graphic, or globally, at a genome-wide scale by changing Inline graphic, the strength of the ligand binding to RXR. The 3D plots show the same experimental data (symbols) as in (A) Figure 2A, (B) Figure 2B, and (C) Figure 2C along with the dose–response curves (black lines) for the corresponding values of the conformational free energy. The values of the parameters used are Inline graphic (7) and either Inline graphic for 9cRA (52) or Inline graphic for atRA (53).

To investigate the extent to which typical experimental conditions fall within the functional regime (which, as previously described, is characterized by Inline graphic, Inline graphic and Inline graphic), we considered the model for RXR in the whole-parameter space. All groups of binding states were considered explicitly without simplifications of the expressions for the corresponding probabilities (Table 2). In addition to the relevant quantities of the functional regime, the whole-parameter space includes the experimentally measured free energies of binding to DNA, the RXR dimer–monomer dissociation constant and the nuclear RXR concentration. The results (Figure 4) are virtually independent of the precise value of the total nuclear RXR protein concentration over, at least, a 10-fold range and accurately capture the diverse dose–response curves observed in the experiments, in agreement with the results for the functional regime. In all cases, the ranges of concentrations include 550 nM, the estimated RXR nuclear concentration in HL-60 cells (7). Therefore, the ability to elicit flexible and precise responses, as uncovered in the general analysis, is also present when the particularities of RXR-mediated transcriptional responses are taken into account.

Figure 4.

Figure 4.

Precision and flexibility of the transcriptional responses. The model for the whole-parameter space (see Supplementary Data) reproduces the different dose–response curves regardless of 10-fold changes in total RXR concentration. The normalized fold induction from experimental data (symbols) and the model (lines) is shown for the same systems as in (A) Figure 2A, (B) Figure 2B, and (C) Figure 2C. The results agree with the experimental data over 10-fold changes of the total RXR concentration, given by Inline graphic, which confirms the presence of a precise functional regime for typical molecular parameters. The experimental values used are Inline graphic for 9cRA (52) or Inline graphic for atRA (53); Inline graphic and Inline graphic (7). The free energies of binding are obtained through their corresponding dissociation constants as Inline graphic and Inline graphic with Inline graphic (54). Inline graphic, Inline graphic and Inline graphic are the dimer–monomer, RXR-site 1 and RXR-site 2 dissociation constants, respectively. The values of the conformational free energies were inferred in Figure 2. The 10-fold range of total RXR concentration was chosen for each specific system so that it best reproduces the experimental data in Figures 4A and 4B. The explicit value of Inline graphic is shown on the top of each graph (in nM units). The results of Figure 4C are predictions without free parameters.

DISCUSSION

Cellular processes rely on intricate molecular mechanisms to function in extraordinarily diverse intra- and extra-cellular environments. Eukaryotic gene expression, in particular, has shown to be exceedingly complex (42–44). Just the core of the transcriptional machinery itself involves a wide variety of components with oscillatory patterns of macromolecular assembly and phosphorylation (45). On top of the constitutive processes, there are many other molecular interactions that provide regulation, enhancing or reducing gene expression and adjusting to changing cellular conditions (46). To understand how these different levels of molecular complexity contribute to the observed behavior, one needs the right approaches (47,48).

The quantitative framework we have developed provides an efficient avenue to connect the molecular properties of MSA with its effects in the control of gene expression. This framework allowed us to uncover unique properties of control of gene expression by MSA that lead to a flexible mechanism for precise sensing of diverse types of self-assembly modulation signals, irrespective of changes in TF concentration. Application of this methodology to the nuclear hormone receptor RXR accurately describes the experimentally observed transcriptional responses for both enhancers (response R1) and coactivators (response R2) from just the molecular properties of the components (Figures 2A and 2B), and successfully predicts the observed behavior without free parameters (Figure 2C). A detailed analysis of the whole-parameter space reveals that regulation by RXR is functioning in a precise regime, with minimal dependence on RXR nuclear concentration (Figure 4), in which the responses are highly diverse as a result of the inherent flexibility that accompanies precision in the control of gene expression by MSA (Figure 3).

The observed TF-concentration insensitivity of control of gene expression by MSA contrasts with the traditional role of RXR as obligatory heterodimerization partner for other nuclear receptors, which relies on the absolute occupancy of the cognate binding sites by the heterodimer. In the case of RXRα:PPARγ regulation of adipogenesis, however, it has been observed that several promoters are controlled rather by the relative occupancy between RXRα:PPARγ heterodimers and other RXRα heterodimers or homo-oligomers (49). Our framework provides a starting point to consider these more complex situations by coupling MSA with hetero-oligomerization and to combine these extensions with recent bioinformatics methods (50,51) to make accurate predictions on gene expression based on the binding profiles observed in the experimental data (49).

The combined presence of flexibility and precision in the control of gene expression by MSA, as explicitly shown for RXR, allows a single TF to simultaneously regulate multiple genes with promoter-tailored dose–response curves that consistently maintain their diverse shapes for a broad range of the TF concentration changes. These features are especially important because essential TFs like p53, NF-κB, STATs, Oct and RXR, each of which have all the core elements that form the backbone of control of gene expression by MSA, regulate multiple genes that engage in processes as diverse as cancer, inflammation, autoimmune diseases and cellular differentiation. These results indicate that the prospects for devising more effective molecular therapies for systems controlled by MSA will greatly benefit from shifting potential intervention points from those that affect absolute concentrations and single-site binding to those that can tackle concentration ratios and promoter properties.

SUPPLEMENTARY DATA

Supplementary Data is available from NAR Online.

FUNDING

MICINN (grant FIS2009-10352, to J.M.G.V.); the University of California, Davis (to L.S.). Funding for open access charge: University of California, Davis.

Conflict of interest statement. None declared.

Supplementary Material

Supplementary Data

REFERENCES

  • 1.Wang P, Reed M, Wang Y, Mayr G, Stenger JE, Anderson ME, Schwedes JF, Tegtmeyer P. p53 domains: structure, oligomerization, and transformation. Mol. Cell. Biol. 1994;14:5182–5191. doi: 10.1128/mcb.14.8.5182. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 2.Phelps CB, Sengchanthalangsy LL, Malek S, Ghosh G. Mechanism of kappa B DNA binding by Rel/NF-kappa B dimers. J. Biol. Chem. 2000;275:24392–24399. doi: 10.1074/jbc.M003784200. [DOI] [PubMed] [Google Scholar]
  • 3.Sengchanthalangsy LL, Datta S, Huang DB, Anderson E, Braswell EH, Ghosh G. Characterization of the dimer interface of transcription factor NFkappaB p50 homodimer. J. Mol. Biol. 1999;289:1029–1040. doi: 10.1006/jmbi.1999.2823. [DOI] [PubMed] [Google Scholar]
  • 4.Zhang X, Darnell JE., Jr Functional importance of Stat3 tetramerization in activation of the alpha 2-macroglobulin gene. J. Biol. Chem. 2001;276:33576–33581. doi: 10.1074/jbc.M104978200. [DOI] [PubMed] [Google Scholar]
  • 5.Tomilin A, Remenyi A, Lins K, Bak H, Leidel S, Vriend G, Wilmanns M, Scholer HR. Synergism with the coactivator OBF-1 (OCA-B, BOB-1) is mediated by a specific POU dimer configuration. Cell. 2000;103:853–864. doi: 10.1016/s0092-8674(00)00189-6. [DOI] [PubMed] [Google Scholar]
  • 6.Kang J, Gemberling M, Nakamura M, Whitby FG, Handa H, Fairbrother WG, Tantin D. A general mechanism for transcription regulation by Oct1 and Oct4 in response to genotoxic and oxidative stress. Genes. Dev. 2009;23:208–222. doi: 10.1101/gad.1750709. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7.Kersten S, Kelleher D, Chambon P, Gronemeyer H, Noy N. Retinoid X receptor alpha forms tetramers in solution. Proc. Natl Acad. Sci. USA. 1995;92:8645–8649. doi: 10.1073/pnas.92.19.8645. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8.Chen ZP, Iyer J, Bourguet W, Held P, Mioskowski C, Lebeau L, Noy N, Chambon P, Gronemeyer H. Ligand- and DNA-induced dissociation of RXR tetramers. J. Mol. Biol. 1998;275:55–65. doi: 10.1006/jmbi.1997.1413. [DOI] [PubMed] [Google Scholar]
  • 9.van Dieck J, Fernandez-Fernandez MR, Veprintsev DB, Fersht AR. Modulation of the oligomerization state of p53 by differential binding of proteins of the S100 family to p53 monomers and tetramers. J. Biol. Chem. 2009;284:13804–13811. doi: 10.1074/jbc.M901351200. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10.Hanson S, Kim E, Deppert W. Redox factor 1 (Ref-1) enhances specific DNA binding of p53 by promoting p53 tetramerization. Oncogene. 2005;24:1641–1647. doi: 10.1038/sj.onc.1208351. [DOI] [PubMed] [Google Scholar]
  • 11.Li AG, Piluso LG, Cai X, Wei G, Sellers WR, Liu X. Mechanistic insights into maintenance of high p53 acetylation by PTEN. Mol. Cell. 2006;23:575–587. doi: 10.1016/j.molcel.2006.06.028. [DOI] [PubMed] [Google Scholar]
  • 12.Wenta N, Strauss H, Meyer S, Vinkemeier U. Tyrosine phosphorylation regulates the partitioning of STAT1 between different dimer conformations. Proc. Natl Acad. Sci. USA. 2008;105:9238–9243. doi: 10.1073/pnas.0802130105. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13.Ptashne M, Gann A. Transcriptional activation by recruitment. Nature. 1997;386:569–577. doi: 10.1038/386569a0. [DOI] [PubMed] [Google Scholar]
  • 14.de Lera AR, Bourguet W, Altucci L, Gronemeyer H. Design of selective nuclear receptor modulators: RAR and RXR as a case study. Nat. Rev. Drug Discov. 2007;6:811–820. doi: 10.1038/nrd2398. [DOI] [PubMed] [Google Scholar]
  • 15.Mangelsdorf DJ, Umesono K, Kliewer SA, Borgmeyer U, Ong ES, Evans RM. A direct repeat in the cellular retinol-binding protein type II gene confers differential regulation by RXR and RAR. Cell. 1991;66:555–561. doi: 10.1016/0092-8674(81)90018-0. [DOI] [PubMed] [Google Scholar]
  • 16.Yasmin R, Yeung KT, Chung RH, Gaczynska ME, Osmulski PA, Noy N. DNA-looping by RXR tetramers permits transcriptional regulation “at a distance”. J. Mol. Biol. 2004;343:327–338. doi: 10.1016/j.jmb.2004.08.070. [DOI] [PubMed] [Google Scholar]
  • 17.Jackson P, Mastrangelo I, Reed M, Tegtmeyer P, Yardley G, Barrett J. Synergistic transcriptional activation of the MCK promoter by p53: tetramers link separated DNA response elements by DNA looping. Oncogene. 1998;16:283–292. doi: 10.1038/sj.onc.1201491. [DOI] [PubMed] [Google Scholar]
  • 18.Stenger JE, Tegtmeyer P, Mayr GA, Reed M, Wang Y, Wang P, Hough PV, Mastrangelo IA. p53 oligomerization and DNA looping are linked with transcriptional activation. EMBO J. 1994;13:6011–6020. doi: 10.1002/j.1460-2075.1994.tb06947.x. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 19.Leung TH, Hoffmann A, Baltimore D. One nucleotide in a kappaB site can determine cofactor specificity for NF-kappaB dimers. Cell. 2004;118:453–464. doi: 10.1016/j.cell.2004.08.007. [DOI] [PubMed] [Google Scholar]
  • 20.Vinkemeier U, Cohen SL, Moarefi I, Chait BT, Kuriyan J, Darnell JE., Jr DNA binding of in vitro activated Stat1 alpha, Stat1 beta and truncated Stat1: interaction between NH2–terminal domains stabilizes binding of two dimers to tandem DNA sites. EMBO J. 1996;15:5616–5626. [PMC free article] [PubMed] [Google Scholar]
  • 21.Xu X, Sun YL, Hoey T. Cooperative DNA binding and sequence-selective recognition conferred by the STAT amino-terminal domain. Science. 1996;273:794–797. doi: 10.1126/science.273.5276.794. [DOI] [PubMed] [Google Scholar]
  • 22.Saiz L, Rubi JM, Vilar JMG. Inferring the in vivo looping properties of DNA. Proc. Natl Acad. Sci. USA. 2005;102:17642–17645. doi: 10.1073/pnas.0505693102. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 23.Saiz L, Vilar JMG. DNA looping: the consequences and its control. Curr. Opin. Struct. Biol. 2006;16:344–350. doi: 10.1016/j.sbi.2006.05.008. [DOI] [PubMed] [Google Scholar]
  • 24.Hill TL. An Introduction to Statistical Thermodynamics. Reading, MA: Addison-Wesley; 1960. [Google Scholar]
  • 25.Ackers GK, Johnson AD, Shea MA. Quantitative model for gene regulation by lambda phage repressor. Proc. Natl Acad. Sci. USA. 1982;79:1129–1133. doi: 10.1073/pnas.79.4.1129. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 26.Segal E, Widom J. From DNA sequence to transcriptional behaviour: a quantitative approach. Nat. Rev. Genet. 2009;10:443–456. doi: 10.1038/nrg2591. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 27.Vilar JMG, Saiz L. DNA looping in gene regulation: from the assembly of macromolecular complexes to the control of transcriptional noise. Curr. Opin. Genet. Dev. 2005;15:136–144. doi: 10.1016/j.gde.2005.02.005. [DOI] [PubMed] [Google Scholar]
  • 28.Saiz L, Vilar JM. Ab initio thermodynamic modeling of distal multisite transcription regulation. Nucleic Acids Res. 2008;36:726–731. doi: 10.1093/nar/gkm1034. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 29.Kuhlman T, Zhang Z, Saier MH, Jr, Hwa T. Combinatorial transcriptional control of the lactose operon of Escherichia coli. Proc. Natl Acad. Sci. USA. 2007;104:6043–6048. doi: 10.1073/pnas.0606717104. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 30.Vilar JMG, Leibler S. DNA looping and physical constraints on transcription regulation. J. Mol. Biol. 2003;331:981–989. doi: 10.1016/s0022-2836(03)00764-2. [DOI] [PubMed] [Google Scholar]
  • 31.Saiz L, Vilar JMG. Stochastic dynamics of macromolecular-assembly networks. Mol. Syst. Biol. 2006;2:0024. doi: 10.1038/msb4100061. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 32.Altucci L, Gronemeyer H. The promise of retinoids to fight against cancer. Nat. Rev. Cancer. 2001;1:181–193. doi: 10.1038/35106036. [DOI] [PubMed] [Google Scholar]
  • 33.Kersten S, Dawson MI, Lewis BA, Noy N. Individual subunits of heterodimers comprised of retinoic acid and retinoid X receptors interact with their ligands independently. Biochemistry. 1996;35:3816–3824. doi: 10.1021/bi952737k. [DOI] [PubMed] [Google Scholar]
  • 34.Bourguet W, Ruff M, Chambon P, Gronemeyer H, Moras D. Crystal structure of the ligand-binding domain of the human nuclear receptor RXR-alpha. Nature. 1995;375:377–382. doi: 10.1038/375377a0. [DOI] [PubMed] [Google Scholar]
  • 35.Egea PF, Mitschler A, Moras D. Molecular recognition of agonist ligands by RXRs. Mol. Endocrinol. 2002;16:987–997. doi: 10.1210/mend.16.5.0823. [DOI] [PubMed] [Google Scholar]
  • 36.Gampe RT, Jr, Montana VG, Lambert MH, Wisely GB, Milburn MV, Xu HE. Structural basis for autorepression of retinoid X receptor by tetramer formation and the AF-2 helix. Genes. Dev. 2000;14:2229–2241. doi: 10.1101/gad.802300. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 37.Mascrez B, Mark M, Krezel W, Dupe V, LeMeur M, Ghyselinck NB, Chambon P. Differential contributions of AF-1 and AF-2 activities to the developmental functions of RXR alpha. Development. 2001;128:2049–2062. doi: 10.1242/dev.128.11.2049. [DOI] [PubMed] [Google Scholar]
  • 38.Nagpal S, Friant S, Nakshatri H, Chambon P. RARs and RXRs: evidence for two autonomous transactivation functions (AF-1 and AF-2) and heterodimerization in vivo. EMBO J. 1993;12:2349–2360. doi: 10.1002/j.1460-2075.1993.tb05889.x. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 39.Chambon P. The nuclear receptor superfamily: a personal retrospect on the first two decades. Mol. Endocrinol. 2005;19:1418–1428. doi: 10.1210/me.2005-0125. [DOI] [PubMed] [Google Scholar]
  • 40.Heyman RA, Mangelsdorf DJ, Dyck JA, Stein RB, Eichele G, Evans RM, Thaller C. 9-cis retinoic acid is a high affinity ligand for the retinoid X receptor. Cell. 1992;68:397–406. doi: 10.1016/0092-8674(92)90479-v. [DOI] [PubMed] [Google Scholar]
  • 41.Levin AA, Sturzenbecker LJ, Kazmer S, Bosakowski T, Huselton C, Allenby G, Speck J, Kratzeisen C, Rosenberger M, Lovey A, et al. 9-cis retinoic acid stereoisomer binds and activates the nuclear receptor RXR alpha. Nature. 1992;355:359–361. doi: 10.1038/355359a0. [DOI] [PubMed] [Google Scholar]
  • 42.Levine M, Tjian R. Transcription regulation and animal diversity. Nature. 2003;424:147–151. doi: 10.1038/nature01763. [DOI] [PubMed] [Google Scholar]
  • 43.Simicevic J, Deplancke B. DNA-centered approaches to characterize regulatory protein-DNA interaction complexes. Mol. Biosyst. 2010;6:462–468. doi: 10.1039/b916137f. [DOI] [PubMed] [Google Scholar]
  • 44.Ptashne M, Gann A. Genes & signals. Cold Spring Harbor, NY: Cold Spring Harbor Laboratory Press; 2002. [Google Scholar]
  • 45.Metivier R, Penot G, Hubner MR, Reid G, Brand H, Kos M, Gannon F. Estrogen receptor-alpha directs ordered, cyclical, and combinatorial recruitment of cofactors on a natural target promoter. Cell. 2003;115:751–763. doi: 10.1016/s0092-8674(03)00934-6. [DOI] [PubMed] [Google Scholar]
  • 46.Agalioti T, Lomvardas S, Parekh B, Yie J, Maniatis T, Thanos D. Ordered recruitment of chromatin modifying and general transcription factors to the IFN-beta promoter. Cell. 2000;103:667–678. doi: 10.1016/s0092-8674(00)00169-0. [DOI] [PubMed] [Google Scholar]
  • 47.Vilar JMG, Guet CC, Leibler S. Modeling network dynamics: the lac operon, a case study. J. Cell Biol. 2003;161:471–476. doi: 10.1083/jcb.200301125. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 48.Lee E, Bussemaker HJ. Identifying the genetic determinants of transcription factor activity. Mol. Syst. Biol. 2010;6:412. doi: 10.1038/msb.2010.64. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 49.Nielsen R, Pedersen TA, Hagenbeek D, Moulos P, Siersbaek R, Megens E, Denissov S, Borgesen M, Francoijs KJ, Mandrup S, et al. Genome-wide profiling of PPARgamma:RXR and RNA polymerase II occupancy reveals temporal activation of distinct metabolic pathways and changes in RXR dimer composition during adipogenesis. Genes. Dev. 2008;22:2953–2967. doi: 10.1101/gad.501108. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 50.Vilar JMG. Accurate prediction of gene expression by integration of DNA sequence statistics with detailed modeling of transcription regulation. Biophys. J. 2010;99:2408–2413. doi: 10.1016/j.bpj.2010.08.006. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 51.Vilar JM, Saiz L. CplexA: a Mathematica package to study macromolecular-assembly control of gene expression. Bioinformatics. 2010;26:2060–2061. doi: 10.1093/bioinformatics/btq328. [DOI] [PubMed] [Google Scholar]
  • 52.Bissonnette RP, Brunner T, Lazarchik SB, Yoo NJ, Boehm MF, Green DR, Heyman RA. 9-cis retinoic acid inhibition of activation-induced apoptosis is mediated via regulation of fas ligand and requires retinoic acid receptor and retinoid X receptor activation. Mol. Cell Biol. 1995;15:5576–5585. doi: 10.1128/mcb.15.10.5576. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 53.Vuligonda V, Lin Y, Chandraratna RAS. Synthesis of highly potent RXR-specific retinoids: The use of a cyclopropyl group as a double bond isostere. Bioorg. Medicinal Chem. Lett. 1996;6:213–218. [Google Scholar]
  • 54.Widom RL, Rhee M, Karathanasis SK. Repression by ARP-1 sensitizes apolipoprotein AI gene responsiveness to RXR alpha and retinoic acid. Mol. Cell Biol. 1992;12:3380–3389. doi: 10.1128/mcb.12.8.3380. [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Supplementary Data

Articles from Nucleic Acids Research are provided here courtesy of Oxford University Press

RESOURCES