Abstract
Most genes change expression levels across conditions, but it is unclear which of these changes represents specific regulation and what determines their quantitative degree. Here, we accurately measured activities of ∼900 S. cerevisiae and ∼1800 E. coli promoters using fluorescent reporters. We show that in both organisms 60–90% of promoters change their expression between conditions by a constant global scaling factor that depends only on the conditions and not on the promoter’s identity. Quantifying such global effects allows precise characterization of specific regulation—promoters deviating from the global scale line. These are organized into few functionally related groups that also adhere to scale lines and preserve their relative activities across conditions. Thus, only several scaling factors suffice to accurately describe genome-wide expression profiles across conditions. We present a parameter-free passive resource allocation model that quantitatively accounts for the global scaling factors. It suggests that many changes in expression across conditions result from global effects and not specific regulation, and provides means for quantitative interpretation of expression profiles.
Keywords: gene expression, growth rate, modeling, promoter activity, transcription regulation
Introduction
Quantitative characterization of gene expression is a fundamental yet complex challenge. One of the major challenges stems from the dynamic nature of gene expression, whereby every gene can change its expression value across conditions. Although genome-wide analyses of expression are among the most commonly used methods in modern biology (Edgar, 2002), most studies produce lists of upregulated and downregulated genes, with limited focus on the numerical change in values. Here, we identify quantitative relationships between expression profiles under different conditions and observe a unifying behavior that simplifies our quantitative understanding of gene regulation.
Traditionally, gene expression research has focused on isolated genes and has generally shown that the transcriptional response is highly and specifically regulated. For example, upon exposure to lactose, bacteria respond by transcribing lactose-assimilating genes (Jacob and Monod, 1961). More recently, microarray and sequencing technologies have challenged this paradigm by enabling a genome-wide view of expression, and establishing that the responses to different conditions involve changes in expression of thousands of genes (Pedersen et al, 1978; DeRisi, 1997; Spellman et al, 1998; Gasch et al, 2000, 2001; O’Rourke and Herskowitz, 2002; Boer et al, 2003; Saldanha et al, 2004; Tu et al, 2005; Lai et al, 2005; Shalem et al, 2008; Chechik et al, 2008; Brauer et al, 2008; Yassour et al, 2009; Costenoble et al, 2011; Tirosh et al, 2011). Such massive expression changes between conditions raise several fundamental questions. Primarily, it is unclear why the expression of so many genes changes even between conditions whose phenotypic differences appear to be minor. As one example, it is unclear why growing yeast on either glucose or its epimer galactose leads to detectable expression changes in over half of the yeast genome (Gasch et al, 2000; Chechik et al, 2008) even though only a few enzymatic reactions separate the two substrates.
Initial attempts to bridge the gap between specific regulation and the wide spread changes observed in the data suggested that specific responses actually encompass more genes than initially appreciated (Spellman et al, 1998; Gasch et al, 2000, 2001; Tu et al, 2005). More recently, it was shown that many changes in expression are correlated with growth rate (Pedersen et al, 1978; Regenberg et al, 2006; Castrillo et al, 2007; Brauer et al, 2008; Fazio and Jewett, 2008; Zaslaver et al, 2009; Klumpp et al, 2009; Levy and Barkai, 2009), as proposed decades ago by the Copenhagen school (Maaloe, 1969; Ingraham and Ole Maaløe, 1983; Neidhardt, 1999), suggesting that they may result from global factors affecting many genes. Although several new works have attempted to incorporate global factors into gene expression models by analyzing synthetically constructed constitutive promoters (Klumpp et al, 2009; Scott et al, 2010; Gerosa et al, 2013), to date there is still no methodology to tease apart and decouple global and specific regulation. Therefore, it remains unknown what fraction of the gene expression changes observed upon a change in the growth condition can be explained by changes in global cellular parameters and which genes are specifically regulated. The ability to differentiate the two is critical for understanding the gene expression regulation. Moreover, except for isolated works (Van de Peppel et al, 2003; Bakel and Holstege, 2008; Islam et al, 2011), the importance of global changes in expression remains under-appreciated and is typically overlooked when analyzing high-throughput expression data. Such practice may lead to major misinterpretation of expression changes, as recently shown in Lovén et al (2012).
A second question invoked by the large magnitude of expression changes across conditions regards the combinatorics and complexity of gene expression programs. Considering all genes in all possible conditions, there is practically an infinite range of possible expression patterns that cells can reach. For example, yeast has ∼6000 potential transcriptional degrees of freedom as each of its ∼6000 promoters can potentially attain a different rate of transcription under each growth condition. It is intriguing to consider the degree to which this complexity is realized in cells. Indeed, genome-wide surveys of expression suggest that not all expression patterns are possible, as functionally related genes tend to be co-regulated (Pedersen et al, 1978; DeRisi, 1997; Gasch et al, 2000; Gasch et al, 2001; O’Rourke and Herskowitz, 2002; Boer et al, 2003; Saldanha et al, 2004; Lai et al, 2005; Chechik et al, 2008; Shalem et al, 2008; Brauer et al, 2008; Yassour et al, 2009; Costenoble et al, 2011; Tirosh et al, 2011). However, since most of these studies typically focus on the directionality of co-regulation (up or down) and not on the numerical change in values, to date the quantitative aspects of expression changes across conditions and their compliance with both classic (Maaloe, 1969; Ingraham and Ole Maaløe, 1983) and recent (Klumpp et al, 2009; Zaslaver et al, 2009; Scott et al, 2010) models of expression remain largely unaddressed. We aim to extend the existing qualitative description of co-regulated gene modules (Ihmels et al, 2002; Gasch and Eisen, 2002; Segal et al, 2003) and identify quantitative relationships between expression profiles under different conditions, thus reducing the space of possible expression patterns.
Here, we accurately measured the activities of ∼900 S. cerevisiae and ∼1800 E. coli promoters in 10 and 9 environmental conditions, respectively, using libraries of fluorescent reporters (Zaslaver et al, 2006, 2009; Zeevi et al, 2011). Notably, we found that most promoters (60–90%, depending on the pair of conditions compared) change their expression between conditions by a constant scaling factor that depends only on the conditions and not on the promoters’ identity. Thus, although there is a major change in values for nearly all promoters, the relative activity levels are preserved. Accounting for global effects allows precise quantification of more limited specific regulation—promoters deviating from global scaling. These can be organized into a handful of functionally related groups, such that within each group, promoters also preserve their relative activity levels across conditions in which they are activated. Hence, we can accurately describe 97% of the variability of the apparently complex promoter activity profiles across conditions using only several scaling factors. Finally, we present a parameter-free model that encompasses growth rate and specific gene expression and accounts for ∼90% of the observed variability in the global scaling factors. Our results provide a mean to decouple global and specific changes in activity between conditions, and a first quantitative characterization of the global response. They suggest that most changes in expression across conditions result from global effects and propose that proportional scaling is a major determinant of genome-wide expression profiles.
Results
Obtaining accurate measurements of promoter activity across different growth conditions
To obtain accurate measurements of promoter activity in yeast, we employed an experimental system based on the genomic fusion of promoters to fluorescent reporters (Zaslaver et al, 2006; Zeevi et al, 2011). We selected 867 native yeast promoters of genes that represent a wide variety of cellular functions, processes, and compartments (Supplementary Table S1). Although these genes cover only ∼1/6 of the S. cerevisiae genome, they cover the various Gene Ontology (GO) categories (Ashburner et al, 2000), promoter types (divergent/unique) (Saccharomyces Genome Database, available at: http://www.yeastgenome.org/), promoter architectures (OPN/DPN) (Tirosh and Barkai, 2008), and transcription regulation strategies (TFIID/SAGA-dominated) (Huisinga and Pugh, 2004). In addition, their combined expression represents ∼60% of the protein mass expressed in rich media (Wang et al, 2012) and thus accounts for much of the cellular activity under standard growth conditions (Supplementary Table S1, Supplementary material 1.1). We genomically integrated each promoter upstream of a yellow fluorescent protein (YFP) and used a robotically automated plate fluorometer to track the amount of reporter expression over time, in living cells, and across various growth conditions. Altogether, 859/867 (99%) promoters were successfully constructed. Simultaneous measurements of optical density (OD), indicative of population mass (Bremer and Dennis, 1987), enabled us to extract the doubling time of the culture and calculate the YFP production rate per OD unit per second (Methods, Zeevi et al, 2011), hereafter referred to as the promoter activity (Figure 1). These strains represent the largest library of native promoter-reporter fusions in eukaryotes to date.
The use of fluorescent reporters for measuring expression is a well-established approach (Bronstein et al, 1994; Kalir et al, 2001; Zaslaver et al, 2004; Newman et al, 2006; Ligr et al, 2006; Murphy et al, 2007; Cox et al, 2007; Gertz et al, 2009; Zeevi et al, 2011; Raveh-Sadka et al, 2012; Sharon et al, 2012), with several pronounced advantages. Unlike most current high-throughput techniques, which require cell lysis, fluorescence enables to perform live non-invasive imaging of the same cells over time with high temporal resolution. Accordingly, it does not require elaborate analysis and normalization techniques as required by other high-throughput measurement systems, such as microarrays or sequencing (Churchill, 2002; Marshall, 2004; Frantz, 2005; Bammler et al, 2005; Tang et al, 2007; Balázsi and Oltvai, 2007; Oshlack and Wakefield, 2009). This is especially important for our current study question as such normalizations may obliterate shared global effects (Lovén et al, 2012). It was also shown that fluorescent reporters provide highly precise and reproducible values (Bronstein et al, 1994; Kalir et al, 2001; Zaslaver et al, 2004; Ligr et al, 2006; Cox et al, 2007; Murphy et al, 2007; Gertz et al, 2009; Zeevi et al, 2011; Raveh-Sadka et al, 2012; Sharon et al, 2012). Here, we used replicate biological measurements to validate that our system provides highly sensitive and precise measurements for our set of promoters (CV ranging from 0.05 to 0.36 for promoters with high to very low activity, see Materials and methods and Supplementary Figure S2); more reproducible than those obtained by microarrays, sequencing, or mass spectrometry (Supplementary Figure S3). Together, these features of the system are critical for the ability to detect and quantify the global and specific changes in expression reported here.
To make sure that our synthetic fluorescence measurements are representative of the true promoter activity (of the native gene in its native genomic location), we performed several analyses to gauge the integrity and accuracy of the system. We compared the promoter activities with quantitative real-time PCR measurements of 18 selected strains under two growth conditions, and confirmed that YFP levels are an accurate proxy for the corresponding mRNA levels (R=0.99 and R=0.98, Supplementary Figure S4A and B). In addition, we compared our promoter activity values with three microarray studies (Holstege et al, 1998; Shalem et al, 2008; Lipson et al, 2009), three RNA-seq studies (Nagalakshmi et al, 2008; Lipson et al, 2009; Yassour et al, 2009), protein abundance obtained by immuno-tagged proteins (Ghaemmaghami et al, 2003), fluorescently tagged proteins (Stewart-Ornstein et al, 2012), mass spectrometry (De Godoy et al, 2008), and a curated data set of protein abundances integrated from five different data sets (Wang et al, 2012). In all of these comparisons, our promoter activity data correlated well with mRNA and protein abundance (R=0.72–0.81 and R=0.57–0.74 respectively, similar to the correlations between these data sets; Supplementary Figure S5), suggesting that promoter activity as measured by our system is a major determinant of these properties (Supplementary material 1.2). For further discussion of the experimental system, see Supplementary material 1.3.
Most promoters’ expression changes between conditions by a constant factor dependent only on condition and not on the promoter’s identity
To compare promoter activities across conditions, we measured our library under 10 different environmental growth conditions, known to affect the expression of genes in the library (Materials and methods, Supplementary material 1.1, Supplementary Table S2). In line with observations using other methods, most promoters changed their activity levels between every pair of conditions, indicating that a major fraction of the genome responds to growth under different conditions (Supplementary Tables S3 and S4).
Next, we plotted promoter activities in every pair of conditions (Figure 2; Supplementary Figure S6). Strikingly, we found that in each such pair, most promoters change their expression between conditions by a constant factor that depends only on the conditions and not on the promoter’s identity. This result indicates that although most promoters change their activity between conditions, they preserve their relative values. To quantify this global effect, we robustly fitted a scale line to the promoter activities of each pair of conditions, with the slope of the line ranging from 0.19 to 1 (arbitrarily setting glucose to 1, Materials and methods, Supplementary Table S4). We quantified the extent to which promoters adhere to the scale line by three independent methods: (A) Analysis of variance: We found that the scale line captures 80–99% of the variance in the data (P<10−45), depending on the pair of conditions compared. (B) We termed a promoter as behaving according to the global scale line if it deviated from it by less than three experimental standard deviations (Materials and methods). We found that between any two conditions, 60–90% of genes (depending on the pair of conditions compared) change expression by the same factor to within our relatively precise observation capacity. (C) Limiting the analysis to promoters that are moderately active in at least one of the two conditions being compared (>0.1, for which our measurements yield lower STD values; Supplementary Figure S2), we found that 58–88% are within ±30% of the global scale line (Materials and methods). These independent analyses confirm that for most promoters, their activity in condition B is equal to their activity in condition A multiplied by a single number (the slope of the scale line between conditions A and B). We term this number the global scaling factor between conditions A and B and return to analyze it below.
In addition to the dominating global response, for each pair of conditions there remains a smaller subset of promoters that do not scale according to the global scaling factor (Figure 2, gray dots). We termed a promoter as condition specific if it is deviated from the scale line by more than three experimental standard deviations (Materials and methods), using glucose as a reference condition. We note that the relatively small sizes of the specifically responding groups does not appear to result from low representation of these promoters in the library, since our library was initially designed to represent different groups of genes (Supplementary Table S1; Supplementary material 1.1). Additionally, genes and environmental conditions were chosen together to include genes that are known to respond to the conditions (based on Gasch et al, 2000, 2001). Furthermore, we repeated the analysis, excluding known growth-related groups of genes, such as those involved in protein synthesis, and obtained very similar results (Supplementary Figure S10), indicating the robustness of our results with respect to input genes. We found that for each condition, its set of condition-specific promoters showed remarkable agreement with our understanding of yeast physiology and encompassed known co-regulated gene modules (Gasch and Eisen, 2002; Ihmels et al, 2002; Segal et al, 2003). For example, the specific response to galactose includes almost all the promoters whose corresponding genes belong to the galactose utilization pathway (8/9, P<10−3), while the specific response to amino-acid starvation is enriched for amino-acid metabolism (28/50, P<10−10, Figure 2C).
Our results agree with previous studies (Pedersen et al, 1978; DeRisi, 1997; Gasch et al, 2000, 2001; O’Rourke and Herskowitz, 2002; Boer et al, 2003; Saldanha et al, 2004; Lai et al, 2005; Chechik et al, 2008; Shalem et al, 2008; Brauer et al, 2008; Yassour et al, 2009; Costenoble et al, 2011; Tirosh et al, 2011), showing that a considerable fraction of the yeast genome changes activity levels between every pair of conditions. Here, the accuracy of our experimental system, together with the use of replicates, which allowed the construction of a reliable error model, enabled us, for the first time, to provide concrete means to tease apart global and specific regulation using the global scale line. This decoupling provides a novel reconciliation between the known specificity of gene expression regulation and the ubiquitous changes in expression observed in high-throughput data. It suggests that between conditions there exists a highly regulated, relatively small specific response augmented by many global changes. It further implies that in most cases, and in contrast to many common interpretations, changes in expression are not indicative of specific regulation.
In addition to decoupling global and specific changes in promoter activities between conditions, we provide a quantitative characterization of the global response. We find that global changes in activities between conditions are accurately captured by a single scaling factor, thus providing a first experimental validation for the theory of proportional scaling suggested decades ago by the Copenhagen school (Maaloe, 1969; Ingraham and Ole Maaløe, 1983). We note that the observed linearity is far from trivial, as different theoretical models exert different predictions regarding the expected behavior of unregulated promoters across conditions, including static, linear, non-linear, and promiscuous responses (Supplementary material 1.4; Klumpp et al, 2009; Scott et al, 2010; Gerosa et al, 2013). Importantly, a prominent feature of the observed proportionality is that it preserves the ratio between most promoters across conditions. Specifically, it provides experimental evidence that complexes and pathways preserve the internal stoichiometry of their components, which may serve proper cell function. This finding reduces the degrees of freedom needed to characterize genetic programs, and opens the way for interrogation of the expression ratios being preserved.
Condition-specific promoters follow alternative scale lines
We further wished to examine whether proportional scaling is also a prominent feature in the specific responses. Previous studies have shown that functionally related genes are usually co-regulated (Pedersen et al, 1978; DeRisi, 1997; Gasch et al, 2000, 2001; O’Rourke and Herskowitz, 2002; Ihmels et al, 2002; Gasch and Eisen, 2002; Boer et al, 2003; Saldanha et al, 2004; Lai et al, 2005; Chechik et al, 2008; Shalem et al, 2008; Brauer et al, 2008; Yassour et al, 2009; Costenoble et al, 2011; Tirosh et al, 2011). However, to date it was not determined whether individual members within these co-regulated groups maintain proportionality throughout conditions. To this end, we studied promoters which are off the scale line under two different conditions: when comparing condition A with B and also when comparing condition A with C. Comparing these promoters in conditions B and C shows that they too fall on a scale line (Figure 3A–C; Supplementary Figure S7). In most cases, its scaling factor is very close to the global scaling factor between conditions B and C (Figure 3C–E). This finding can be interpreted as follows: the off-line genes have altered specific regulation in condition A (e.g., due to specific transcription factors (TFs)), but in conditions B and C they are not differentially regulated and thus follow the global scaling along with most other genes.
In other cases, we found that functionally related condition-specific genes fall on a separate scale line (Figure 3G; Supplementary Figure S7). One interesting example is the comparison of growth on ethanol (a respiratory condition) and galactose (a semi-respiratory condition). Both conditions require activation of respiration-related promoters, yet to a different extent (Fendt and Sauer, 2010). We found that between these conditions, respiration-related promoters preserve their relative levels, but with a scaling factor that was nearly two-fold higher than the global scaling factor (0.5 versus 0.28; Figure 3F and G), consistent with the increased utilization of the genes associated with these promoters in ethanol. Thus, for most (87%) promoters, the relationship between their activities in ethanol and galactose can be characterized either by a global scaling factor that accurately accounts for 77% of the promoters, or by a respiration-related scaling factor that accurately accounts for another 10% of the promoters. The remaining promoters are specific to either ethanol or galactose.
Altogether, we found, as have many before us (Pedersen et al, 1978; DeRisi, 1997; Gasch et al, 2000, 2001; O’Rourke and Herskowitz, 2002; Gasch and Eisen, 2002; Ihmels et al, 2002; Segal et al, 2003; Boer et al, 2003; Saldanha et al, 2004; Lai et al, 2005; Chechik et al, 2008; Shalem et al, 2008; Brauer et al, 2008; Yassour et al, 2009; Costenoble et al, 2011; Tirosh et al, 2011), that functionally related genes are co-regulated. Our results additionally show that promoters within these functionally related groups tend to preserve proportionality, changing their expression according to a common scaling factor. Adding to previous observations on co-regulation, we show that the ratios between co-regulated genes remain constant. This finding provides another layer of structure to our quantitative understanding of the organization of the genome-wide response to different conditions, on top of the dominating global response. We propose that the prevalent specific response of yeast to different conditions is to alter the degree to which an entire group of co-regulated genes is activated, without changing the internal stoichiometry between the member genes of the group.
Scaling factors accurately represent the entire promoter activity profile across conditions
Following the observations that proportional scaling underlies both global and specific responses in pairwise comparisons between conditions, we asked to what extent can the entire expression profile of yeast across conditions be represented by groups of genes that scale proportionally; and if so, how many such groups exist, and what are their sizes, scaling factors, and biological characteristics. To this end, we subjected the entire data set to k-means clustering with the cosine metric, as it has the property of preserving relative values within each cluster (Materials and methods). Notably, we found that partitioning our set of promoters into six well-separated clusters (Materials and methods, Supplementary Figure S8; Supplementary Table S3) accounts for 97% of the variance in our entire data set (Figure 4). This result indicates that if we could visualize our 10-dimentional expression space (in which every axis represents a different condition), promoter activities would not occupy the entire space, but rather adhere to only six lines (Figure 4; Supplementary Table S5, for more analysis see Supplementary material 1.5).
We performed several tests to assess the above clustering, both technically and biologically. First, we examined the clusters in terms of biological function by subjecting them to enrichment analysis of GO terms, TFs, promoter architectures (OPN/DPN), promoter types (divergent/unique), and transcription regulation strategies (TFIID/SAGA-dominated) (Materials and methods, Supplementary material 1.6). We found highly significant enrichments across all six clusters (Figure 4; Supplementary Tables S6 and S7). The first cluster contains most of the promoters (77%), and corresponds to promoters that scale according to the global scaling factor across all examined conditions. It includes all of the promoters in our data set whose corresponding genes represent components of the translation, transcription, chromatin, cell cycle, cytoskeleton machineries and most (85%) TFs. It is enriched for constitutive, TATA-less, TFIID-regulated promoters with an open chromatin architecture. This cluster represents promoters that are not differentially regulated across our set of conditions, and whose coordinated change in activity is mediated solely by global effects.
The other clusters exhibit condition-specific responses in a subset of the tested conditions and are highly enriched for families of genes that are known to respond to these conditions and their known regulators (Figure 4; Supplementary Tables S6 and S7; Supplementary material 1.6). For example, cluster 2 is upregulated both in fully aerobic (ethanol and glycerol) and in partially aerobic (galactose and galactose lacking amino acids) conditions, and is highly enriched with respiratory genes (15/16, P<10−14). It also includes promoters of six TFs, and four of these (HAP4, CAT8, ADR1, and USV1) are major known regulators of respiration (Haurie et al, 2001; Segal et al, 2003; Tachibana et al, 2005; Fendt and Sauer, 2010). Thus, these clusters may represent complete regulatory units that are co-regulated across conditions in a manner that largely preserves their internal stoichiometry. Whereas the identification that these genes are co-regulated is not surprising, our finding that they preserve their relative activities is novel.
An important implication of our clustering results is that proportional scaling of promoter activities transcends the usual partition of promoters to housekeeping/condition-specific, growth-regulated/stress-regulated, open/closed NFR, TFIID/SAGA-dominated. Notably, even clusters 2–6, which displayed condition-specific behaviors in one or more conditions, were not differentially regulated between most conditions. Whether active or inactive, their scaling factors mostly coincided with the global scaling factor (Figures 3C–E and 4; Supplementary Table S5). This is consistent with the complementary observation that cluster 1 contains condition-specific promoters that are not differentially regulated across our tested set of conditions and therefore scale according to the global scaling factors across the entire data set (e.g., ER stress-associated proteins, which probably would have clustered separately if ER stress-inducing conditions were tested). These observations hint that the mechanisms responsible for global proportional scaling are not unique to a limited set of genes, promoter architecture, or transcription regulation strategy. Accordingly, known growth-related sequence motifs, such as binding sites for Rap1/Ifh1/Fhl1/Sfp1/, RRPE and PAC (Hughes et al, 2000; Wade et al, 2001; Badis et al, 2008; Zhu et al, 2009) could not account for global scaling throughout all clusters. Thus, global proportional scaling of promoter activities is probably the result of a basic mechanism, shared across all promoter classes and architectures.
Next, we gauged the integrity and robustness of our clustering by means of cross-validation. For each of our conditions, we clustered the promoters based on all other conditions. We then used the activity values of only 10 predefined representative promoters in the tested condition to retrieve the scaling factors and thus predict the activity levels for the hundreds of other promoters (Materials and methods). If our clustering truly captures the variation in the underlying data, then we would expect these predictions to match the measured activities. Indeed, using only 10 promoters, we obtained highly accurate predictions, whereby in every condition we can explain over 85% of the variance of the activities of at least 98% of the promoters (Figure 5; Supplementary Figure S9; Supplementary material 1.7). These results suggest that describing the response to a new condition may only require the description of the behavior of a relatively small number of clusters, representing the few degrees of freedom the cell actually utilizes.
Finally, we validated that our results were not determined by growth-related groups of genes, such as protein synthesis which are overrepresented in our library, by excluding these promoters from the data set and repeating the analysis (Materials and methods). We obtained very similar clustering results with 95% of the genes preserving their cluster identities (Supplementary Figure S10), indicating the robustness of our clustering with respect to input genes.
Our results present a first attempt to quantitatively analyze the transcriptional response of yeast across conditions in terms of proportional scaling. In addition to partitioning the genes into groups which are co-regulated, as is the usual practice in the field, here we show that individual members within these groups scale together and preserve proportionality. We find that the ∼900 yeast promoters that we measured do not span the entire possible space of expression values, but rather adhere to a small number of scaling lines. This allows us to accurately describe the activity levels of the majority of promoters across conditions using only a handful of numbers representing the scaling factors of each of the different clusters. Moreover, we provide a first estimation for the relative magnitude of global and specific responses, showing that the former can account for much of the expression changes across many conditions. Together, our results suggest that gene expression profiles across different environmental conditions may exhibit less degrees of freedom and more structure than previously appreciated. These findings could probably be largely extended to hold not only for promoter activities, but also for mRNA and protein levels based on the analysis of existing data sets (Supplementary Figures S5 and S11; Supplementary material 1.2).
A simple passive resource allocation model accounts for a large fraction of the global scaling factors across conditions
Next, we asked what mechanisms can account for the quantitative values of the global scaling factors. Since enrichment analysis of promoter types, architectures, and regulation strategies did not provide mechanistic insights (Supplementary material 1.6), we examined two plausible theoretical models that result in proportional changes across promoters and compared their ability to explain the global scaling factor of each condition. In both models, we have no free parameters and we assume that promoter activity is an adequate proxy for protein production (Supplementary Figure S5; Supplementary material 1.2).
Many genes were shown to change their expression in accordance with the growth rate (Pedersen et al, 1978; Bremer and Dennis, 1987; Neidhardt, 1999; Regenberg et al, 2006; Castrillo et al, 2007; Brauer et al, 2008; Fazio and Jewett, 2008; Zaslaver et al, 2009; Klumpp et al, 2009; Levy and Barkai, 2009; Scott et al, 2010), and we thus first tested whether growth rate provides a good approximation for the values of the observed global scaling factors. In this model, we expect that in conditions where cells divide twice as fast, the activity of most promoters would double, overall preserving their concentration. Formally, this model predicts that the global scaling factor from condition A to condition B, S[A,B], will be given by:
where τ[A] and τ[B] represent the doubling time in conditions A and B, respectively. A possible mechanism for implementing such regulation is discussed in Supplementary material 1.4. This model entails that, per doubling time, the sum of activities of all unregulated promoters will be preserved across conditions (Figure 6A). We found that there is indeed a good correlation between the global scaling factors and growth rate (Supplementary Figure S12a) and that the scaling factors predicted by this model deviate by 30±28% from the measured scaling factors (3 of the 9 conditions are within 15%; Figure 6C).
Alternatively, in line with earlier models discussing differential allocation of resources between conditions (Ehrenberg and Kurland, 1984; Koch, 1988; Zaslaver et al, 2009; Molenaar et al, 2009; Scott et al, 2010), the overall promoter activity can be thought of as a fixed resource available to the cell per doubling time (Maaloe, 1969; Ingraham and Ole Maaløe, 1983). This is supported by our data, in which the total promoter activity per doubling time is relatively conserved (CV=0.13, Supplementary Table S4, Supplementary Figure S12b). In each condition, this resource is differentially partitioned between the condition-specific genes, Gspe, and the globally responding genes, Gglo, where the exact partition is determined by the varying magnitudes of the specific response required in each condition (Figure 6B). Thus, the value of the global scaling factor will accommodate both changes in growth rate and the magnitude of the specific response in each condition. Formally, this model predicts that the global scaling factor from condition A to condition B, S[A,B], is given by:
where represents the fractions of the total resources that are allocated to the global response promoters of conditions A and similarly for B. Gglo represents the set of globally responding genes, Gspe the set of condition-specific genes, and P[g,A] and P[g,B] represent the activity of promoter g in conditions A and B, respectively. Notably, this model achieves better explanatory power, deviating by only 12±19% from the measured scaling factors (6 of the 9 conditions are within 15%; Figure 6D).
We thus propose this passive resource allocation model as a potential explanation for the global scaling of promoter activities between conditions. We suggest that an environmental change requires changes in the activation of condition-specific gene clusters, which in turn affects the amount of cellular resources available to all other genes. Most promoters in the cell are not specifically regulated under a given condition and therefore their activity level will passively vary based on the availability of global cellular resources, as captured by the global scaling factors, but still preserving their relative values (Supplementary Material 1.8).
We note that there remains a fraction of the proportional response unexplained by the resource allocation model. This may partly stem from inaccuracies in our estimates of the total activity of all promoters, since our library does not include all promoters. Additionally, this simplified model is per biomass unit as measured by OD and does not account for other global properties, such as cell size and macromolecular composition, which are known to vary in different conditions along with the doubling time (Schaechter, 1958; Bremer and Dennis, 1987; Neidhardt, 1999). Nevertheless, the accuracy with which this model matches the measured global scaling factors based on the doubling times and magnitude of the specific responses suggests that these factors are likely to be major determinants of the global scaling factors.
Together, global proportional scaling of promoter activities across conditions and the passive resource allocation model, which quantitatively accounts for the global scaling factors, can potentially shed a new light on the previously reported correlations between the expression of many genes and growth rate (Pedersen et al, 1978; Bremer and Dennis, 1987; Neidhardt, 1999; Regenberg et al, 2006; Castrillo et al, 2007; Brauer et al, 2008; Fazio and Jewett, 2008; Zaslaver et al, 2009; Klumpp et al, 2009; Levy and Barkai, 2009; Scott et al, 2010). First, in contrast to previous reports, which reported that different genes have different correlations with growth rate, here we suggest that different genes have the same quantitative coordination with growth rate (Supplementary Figure S13). Second, our results agree with classical theoretical models (Maaloe, 1969; Ingraham and Ole Maaløe, 1983), and show that the correlation with growth rate is not limited to a specific subset of genes, as was suggested by these works (Regenberg et al, 2006; Castrillo et al, 2007; Brauer et al, 2008; Fazio and Jewett, 2008). Rather, it is an inherent trait of expression resulting from global cellular constraints, and thus common to all genes in all conditions in which they are not differentially regulated. Finally, the resource allocation model suggests that global proportional changes in expression should extend far beyond the previously observed correlations with growth rate. We predict that such changes should occur not only in exponential growth under different environmental conditions, but in all cases where there are changes in global cellular resources. These include, for example, dynamic changes in the environmental conditions, mutations, different stages of the cell cycle, and different stages of the metabolic cycle. Indeed, a recent study showed that global regulation is growth rate dependent not only during steady state but also during dynamic changes in growth rate (Gerosa et al, 2013). Thus, scaling lines can potentially provide a conceptual framework for analyzing expression changes, whatever the condition.
Promoter activities in E. coli scale proportionally between different growth conditions
To examine the generality of our results, we asked whether they also hold in other organisms. To this end, we measured promoter activities of ∼1800 promoters of the model bacteria, E. coli, using a library of GFP-reporter strains (Zaslaver et al, 2006, 2009, Supplementary Figure S14A). Measurements were performed in nine growth conditions (Supplementary Table S8) and promoter activity was calculated by the rate of GFP production per OD unit, at the time of maximal growth, as described (Materials and methods, Supplementary Table S9; Supplementary Figure S14). In total, 969 promoters were active above background in at least one condition. Using replicate measurements, we validated that this system provides precise promoter activity values (Materials and methods, Supplementary Figure S13).
Notably, the results for E. coli were highly analogous to those obtained for S. cerevisiae, whereby between each pair of conditions 70–90% of the promoters preserve their relative activities (Figure 7B). As in S. cerevisiae, the global scaling factors for the different conditions span a wide range (0.51–1.68, arbitrarily setting M9+glucose to 1, Supplementary Table S10). The scaling factors correspond to the growth rate and magnitude of the specific response predicted by the passive resource allocation model (85% of the variability explained; Supplementary Figure S16). Moreover, the specific responses are highly enriched with genes known to respond to the tested conditions (Supplementary Figure S17). Under conditions where a certain specific response was not differentially regulated, its promoters scaled according to the global scaling factor (Supplementary Figure S18). Here too, we can accurately predict the activity of most promoters in a new condition, using only a handful (five) of representative promoters (Materials and methods, Supplementary Figure S19).
Taken together, these results demonstrate that S. cerevisiae and E. coli exhibit similar global properties of transcriptional regulation, and that proportional scaling of promoter activity could be a basic and general trait across both prokaryotes and eukaryotes.
Discussion
We quantitatively characterized promoter activities under different growth conditions. In agreement with previous studies (Pedersen et al, 1978; DeRisi, 1997; Gasch et al, 2000, 2001; O’Rourke and Herskowitz, 2002; Boer et al, 2003; Saldanha et al, 2004; Lai et al, 2005; Chechik et al, 2008; Shalem et al, 2008; Brauer et al, 2008; Yassour et al, 2009; Costenoble et al, 2011; Tirosh et al, 2011), we find that a considerable fraction of the yeast genome changes activity levels between conditions. We further show that most of this change is accurately captured by a global linear function whose single scaling factor corresponds to the change in growth rate and magnitude of the condition-specific response. When specific groups of genes are activated, these too change according to scaling factors, changing the degree to which the entire group is activated, while preserving the ratios between genes within the group. We show that the activity levels of all promoters in any condition can be accurately described using only a handful of numbers representing the scaling factors of each of the different groups, thus showing the quantitative manifestation of the hierarchical nature of gene expression. To the best of our knowledge, this is the first time that proportional scaling of expression was analyzed and demonstrated to have such a major role in genome-wide expression profiles.
Our results imply that in many cases, changes in expression are not necessarily indicative of specific regulation, as recently suggested by several studies (Klumpp et al, 2009; Scott et al, 2010; Lovén et al, 2012; Gerosa et al, 2013). The global proportional scaling that we detected provides concrete means by which to tease apart global and specific regulation. The ability to differentiate genes that are actively regulated from those that are merely responding to changes in global parameters is critical for the understanding of gene expression regulation, and may provide a powerful tool for focusing on ‘functional biology’. Moreover, accounting for global effects allows precise quantification of specific regulation. Notably, we find that in each condition, only a relatively small and highly specific group of genes is actively regulated. The majority of expression changes across conditions, while real, seem to be the result of changes in global cellular factors and thus not informative about the regulation specific to the condition. This has broad practical implications, suggesting that global factors should be carefully taken into consideration when designing and analyzing studies that aim to understand gene expression regulation (Lovén et al, 2012).
Our work extends observations that the expression of many genes across conditions is correlated with growth rate (Pedersen et al, 1978; Bremer and Dennis, 1987; Neidhardt, 1999; Regenberg et al, 2006; Castrillo et al, 2007; Brauer et al, 2008; Fazio and Jewett, 2008; Zaslaver et al, 2009; Klumpp et al, 2009; Levy and Barkai, 2009; Scott et al, 2010). Here, we considerably strengthen this correlation by further showing that different genes have the same quantitative coordination with growth rate (Supplementary Figure S13). Moreover, generalizing classical theoretical models (Maaloe, 1969; Ingraham and Ole Maaløe, 1983), we claim that the correlation with growth rate is not limited to a specific subset of ‘growth-related’ genes, as previously suggested (Regenberg et al, 2006; Castrillo et al, 2007; Brauer et al, 2008; Fazio and Jewett, 2008). Rather, it is an inherent trait of expression resulting from global cellular constraints, and thus common to all genes in all conditions in which they are not differentially regulated. The shared quantitative relation of many genes to growth rate greatly reduces the degrees of freedom of the expression program across conditions (as we show by our ability to accurately predict the activity of ∼70% of the promoters in a new condition from measurement of only a single promoter), and simplifies our understanding of expression regulation. Proportional scaling paves the way for mechanistic understanding of the previously reported associations between growth rate and expression.
We also found that a simple parameter-free passive resource allocation model can account for the quantitative values of the global changes in promoter activities across conditions. If correct, then this model suggests that our findings regarding proportional changes in expression and their implications regarding specific and global modes of regulation extend far beyond the previously observed correlations with growth rate. We suggest that global proportional changes should occur not only in exponential growth at different growth rates, but in all cases where there are changes in global cellular resources. These include, for example, dynamic environmental changes, mutations, different stages of the cell cycle, different stages of the metabolic cycle, or different cell sizes. Thus, scaling lines can potentially provide a conceptual framework for analyzing expression changes, whatever the condition. It is interesting to speculate whether proportional scaling will be observed under such conditions and what will be the major determinant of the global scaling factors in these cases in which growth rate presumably has a minor role.
Our study opens many questions for future interrogation. Primarily, it will be interesting to learn how much of the proportionality observed for promoter activities is preserved by additional layers of regulation and perpetuates at the mRNA and protein levels, at both population level and in single cells. Additionally, it is interesting to consider what mechanisms are responsible for the observed proportionality and the coordination between promoter activity and growth rate. In bacteria, it has been suggested that cAMP (You et al, 2013), ppGpp (Magnusson et al, 2005), and use of alternative sigma factors (Klumpp and Hwa, 2008; Zaslaver et al, 2009) may contribute to the differential allocation of resources to different groups of genes. In yeast, transporters, cAMP, master growth regulators (e.g., TOR) and different ribosomal subunits have been suggested to perform a similar task (Broach, 2012). However, to date the molecular mechanisms underlying proportional responses, the identity of the limiting resources, and the role that cell size, shape, and composition may have in this process remain unknown.
Finally, our observations regarding proportional scaling of condition-specific genes adds yet another layer of structure to our quantitative understanding of the organization of the genome-wide response to different conditions. We show that individual genes within co-regulated gene groups respond in a similar quantitative manner, thereby maintaining proportionality, that is, their relative expression values, across conditions. An important implication of this preserved proportionality is that complexes and pathways preserve the internal stoichiometry of their components across conditions. This greatly reduces the degrees of freedom needed to characterize a genetic program and quantitatively manifests the hierarchical nature of gene expression regulation. It invites further interrogation of the expression ratios being preserved and the numerical values of the different scaling factors.
Materials and methods
A complete Materials and methods can be found in Supplementary material.
Library design and construction
Promoters were chosen to cover a wide variety of cellular functions and processes (Supplementary material 1.1). Promoter sequences were defined as the genomic region between the translation start site (TrSS) and the end of the upstream neighboring gene. All strains were constructed as previously described (Zeevi et al, 2011), based on the genomic integration of promoter sequences into a common master strain, upstream of an YFP reporter (Supplementary material 2.1). Final strains were validated by sequencing, growth curves, and mCherry expression levels, and abnormal strains were removed. For a full list of promoters, primers, and sequences, see Supplementary Table S1.
Promoter activity measurements
Cells were inoculated from frozen stocks into synthetic complete dextrose (SCD) (150 μl, 96-well plate) and grown at 30°C for 48 h, reaching complete saturation. Cells were then diluted 1:36 in fresh medium to a total volume of 180 μl and were grown at 30°C for at least 16 h in 96-well plates while being measured. Measurements were carried out every 20 min using a robotic system (Tecan Freedom EVO) with a plate reader (Tecan Infinite F500). Each measurement included optical density (OD600), YFP fluorescence, and RFP fluorescence. Measurements of each plate at every growth condition were repeated 1–6 times. The growth media are outlined in Supplementary Table S2.
Computing promoter activity levels
Basic analysis of measured OD, YFP, and RFP was done as previously described and included removal of strains with abnormal growth curves and RFP expression, subtraction of background levels of OD and auto-fluorescence, and smoothing of outlier measurements for each strain (Zeevi et al, 2011). Promoter activities were calculated as previously described (Zeevi et al, 2011; Raveh-Sadka et al, 2012) , with t1 and t2 defining a window of two doublings at the maximal growth rate (Supplementary material 2.4). The system’s detection level was assessed by examining the distribution of promoter activity levels for a strain containing an RFP gene but no YFP gene. For each condition, >30 biological replicates of the strain were measured and fitted to a normal distribution (Supplementary Figure S1), and the 95th percentile of the distribution was taken to be the detection level. For each strain in every condition, we took the final promoter activity levels to be the average of the strain across replicates. If this average was below the detection level, then we set the promoter activity to the detection level.
Experimental variability
The relative error was estimated by the coefficient of variation (CV) of replicate measurements. Mean values across expression bins were used to estimate the CV of any promoter activity level, by linear interpolation (Supplementary material 2.6). The CV values ranged from 0.36 for very low promoter activity levels to 0.05 for high promoter activity levels (Supplementary Figure S2).
Global scaling factors and error model
For all conditions, the global scaling factor represents the best robust fit to the data (Supplementary material 2.7). For each pairwise comparison, we determined the variance explained by the global scale line. The promoter activity values, v(p), of each promoter p was projected to the global scale line. Denoting the difference between the vector and its projection by d(p), the variance explained by the clustering was calculated as 1−variance(d(p))/variance(v(p)). To obtain a P-value for the explained variance, for each comparison between conditions X and Y, we randomized the promoter activities of condition Y and quantified the variance explained by the original global scaling line, as described above. This was repeated 1000 times for each pairwise comparison.
For each pairwise comparison, we then determined which promoters behave according to the global trend between these two conditions using two separate methods: (A) We analyzed all data points above detection and estimated their probability to behave according to the global trend. For each promoter, let (x, y) and (sx, sy) denote its activity level and standard deviation in conditions x and y, respectively. Denoting the scaling factor between the respective conditions by a, then promoters were defined as part of the global trend if <3sy or <3sx. (B) We restricted each pairwise comparison to promoters with an activity of >0.1 in both conditions. For such values, we found the average CV to remain constant at 0.05 (Supplementary Figure S2). For each pairwise comparison, we defined a promoter as part of the global trend if its value deviated by no more than 30% from its expected value according to the global scaling factor.
These two methods complement each other as the first is relative yet it enables the analysis of the entire data set, taking into account our different level of confidence in low and high values. The second is restricted to only parts of the data, yet it enables to determine absolute values. Both yielded similar results with ∼60–90% of promoters (depending on the pair of conditions compared) changing between conditions according to the global scaling factor.
Quantitative PCR analysis
Eighteen representative strains belonging to cluster 1 (RPB10, TEF1, DPM1, SEC61, SHP1, CDC10, RPS3, GLY1, RPL3, RPL33A, RPL8B, RPS7A, RPS11B, RPL4B, RPL28) or cluster 6 (GAL1, GAL2, GAL7) were inoculated from frozen stocks into SCD (150 μl, 96-well plate) and grown at 30°C for 48 h, reaching complete saturation. Cells were then diluted 1:36 in fresh medium and pelleted at mid-exponential phase. RNA was extracted using the EPICENTER Yeast MasterPURE RNA extraction kit, and cDNA was created using random hexamers (sigma). Quantitative PCR was performed by RT–PCR (StepOnePlus, Applied Biosystems) using a ready-mix kit (KAPA, KK4605). For each strain, measurements were performed in two sets of triplicates, measuring both YFP and RFP mRNA. Reported values are of mean YFP/RFP from nine replicates derived from three independent experiments (Supplementary material 2.8).
Functional annotation and enrichment analysis
Sets of genes were assigned process, function, and cellular components according to the annotations from the GO (Ashburner et al, 2000). The significant representation of GO terms in the set was evaluated by Gorilla GO Term Finder (Eden et al, 2009) with a P-value threshold of 10−3. For TF analysis, we examined the distribution of known TF promoters (Badis et al, 2008; Zhu et al, 2009) across the different clusters. For enrichment analysis, promoters were classified as previously described according to their properties as: OPN/DPN (Tirosh and Barkai, 2008), SAGA-dominated/TFIID-dominated (Huisinga and Pugh, 2004), divergent/unique (this study, based on the Saccharomyces Genome Database, available at: http://www.yeastgenome.org/). P-values were computed according to the HG distribution and corrected for multiple hypothesis testing using FDR correction (Benjamini and Hochberg, 1995).
Clustering promoter activities
To partition the promoters into clusters that preserve proportionality, we used K-means clustering with the cosine metric (defined by (x,y)=1−cos, where x and y are vectors of promoter activity levels in a given condition and O is the origin). The clustering was repeated 100 times with different random starting points and the clustering that minimized the sum of distances from the centers was chosen. The number of clusters, K, was determined as the largest K for which the distance between any two centers is at least 0.05 (Supplementary Figure S8), thereby ensuring a minimal separation between any two clusters (Supplementary material 2.10). For generation of Supplementary Figure S10, this analysis was repeated excluding all ribosomal promoters.
Variance explained by clustering
For each promoter p, its vector of promoter activity levels across conditions, v(p), was projected to the center of the corresponding cluster. Denoting the difference between the vector and its projection by d(p), the variance explained by the clustering was calculated as 1−variance(d(p))/variance(v(p)).
Predicting promoter activity levels
We used the following scheme to predict promoter activity levels under growth condition Y from measurements of several other conditions x1,…,xm. First, the number of clusters k for all promoters under the measured m conditions was determined using above criterion. Then, the promoters were clustered by the k-means algorithm using the cosine metric. Denote the centers of the clusters by c1,…,ck. A small number of representative promoters were chosen as the training set, and their promoter activity levels under the new condition Y were used for the prediction task. For each cluster t, an extended center of size m+1 was calculated from the representative promoters that belong to cluster t. The activity level of a promoter under the new condition Y was predicted to be , where x is the vector of activity levels for that promoter. The representative promoters were chosen such that an equal number of promoters were chosen from each cluster, which are closest (by the cosine metric) to the centers c1,…,ck of the relevant clusters (Supplementary material 2.12).
E. coli
Growth conditions
All media for bacterial growth were based on a defined M9 minimal medium (Supplementary material 3.1). Specific growth conditions and the respective growth rates in each condition are listed in Supplementary Table S8.
Robotic assay for genome-wide promoter activity data
The library of reporter strains, each bearing a low-copy plasmid with one of E .coli promoters controlling fast-folding GFP (Supplementary Figure S14A; Cormack et al, 1996) was previously described (Zaslaver et al, 2006). This library includes 1820 reporter strains that represent ∼75% of E. coli promoters. Reporter strains were inoculated from frozen stocks into 96-well plates containing M9 minimal medium supplemented with 11 mM glucose, 0.05% casamino acids and 50 μg/ml kanamycin and grown overnight in a shaker at 37°C. All steps from this point were carried out using a programmable robotic system (Freedom Evo, Tecan Inc.). Overnight cultures were diluted 1:33 into M9 medium followed by a second 1:15 dilution into flat bottom microwell plate (nunc) containing one of the growth media (Supplementary Table S8) in a final culture volume of 150 μl. Bacteria were grown in an incubator with shaking (6 Hz) at 37°C for about 20 h. Every 8 min the plate was transferred by the robotic arm into a multiwall fluorometer (Infinite F200, Tecan) that reads the OD (600 nm) and GFP (excitation 480 (20), emission 515(10)). After 5 h of incubation, NaCl or casamino was added to the appropriate plates by automated pipetting.
Computing promoter activity levels, detection level, experimental variability and error model
Promoter activity was calculated by the rate of GFP production per OD unit, as described above for yeast (Supplementary material 2.4) for the 3-h window around mid-exponential growth (Supplementary Figure S14). For conditions in which a compound was added to the media, promoter activity was calculated for the window of time after its addition. Background fluorescence was measured using a promoter-less control strain in each plate. Promoter activities lower than 3 STDs above the mean background promoter activity were set to zero. In total, 969 promoters were active above background in at least one condition. Experimental variability was assessed as described above, using three replicate measurements in M9 glucose (Supplementary Figure S15) and error model was calculated as for S. cerevisiae.
Identification of representative promoters for predictions was done iteratively. At each iteration, we calculated the best linear sum of the representative promoter, which predicted the experimental data, and added an additional representative promoter, which contribute the most to predict the experimental data.
Supplementary Material
Acknowledgments
This work was supported by grants from the European Research Council (ERC) and the US National Institutes of Health (NIH) to ES. ES is the incumbent of the Soretta and Henry Shapiro career development chair. RM is the incumbent of the Anna and Maurice Boukstein career development chair. We thank Michal Levo for helpful discussion.
Author contributions: LK and ES conceived the project. LK designed the yeast promoter library, and together with MLP constructed the strains. DZ and AW developed protocols for robotic strain assembly and constructed the yeast master strain. LK performed expression measurements and wrote the pipeline to extract promoter activity data. OZ performed analyses of clustering, prediction, and model evaluation. LK performed enrichment analysis and qPCR. LK, OZ, UB, RM, and ES conceived and evaluated the models. VS, GA, and AB performed measurements of E. coli, and ED analyzed the data. LK wrote the manuscript. UA, RM, and ES supervised and guided the research.
Footnotes
The authors declare that they have no conflict of interest.
References
- Ashburner M, Ball CA, Blake JA, Botstein D, Butler H, Cherry JM, Davis AP, Dolinski K, Dwight SS, Eppig JT, Harris MA, Hill DP, Issel-Tarver L, Kasarskis A, Lewis S, Matese JC, Richardson JE, Ringwald M, Rubin GM, Sherlock G (2000) Gene ontology: tool for the unification of biology. The Gene Ontology Consortium. Nat Genet 25: 25–29 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Badis G, Chan ET, van Bakel H, Pena-Castillo L, Tillo D, Tsui K, Carlson CD, Gossett AJ, Hasinoff MJ, Warren CL, Gebbia M, Talukder S, Yang A, Mnaimneh S, Terterov D, Coburn D, Li Yeo A, Yeo ZX, Clarke ND, Lieb JD et al. (2008) A library of yeast transcription factor motifs reveals a widespread function for Rsc3 in targeting nucleosome exclusion at promoters. Mol Cell 32: 878–887 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Bakel H Van, Holstege FCP (2008) A tutorial for DNA microarray expression profiling. Cell Evaluating: 22–28 , Available at: http://download.cell.com/images/edimages/etbr/VANBAKEL.PDF [Google Scholar]
- Balázsi G, Oltvai ZN (2007) A pitfall in series of microarrays: the position of probes affects the cross-correlation of gene expression profiles. Methods Mol Biol (Clifton, NJ) 377: 153–162 [DOI] [PubMed] [Google Scholar]
- Bammler T, Beyer RP, Bhattacharya S, Boorman GA, Boyles A, Bradford BU, Bumgarner RE, Bushel PR, Chaturvedi K, Choi D, Cunningham ML, Deng S, Dressman HK, Fannin RD, Farin FM, Freedman JH, Fry RC, Harper A, Humble MC, Hurban P et al. (2005) Standardizing global gene expression analysis between laboratories and across platforms. Nat Meth 2: 351–356 [DOI] [PubMed] [Google Scholar]
- Benjamini Y, Hochberg Y (1995) Controlling the false discovery rate: a practical and powerful approach to multiple testing. J R Stat Soc Series B Methodol 57: 289–300 [Google Scholar]
- Boer VM, de Winde JH, Pronk JT, Piper MDW (2003) The genome-wide transcriptional responses of Saccharomyces cerevisiae grown on glucose in aerobic chemostat cultures limited for carbon, nitrogen, phosphorus, or sulfur. J Biol Chem 278: 3265–3274 [DOI] [PubMed] [Google Scholar]
- Brauer MJ, Huttenhower C, Airoldi EM, Rosenstein R, Matese JC, Gresham D, Boer VM, Troyanskaya OG, Botstein D (2008) Coordination of growth rate, cell cycle, stress response, and metabolic activity in yeast. Mol Biol Cell 19: 352–367 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Bremer H, Dennis P (1987) Modulation of chemical composition and other parameters of the cell by growth rate. Escherichia coli and Salmonella. Available at: http://ctbp.ucsd.edu/qbio/beemer96.pdf [Accessed 7 November 2011] [Google Scholar]
- Broach JR (2012) Nutritional control of growth and development in yeast. Genetics 192: 73–105 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Bronstein I, Fortin J, Stanley PE, Stewart GS, Kricka LJ (1994) Chemiluminescent and bioluminescent reporter gene assays. Anal Biochem 219: 169–181 [DOI] [PubMed] [Google Scholar]
- Castrillo JI, Zeef LA, Hoyle DC, Zhang N, Hayes A, Gardner DCJ, Cornell MJ, Petty J, Hakes L, Wardleworth L, Rash B, Brown M, Dunn WB, Broadhurst D, O’Donoghue K, Hester SS, Dunkley TPJ, Hart SR, Swainston N, Li P et al. (2007) Growth control of the eukaryote cell: a systems biology study in yeast. J Biol 6: 4. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Chechik G, Oh E, Rando O, Weissman J, Regev A, Koller D (2008) Activity motifs reveal principles of timing in transcriptional control of the yeast metabolic network. Nat Biotechnol 26: 1251–1259 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Churchill GA (2002) Fundamentals of experimental design for cDNA microarrays. Nat Genet 32(Suppl): 490–495 [DOI] [PubMed] [Google Scholar]
- Cormack BP, Valdivia RH, Falkow S (1996) FACS-optimized mutants of the green fluorescent protein (GFP). Gene 173: 33–38 [DOI] [PubMed] [Google Scholar]
- Costenoble R, Picotti P, Reiter L, Stallmach R, Heinemann M, Sauer U, Aebersold R (2011) Comprehensive quantitative analysis of central carbon and amino-acid metabolism in Saccharomyces cerevisiae under multiple conditions by targeted proteomics. Mol Syst Biol 7: 464. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Cox RS, Surette MG, Elowitz MB (2007) Programming gene expression with combinatorial promoters. Mol Syst Biol 3: 145. [DOI] [PMC free article] [PubMed] [Google Scholar]
- De Godoy LMF, Olsen J V, Cox J, Nielsen ML, Hubner NC, Fröhlich F, Walther TC, Mann M (2008) Comprehensive mass-spectrometry-based proteome quantification of haploid versus diploid yeast. Nature 455: 1251–1254 [DOI] [PubMed] [Google Scholar]
- DeRisi JL (1997) Exploring the metabolic and genetic control of gene expression on a genomic scale. Science 278: 680–686 [DOI] [PubMed] [Google Scholar]
- Eden E, Navon R, Steinfeld I, Lipson D, Yakhini Z (2009) GOrilla: a tool for discovery and visualization of enriched GO terms in ranked gene lists. BMC Bioinformatics 10: 48. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Edgar R (2002) Gene Expression Omnibus: NCBI gene expression and hybridization array data repository. Nucleic Acids Res 30: 207–210 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Ehrenberg M, Kurland CG (1984) Costs of accuracy determined by a maximal growth rate constraint. Q Rev Biophys 17: 45–82 [DOI] [PubMed] [Google Scholar]
- Fazio A, Jewett M (2008) Transcription factor control of growth rate dependent genes in Saccharomyces cerevisiae: a three factor design. BMC Available at: http://www.biomedcentral.com/1471-2164/9/341 [Accessed 7 November 2011] [DOI] [PMC free article] [PubMed] [Google Scholar]
- Fendt S-M, Sauer U (2010) Transcriptional regulation of respiration in yeast metabolizing differently repressive carbon substrates. BMC Syst Biol 4: 12. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Frantz S (2005) An array of problems. Nat Rev Drug Discov 4: 362–363 [DOI] [PubMed] [Google Scholar]
- Gasch AP, Eisen MB (2002) Exploring the conditional coregulation of yeast gene expression through fuzzy k-means clustering. Genome Biol 3: RESEARCH0059. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Gasch AP, Huang M, Metzner S, Botstein D, Elledge SJ, Brown PO (2001) Genomic expression responses to DNA-damaging agents and the regulatory role of the yeast ATR homolog Mec1p. Mol Biol Cell 12: 2987–3003 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Gasch AP, Spellman PT, Kao CM, Carmel-Harel O, Eisen MB, Storz G, Botstein D, Brown PO (2000) Genomic expression programs in the response of yeast cells to environmental changes. Mol Biol Cell 11: 4241–4257 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Gerosa L, Kochanowski K, Heinemann M, Sauer U (2013) Dissecting specific and global transcriptional regulation of bacterial gene expression. Mol Syst Biol 9: 658. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Gertz J, Siggia ED, Cohen BA (2009) Analysis of combinatorial cis-regulation in synthetic and genomic promoters. Nature 457: 215–218 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Ghaemmaghami S, Huh W-K, Bower K, Howson RW, Belle A, Dephoure N, O’Shea EK, Weissman JS (2003) Global analysis of protein expression in yeast. Nature 425: 737–741 [DOI] [PubMed] [Google Scholar]
- Haurie V, Perrot M, Mini T, Jenö P, Sagliocco F, Boucherie H (2001) The transcriptional activator Cat8p provides a major contribution to the reprogramming of carbon metabolism during the diauxic shift in Saccharomyces cerevisiae. J Biol Chem 276: 76–85 [DOI] [PubMed] [Google Scholar]
- Holstege FC, Jennings EG, Wyrick JJ, Lee TI, Hengartner CJ, Green MR, Golub TR, Lander ES, Young RA (1998) Dissecting the regulatory circuitry of a eukaryotic genome. Cell 95: 717–728 [DOI] [PubMed] [Google Scholar]
- Hughes JD, Estep PW, Tavazoie S, Church GM (2000) Computational identification of cis-regulatory elements associated with groups of functionally related genes in Saccharomyces cerevisiae. J Mol Biol 296: 1205–1214 [DOI] [PubMed] [Google Scholar]
- Huisinga KL, Pugh BF (2004) A genome-wide housekeeping role for TFIID and a highly regulated stress-related role for SAGA in Saccharomyces cerevisiae. Mol Cell 13: 573–585 [DOI] [PubMed] [Google Scholar]
- Ihmels J, Friedlander G, Bergmann S, Sarig O, Ziv Y, Barkai N (2002) Revealing modular organization in the yeast transcriptional network. Nat Genet 31: 370–377 [DOI] [PubMed] [Google Scholar]
- Ingraham JL, Maaløe O, Neidhardt FC (1983) Growth of the Bacterial Cell Sunderland, MA: Sinauer Assoc., [Google Scholar]
- Islam S, Kjällquist U, Moliner A, Zajac P, Fan J-B, Lönnerberg P, Linnarsson S (2011) Characterization of the single-cell transcriptional landscape by highly multiplex RNA-seq. Genome Res 21: 1160–1167 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Jacob F, Monod J (1961) Genetic regulatory mechanisms in the synthesis of proteins. J Mol Biol 3: 318–356 [DOI] [PubMed] [Google Scholar]
- Kalir S, McClure J, Pabbaraju K, Southward C, Ronen M, Leibler S, Surette MG, Alon U (2001) Ordering genes in a flagella pathway by analysis of expression kinetics from living bacteria. Science (New York, NY) 292: 2080–2083 [DOI] [PubMed] [Google Scholar]
- Klumpp S, Hwa T (2008) Growth-rate-dependent partitioning of RNA polymerases in bacteria. Proc Natl Acad Sci USA 105: 20245–20250 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Klumpp S, Zhang Z, Hwa T (2009) Growth rate-dependent global effects on gene expression in bacteria. Cell 139: 1366–1375 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Koch AL (1988) Why can’t a cell grow infinitely fast? Can J Microbiol 34: 421–426 [DOI] [PubMed] [Google Scholar]
- Lai L-C, Kosorukoff AL, Burke P V, Kwast KE (2005) Dynamical remodeling of the transcriptome during short-term anaerobiosis in Saccharomyces cerevisiae: differential response and role of Msn2 and/or Msn4 and other factors in galactose and glucose media. Mol Cell Biol 25: 4075–4091 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Levy S, Barkai N (2009) Coordination of gene expression with growth rate: a feedback or a feed-forward strategy? FEBS Lett 583: 3974–3978 [DOI] [PubMed] [Google Scholar]
- Ligr M, Siddharthan R, Cross FR, Siggia ED (2006) Gene expression from random libraries of yeast promoters. Genetics 172: 2113–2122 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Lipson D, Raz T, Kieu A, Jones DR, Giladi E, Thayer E, Thompson JF, Letovsky S, Milos P, Causey M (2009) Quantification of the yeast transcriptome by single-molecule sequencing. Nat Biotechnol 27: 652–658 [DOI] [PubMed] [Google Scholar]
- Lovén J, Orlando DA, Sigova AA, Lin CY, Rahl PB, Burge CB, Levens DL, Lee TI, Young RA (2012) Revisiting global gene expression analysis. Cell 151: 476–482 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Maaloe O (1969) An analysis of bacterial growth. Dev Biol 3: 33–58 [Google Scholar]
- Magnusson LU, Farewell A, Nyström T (2005) ppGpp: a global regulator in Escherichia coli. Trends Microbiol 13: 236–242 [DOI] [PubMed] [Google Scholar]
- Marshall E (2004) Getting the noise out of gene arrays. Science (New York, NY) 306: 630–631 [DOI] [PubMed] [Google Scholar]
- Molenaar D, van Berlo R, de Ridder D, Teusink B (2009) Shifts in growth strategies reflect tradeoffs in cellular economics. Mol Syst Biol 5: 323. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Murphy KF, Balázsi G, Collins JJ (2007) Combinatorial promoter design for engineering noisy gene expression. Proc Natl Acad Sci USA 104: 12726–12731 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Nagalakshmi U, Wang Z, Waern K, Shou C, Raha D, Gerstein M, Snyder M (2008) The transcriptional landscape of the yeast genome defined by RNA sequencing. Science (New York, NY) 320: 1344–1349 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Neidhardt FC (1999) Bacterial growth: constant obsession with dN/dt. J Bacteriol 181: 7405–7408 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Newman JRS, Ghaemmaghami S, Ihmels J, Breslow DK, Noble M, DeRisi JL, Weissman JS (2006) Single-cell proteomic analysis of S. cerevisiae reveals the architecture of biological noise. Nature 441: 840–846 [DOI] [PubMed] [Google Scholar]
- Oshlack A, Wakefield MJ (2009) Transcript length bias in RNA-seq data confounds systems biology. Biol Direct 4: 14. [DOI] [PMC free article] [PubMed] [Google Scholar]
- O’Rourke SM, Herskowitz I (2002) A third osmosensing branch in Saccharomyces cerevisiae requires the Msb2 protein and functions in parallel with the Sho1 branch. Mol Cell Biol 22: 4739–4749 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Pedersen S, Bloch PL, Reeh S, Neidhardt FC (1978) Patterns of protein synthesis in E. coli: a catalog of the amount of 140 individual proteins at different growth rates. Cell 14: 179–190 [DOI] [PubMed] [Google Scholar]
- Raveh-Sadka T, Levo M, Shabi U, Shany B, Keren L, Lotan-Pompan M, Zeevi D, Sharon E, Weinberger A, Segal E (2012) Manipulating nucleosome disfavoring sequences allows fine-tune regulation of gene expression in yeast. Nat Genet 44: 743–750 [DOI] [PubMed] [Google Scholar]
- Regenberg B, Grotkjaer T, Winther O, Fausbøll A, Akesson M, Bro C, Hansen LK, Brunak S, Nielsen J (2006) Growth-rate regulated genes have profound impact on interpretation of transcriptome profiling in Saccharomyces cerevisiae. Genome Biol 7: R107. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Saldanha AJ, Brauer MJ, Botstein D (2004) Nutritional homeostasis in batch and steady-state culture of yeast. Mol Biol Cell 15: 4089–4104 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Schaechter M (1958) Dependency on medium and temperature of cell size and chemical composition during balanced growth of Salmonella typhimurium. J Gen Microbiol 19: 592–606 [DOI] [PubMed] [Google Scholar]
- Scott M, Gunderson CW, Mateescu EM, Zhang Z, Hwa T (2010) Interdependence of cell growth and gene expression: origins and consequences. Science (New York, NY) 330: 1099–1102 [DOI] [PubMed] [Google Scholar]
- Segal E, Shapira M, Regev A, Pe’er D, Botstein D, Koller D, Friedman N (2003) Module networks: identifying regulatory modules and their condition-specific regulators from gene expression data. Nat Genet 34: 166–176 [DOI] [PubMed] [Google Scholar]
- Shalem O, Dahan O, Levo M, Martinez MR, Furman I, Segal E, Pilpel Y (2008) Transient transcriptional responses to stress are generated by opposing effects of mRNA production and degradation. Mol Syst Biol 4: 223. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Sharon E, Kalma Y, Sharp A, Raveh-Sadka T, Levo M, Zeevi D, Keren L, Yakhini Z, Weinberger A, Segal E (2012) Inferring gene regulatory logic from high-throughput measurements of thousands of systematically designed promoters. Nat Biotechnol 30: 521–530 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Spellman PT, Sherlock G, Zhang MQ, Iyer VR, Anders K, Eisen MB, Brown PO, Botstein D, Futcher B (1998) Comprehensive identification of cell cycle-regulated genes of the yeast Saccharomyces cerevisiae by microarray hybridization. Mol Biol Cell 9: 3273–3297 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Stewart-Ornstein J, Weissman JS, El-Samad H (2012) Cellular noise regulons underlie fluctuations in Saccharomyces cerevisiae. Mol Cell 45: 483–493 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Tachibana C, Yoo JY, Tagne J-B, Kacherovsky N, Lee TI, Young ET (2005) Combined global localization analysis and transcriptome data identify genes that are directly coregulated by Adr1 and Cat8. Mol Cell Biol 25: 2138–2146 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Tang T, François N, Glatigny A, Agier N, Mucchielli M-H, Aggerbeck L, Delacroix H (2007) Expression ratio evaluation in two-colour microarray experiments is significantly improved by correcting image misalignment. Bioinformatics (Oxford, England) 23: 2686–2691 [DOI] [PubMed] [Google Scholar]
- Tirosh I, Barkai N (2008) Two strategies for gene regulation by promoter nucleosomes. Genome Res 18: 1084–1091 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Tirosh I, Wong KH, Barkai N, Struhl K (2011) Extensive divergence of yeast stress responses through transitions between induced and constitutive activation. Proc Natl Acad Sci USA 108: 16693–16698 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Tu BP, Kudlicki A, Rowicka M, McKnight SL (2005) Logic of the yeast metabolic cycle: temporal compartmentalization of cellular processes. Science (New York, NY) 310: 1152–1158 [DOI] [PubMed] [Google Scholar]
- Van de Peppel J, Kemmeren P, van Bakel H, Radonjic M, van Leenen D, Holstege FCP (2003) Monitoring global messenger RNA changes in externally controlled microarray experiments. EMBO Rep 4: 387–393 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Wade C, Shea KA, Jensen R V, McAlear MA (2001) EBP2 is a member of the yeast RRB regulon, a transcriptionally coregulated set of genes that are required for ribosome and rRNA biosynthesis. Mol Cell Biol 21: 8638–8650 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Wang M, Weiss M, Simonovic M, Haertinger G, Schrimpf SP, Hengartner MO, von Mering C (2012) PaxDb, a database of protein abundance averages across all three domains of life. Mol Cell Proteomics 11: 492–500 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Yassour M, Kaplan T, Fraser HB, Levin JZ, Pfiffner J, Adiconis X, Schroth G, Luo S, Khrebtukova I, Gnirke A, Nusbaum C, Thompson D-A, Friedman N, Regev A (2009) Ab initio construction of a eukaryotic transcriptome by massively parallel mRNA sequencing. Proc Natl Acad Sci USA 106: 3264–3269 [DOI] [PMC free article] [PubMed] [Google Scholar]
- You C, Okano H, Hui S, Zhang Z, Kim M, Gunderson CW, Wang Y-P, Lenz P, Yan D, Hwa T (2013) Coordination of bacterial proteome with metabolism by cyclic AMP signalling. Nature 500: 301–306 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Zaslaver A, Bren A, Ronen M, Itzkovitz S, Kikoin I, Shavit S, Liebermeister W, Surette MG, Alon U (2006) A comprehensive library of fluorescent transcriptional reporters for Escherichia coli. Nat Methods 3: 623–628 [DOI] [PubMed] [Google Scholar]
- Zaslaver A, Kaplan S, Bren A, Jinich A, Mayo A, Dekel E, Alon U, Itzkovitz S (2009) Invariant distribution of promoter activities in Escherichia coli. PLoS Comput Biol 5: e1000545. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Zaslaver A, Mayo AE, Rosenberg R, Bashkin P, Sberro H, Tsalyuk M, Surette MG, Alon U (2004) Just-in-time transcription program in metabolic pathways. Nat Genet 36: 486–491 [DOI] [PubMed] [Google Scholar]
- Zeevi D, Sharon E, Lotan-Pompan M, Lubling Y, Shipony Z, Raveh-Sadka T, Keren L, Levo M, Weinberger A, Segal E (2011) Compensation for differences in gene copy number among yeast ribosomal proteins is encoded within their promoters. Genome Res 21: 2114–2128 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Zhu C, KJRP Byers, McCord RP, Shi Z, Berger MF, Newburger DE, Saulrieta K, Smith Z, Shah M V, Radhakrishnan M, Philippakis AA, Hu Y, De Masi F, Pacek M, Rolfs A, Murthy T, Labaer J, Bulyk ML (2009) High-resolution DNA-binding specificity analysis of yeast transcription factors. Genome Res 19: 556–566 [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.