Abstract
Studies in evolutionary developmental biology suggest that the structure of genetic pathways may bias the fixation of natural variation toward particular nodes in these pathways. In an attempt to test this trend genome wide, we integrated several previously published data sets to examine whether the position of genes in the whole-genome transcriptional network of Saccharomyces cerevisiae is associated with the amount of cis-regulatory expression divergence between S. cerevisiae and its sibling species Saccharomyces paradoxus. We find little evidence for an association between connectivity and divergence in the global network that combines data from multiple conditions. However, relationships between connectivity and divergence are apparent in some of the smaller subnetworks. Despite a slight tendency for genes with more transcriptional interactions to show greater divergence, these differences explain no more than a small fraction of variation in evolutionary rates. These results suggest that the systems biology focus on large interactomes may miss some critical details of local interactions. More detailed experimental analysis will be needed to define the genetic pathways that control specific phenotypic traits and quantify the rate of regulatory changes at different points in these pathways.
Keywords: gene networks, cis-regulatory evolution, pleiotropy, species divergence, S. cerevisiae, S. paradoxus
Introduction
Recent analyses suggest that the structure of genetic pathways may bias the fixation of natural variation toward particular nodes in these pathways. For example, parallel evolution of similar phenotypes in different lineages is sometimes caused by independent regulatory changes in the same genes (Sucena et al. 2003; Protas et al. 2006; Chan et al. 2010). A possible explanation is that these genes represent the optimally pleiotropic nodes in the genetic pathways that control the evolving traits. In other words, mutations in these genes have large effects on the trait under selection and little or no effect on other traits (Gompel and Prud'homme 2009; Kopp 2009; Stern and Orgogozo 2009; Streisfeld and Rausher 2009). A broad interpretation of this hypothesis is that a gene's pleiotropy in the context of intersecting regulatory pathways is a key predictor of its role in phenotypic evolution.
Evidence for this hypothesis comes from case studies in which the genetic basis of phenotypic variation was identified by forward genetic analysis. These studies are still few in number and may suffer from ascertainment biases because phenotypic changes that are controlled by one or a few genes of large effect are more likely to be chosen for detailed molecular dissection than more complex quantitative variation. A useful complementary approach may be to examine the effects of gene network topology on genome-wide patterns of regulatory evolution. Regulatory mutations play a major role in the evolution of many phenotypic traits (Wray 2007; Carroll 2008; Streisfeld and Rausher 2011). Thus, if pleiotropy exerts a strong and universal influence on the genetic basis of evolution, we expect a systematic relationship between the number of regulatory interactions in which a gene participates and the rate at which its expression diverges between species.
Here, we test this hypothesis by examining gene expression divergence between sibling species of yeast, Saccharomyces cerevisiae and Saccharomyces paradoxus. These species are closely related and share the same set of transcription factors (TFs) (Kellis et al. 2003; Liti et al. 2006), suggesting that their gene regulatory networks are similar. Despite strong reproductive isolation, S. cerevisiae and S. paradoxus can be hybridized in the laboratory (Naumov 1996; Scannell et al. 2007). This ability can be exploited to decompose gene expression divergence between these species into the cis- and trans-regulatory components (Tirosh et al. 2009). The cis-regulatory component provides a direct estimate of the cumulative effect of regulatory mutations that have accumulated at each locus since species divergence, while ignoring the indirect effects of changes in the more upstream levels of gene networks (the trans-component). At the same time, because yeast are single-celled organisms, it is possible to reconstruct their genome-wide transcriptional networks by identifying the direct downstream targets of each TF and the direct transcriptional regulators of each target gene (Lee et al. 2002; Harbison et al. 2004; Luscombe et al. 2004; Balaji, Babu, et al. 2006, 2008; Farkas et al. 2006; MacIsaac et al. 2006; Jothi et al. 2009). By integrating publicly available data sets on transcriptional network architecture and cis-regulatory expression divergence, we test the hypothesis that the position of genes in the gene network is a systemic predictor of their regulatory evolution.
Materials and Methods
Genome-wide estimates of cis-regulatory expression divergence between S. cerevisiae and S. paradoxus were obtained by Tirosh et al. (2009) based on allele-specific gene expression in F1 interspecific hybrids. In these hybrids, both alleles at each locus are present in the same trans-regulatory background, so that any differences in expression levels between the two alleles must be due to cis-regulatory divergence between the parental genotypes (Cowles et al. 2002; Wittkopp et al. 2004). The expression level of each allele was quantified separately by hybridizing F1 mRNA samples extracted under four different environmental conditions to microarrays that carried species-specific probes. Neither copy number variation nor epigenetic effects made a major contribution to allele-specific expression, and cis-regulatory effects were strongly correlated among environmental treatments (r ∼ 0.85) (Tirosh et al. 2009). We therefore used the average absolute difference between log2-transformed expression levels of the S. cerevisiae and S. paradoxus alleles across all conditions as the estimate of cis-regulatory divergence between these species (calculated from supplementary table S1 in Tirosh et al.). Using sum of squared differences across all conditions as the measure of cis-regulatory divergence produced similar results (supplementary tables 1 and 2, Supplementary Material online, compare with tables 1 and 2). An important caveat is that regulatory divergence was measured in a single F1 genotype produced by crossing isogenic parental strains (Tirosh et al. 2009). Divergence estimates may therefore confound fixed interspecific differences with standing variation segregating within either or both species.
Table 1.
Relationship between the Number of Outgoing Connections and Mean cis-Regulatory Expression Divergence.
| Network | Spearman's ρa | Spearman P Value | Kruskal–Wallis P Valueb | Wilcoxon P Valuec | Number of Genes | Reference |
|---|---|---|---|---|---|---|
| Whole genome | 0.062 | 0.484 | 0.405 | 0.983 | 123 | Balaji et al. (2006a) |
| Whole genome | 0.067 | 0.453 | 0.425 | 0.957 | 125 | Jothi et al. (2009) |
| Whole genome | 0.128 | 0.187 | 0.625 | 0.451 | 107 | Luscombe et al. (2004) |
| Transcription factors only | 0.104 | 0.262 | 0.759 | 0.588 | 115 | Balaji et al. (2006a) |
| Transcription factors only | 0.125 | 0.246 | 0.066 | 0.367 | 86 | MacIsaac et al. (2006) |
| Diauxic shift | 0.320 | 0.023* | 0.081 | 0.058 | 51 | Luscombe et al. (2004) |
| DNA damage response | 0.084 | 0.555 | 0.846 | 0.926 | 49 | Luscombe et al. (2004) |
| Cell cycle | −0.070 | 0.624 | 0.475 | 0.780 | 49 | Luscombe et al. (2004) |
| Sporulation | 0.321 | 0.020* | 0.213 | 0.075 | 53 | Luscombe et al. (2004) |
| Stress response | 0.430 | 0.005** | 0.008** | 0.003* | 43 | Luscombe et al. (2004) |
Rank correlation between the number of connections and expression divergence.
Genes were split into quartiles according to the number of connections; the test shows whether cis-regulatory divergence is significantly different among all four quartiles.
Test shows whether cis-regulatory divergence differs significantly between the top and bottom quartiles (those with the most and fewest connections).
* Corresponds to nominal P < 0.05, ** corresponds to P < 0.01.
Table 2.
Relationship between the Number of Incoming Connections and Mean cis-Regulatory Expression Divergence.
| Network | Spearman's ρa | Spearman P Value | Kruskal–Wallis P Valueb | Wilcoxon P Valuec | Number of Genes | Reference |
|---|---|---|---|---|---|---|
| Whole genome | 0.094 | <0.0001*** | <0.0001*** | <0.0001*** | 3,038 | Balaji et al. (2006a) |
| Whole genome | 0.101 | <0.0001*** | <0.0001*** | <0.0001*** | 3,098 | Jothi et al. (2009) |
| Whole genome | 0.110 | <0.0001*** | 0.302 | 0.041* | 2,408 | Luscombe et al. (2004) |
| Diauxic shift | 0.158 | <0.0001*** | 0.771 | 0.285 | 558 | Luscombe et al. (2004) |
| Transcription factors only | 0.149 | 0.110 | 0.675 | 0.424 | 115 | Balaji et al. (2006a) |
| DNA damage response | 0.216 | <0.0001*** | 0.384 | 0.635 | 486 | Luscombe et al. (2004) |
| Cell cycle | 0.117 | 0.008** | 0.033* | 0.800 | 203 | Luscombe et al. (2004) |
| Sporulation | 0.159 | 0.031* | 0.553 | 0.669 | 185 | Luscombe et al. (2004) |
| Stress response | 0.233 | <0.0001*** | 0.697 | 0.379 | 278 | Luscombe et al. (2004) |
Rank correlation between the number of connections and expression divergence.
Genes were split into quartiles according to the number of connections; the test shows whether cis-regulatory divergence is significantly different among all four quartiles.
Test shows whether cis-regulatory divergence differs significantly between the top and bottom quartiles (those with the most and fewest connections).
* Corresponds to nominal P < 0.05, **corresponds to P < 0.01, ****corresponds to P < 0.0001.
Transcriptional network topologies were inferred by several groups based mainly on genome-wide chromatin immunoprecipitation (ChIP) experiments. This approach uses in vivo binding between candidate TFs and regions of genomic DNA, quantified by hybridizing immunoprecipitated samples to tiling microarrays (ChIP-chip), as evidence of direct transcriptional regulation. Gene expression analysis shows that not all TF–DNA binding is biologically relevant (Gao et al. 2004; Nuzhdin et al. 2010). At the same time, many TF–DNA interactions may occur only under specific environmental conditions. Thus, ChIP-chip data contain a significant number of false positives and false negatives. Despite these caveats, TF–DNA binding provides the core evidence that, in combination with gene expression and genetic interaction studies, allows the identification of all or most direct transcriptional regulators of each target gene and most direct transcriptional targets of each TF. We used several versions of the S. cerevisiae transcriptional network (listed below), which were inferred by integrating ChIP-chip data with other data sets and have somewhat different topologies as a result. At present, no comparable data are available for S. paradoxus. Analyses presented here are predicated on the assumption that transcriptional networks are similar in S. cerevisiae, S. paradoxus, and their hybrids. This assumption seems reasonable in light of the close relationship between these species, but it remains to be verified. In each network, we quantified the number of “incoming” connections as the number of TFs inferred to regulate each gene and the number of “outgoing” connections as the number of inferred target genes of each TF.
The following network estimates were examined (tables 1 and 2):
Whole-genome networks of Luscombe et al. (2004), Balaji, Babu, et al. (2006), Balaji, Iyer, et al. (2006), and Jothi et al. (2009). The network of Luscombe et al. (obtained from http://sandy.topnet.gersteinlab.org/index2.html) includes 142 TFs, 3,420 target genes, and 7,074 inferred interactions. This network was constructed by integrating the original genome-wide ChIP-chip analysis performed under fixed environmental conditions (Lee et al. 2002) with gene expression data from other studies. The other two networks are based on expanded ChIP-chip analysis (Harbison et al. 2004), which used several different environmental conditions and identified DNA motifs that were bound by candidate TFs at high confidence (P < 0.001) and were conserved among multiple Saccharomyces species. The network of Balaji, Babu, et al. (2006) and Balaji, Iyer, et al. (2006) integrate data from the Lee et al. (2002) Harbison et al. (2004), Luscombe et al. (2004), and several smaller-scale studies and consists of 157 TFs, 4,410 target genes, and 12,873 regulatory interactions. The Jothi et al. network (from http://www.nature.com/msb/journal/v5/n1/suppinfo/msb200952_S1.html) is updated from the version by Balaji et al. and includes 158 TFs, 4,369 target genes, and 13,385 interactions.
Most target genes in whole-genome networks have only a small number of incoming connections (≤3), whereas TFs have on average a larger number of incoming connections. For this reason, analyses were repeated for two subnetworks that consisted solely of TFs. The network of MacIsaac et al. (2006) was updated from the version by Harbison et al. using a refined algorithm to measure the evolutionary conservation of binding sites. The TF-only subnetwork was obtained from http://fraenkel.mit.edu/improved_map/ and corresponds to figure 5 in MacIsaac et al. The TF-only subnetwork of Balaji et al. (2006a) is described in the supplementary table S5 from that publication.
Gene expression analysis shows that different transcriptional interactions are realized under different conditions, that is, the same TFs regulate different targets depending on the circumstances (Luscombe et al. 2004). We therefore examined five condition-specific subnetworks constructed by Luscombe et al.: cell cycle, sporulation, diauxic shift, DNA damage, and stress response (fig. 1 in Luscombe et al. 2004). These networks, consisting of 482–1,218 interactions each, were obtained from http://sandy.topnet.gersteinlab.org/index2.html.
Transcript abundance measurements are described by Holstege et al. (1998) and were obtained from http://web.wi.mit.edu/young/expression/transcriptome.html. Genome-wide DN and DS values for the S. cerevisiae–S. paradoxus pairwise comparison are described by Drummond et al. (2005).
The main question of interest is whether there is a relationship between the number of network connections and cis-regulatory divergence of the regulator genes and their targets. The distribution of the number of connections deviates from normality in all networks and for both incoming and outgoing connections (Kolmogorov–Smirnov test, P < 0.01 in all data sets), making Pearson correlation analysis inappropriate. Assuming that the suspected relationship would be linear, Spearman rank correlations were calculated for each network and tested for difference from zero (tables 1 and 2). The number of connections may however exhibit a nonlinear relationship with cis-regulatory divergence. In order to examine whether there were nonlinear trends in the data, genes in each data set were separated into four quartiles according to the number of outgoing or incoming connections and the median divergence was compared among quartiles using a Kruskal–Wallis test. To test more specifically whether genes with few connections differed from those with many connections, the median cis-regulatory divergence was compared between the first and fourth quartiles using a Wilcoxon test (tables 1 and 2). The number of network connections, cis-regulatory divergence, protein divergence, and transcript abundance data were not available for all genes, leading to the different numbers of genes and gene pairs in different comparisons.
Results
In whole-genome transcriptional networks, the number of outgoing connections, and cis-regulatory divergence are not significantly related (table 1 and fig. 1A), and there is no significant difference between the TFs with the lowest and highest numbers of downstream targets (table 1 and fig. 1B). There is a significant relationship between cis-regulatory divergence and the number of incoming connections, although the magnitude of the correlation is small (table 2 and fig. 1C). Genes regulated by many TFs have higher median cis-regulatory divergence than genes regulated by few TFs (table 2 and fig. 1D). In TFs-only subnetworks, where the numbers of incoming and outgoing connections are more evenly balanced, cis-regulatory divergence and the number of outgoing, incoming, and total connections are not significantly associated (tables 1 and 2 and data not shown).
FIG. 1.
Relationship between the number of connections and cis-regulatory expression divergence in the whole-genome transcriptional network (Balaji et al. 2006a). (A). Scatter plot of cis-regulatory divergence versus the number of outgoing connections. (B). Box plot of cis-regulatory divergence for genes with different numbers of outgoing connections. On the x axis, genes were divided into four quartiles according to the number of outgoing connections. Each box shows the second and third quartiles of cis-regulatory divergence values, with the dividing line indicating the median and the “whiskers” showing the first and fourth quartiles. (C). Scatter plot of cis-regulatory divergence versus the number of incoming connections. (D). Box plot of cis-regulatory divergence for genes with different numbers of incoming connections. The first and fourth quartiles of cis-regulatory divergence values are not shown due to their wide range.
The rate of protein evolution is negatively correlated with expression levels, and this effect often confounds other evolutionary rate comparisons (Koonin 2005; Pal et al. 2006). However, the relationship between cis-regulatory expression divergence and transcript abundance (Holstege et al. 1998) is not significant (ρ = −0.041, P = 0.104), whereas the relationship between transcript abundance and the number of network connections is positive (ρ = 0.349, P = <0.001), suggesting that transcript abundance is unlikely to explain the relationship between connectivity and cis-regulatory divergence.
Whole-genome networks represent the average behavior of transcriptional regulators and their targets across multiple conditions. If a TF functions separately in two or more genetic pathways that regulate different traits, its connectivity in the whole-genome network may not be a correct reflection of the influence exerted by any of these intersecting pathways on cis-regulatory evolution. In such cases, smaller gene networks that mediate specific cellular conditions and responses may have greater predictive value. To test this proposition, we examined the relationship between network connectivity and regulatory divergence in five condition-specific subnetworks (Luscombe et al. 2004). Only one of these subnetworks (mediating stress response) reveals a significant positive correlation between the number of outgoing connections and expression divergence (table 1 and fig. 2A) and a significant difference between TFs with the lowest and highest numbers of target genes (table 1 and fig. 2B). This relationship appears to be due to a small number of TFs with the most targets (fig. 2A and B). Most of the remaining subnetworks show no significant evidence of a relationship between outgoing connectivity and cis-regulatory divergence (table 1). A significant positive correlation is seen between cis-regulatory divergence and the number of incoming connections, but there is no significant difference between the quartiles of genes with the highest and lowest numbers of upstream regulators in any of the subnetworks (table 2).
FIG. 2.
Relationship between the number of connections and cis-regulatory expression divergence in the stress response network (Luscombe et al. 2004). (A). Scatter plot of cis-regulatory divergence versus the number of outgoing connections. (B). Box plot of cis-regulatory divergence for genes with different numbers of outgoing connections. (C). Scatter plot of cis-regulatory divergence versus the number of incoming connections. (D). Box plot of cis-regulatory divergence for genes with different numbers of incoming connections. See figure 1 legend for more details.
Jothi et al. (2009) have classified TFs as “top,” “core,” or “bottom,” according to their position in the regulatory hierarchy. The core TFs have the highest average number of connections (Jothi et al. 2009). Although the mean cis-regulatory divergence is higher for the core TFs compared with the top, bottom, and unconnected TFs (0.588 vs. 0.431–0.466), these differences are not significant (Kruskal–Wallis test, P = 0.561). Jothi et al. (2009) also distinguish between 32 “hubs,” which together account for >20% of all regulatory connections in the whole-genome network, and the remaining TFs. The mean cis-regulatory divergence is 0.666 for the hubs, compared with 0.462 for the non-hub TFs, but this difference is also not significant (Wilcoxon test, P = 0.124). Thus, there is little evidence that the most highly connected TFs undergo faster regulatory evolution than other TFs.
It has been suggested that evolutionary changes may often be concentrated at regulatory genes that occupy nexus positions in genetic pathways (i.e., receive multiple inputs and generate multiple outputs) (Gompel and Prud'homme 2009; Kopp 2009; Stern and Orgogozo 2009). We classified TFs in the five condition-specific subnetworks of Luscombe et al. (2004) as “nexus” if they had at least two incoming and at least two outgoing connections and as “non-nexus” otherwise. The nexus TFs showed higher cis-regulatory divergence than non-nexus TFs in the sporulation subnetwork (0.746 vs. 0.434; Wilcoxon test, P = 0.014) but not in the other four subnetworks or in the combined data from all five subnetworks. This result suggests that faster regulatory evolution of nexus regulators may be limited to relatively few genetic pathways.
Some TFs control largely the same sets of target genes under all conditions, whereas others regulate largely nonoverlapping sets of targets depending on the cell state. Luscombe et al. (2004) quantified these differences in the form of interchange indices, such that I = 0 means that a TF regulates the same targets under all conditions and I = 1 means that a TF changes its targets completely between conditions. This is analogous to the “party hubs” and “date hubs” in protein–protein interaction networks, respectively (Han et al. 2004). There is no correlation between interchange index and cis-regulatory divergence (Spearman ρ = 0.017, P = 0.734) and no systematic difference in the rate of divergence between TFs that retain all interactions across multiple states (I < 0.1) and those that replace all interactions in switching conditions (I > 0.9) (Wilcoxon P = 0.877). This differs from protein interaction networks, where party hubs show significantly slower evolution than date hubs (Fraser 2005).
Discussion
In the whole-genome transcriptional network, there is no evidence that TFs with a greater number of direct targets evolve faster at the cis-regulatory level. More restricted, condition-specific networks show only limited evidence for a relationship between regulatory divergence and the number of outgoing connections (table 1). The overall conclusion emerging from this analysis is that although the more highly connected TFs tend to show faster cis-regulatory evolution, this difference is slight and explains only a small fraction of variation in evolutionary rates.
There is a significant relationship between cis-regulatory divergence and the number of incoming connections, but the magnitude of this effect is small. This relationship might be explained by differences in mutational target size. Genes that are regulated by a greater number of direct activators and repressors are likely to have more TF binding sites, increasing the probability of cis-regulatory mutations that alter their expression. The slight positive correlation between expression divergence and the number of incoming connections is therefore consistent with neutral or nearly neutral evolution of gene expression.
Although these results do not refute the pleiotropy conjecture, they do not provide support for it either. There may be several, not mutually exclusive, explanations for this finding. First, the concentration of evolutionary changes at 1 or a few nexus loci may represent relatively rare exceptions, affecting only the pathways with unusual topological features. Most pathways may not have such nexus points, leading to a more diffuse genetic basis of evolutionary change. Second, pleiotropy in yeast may be fundamentally different from the type seen in multicellular organisms. In animals and plants, modular organization of cis-regulatory sequences allows each gene to participate in several distinct regulatory pathways that operate in different cell types or at different stages of development. This modularity allows gene function is each of the cell type–specific pathways to evolve independently of the others (Wray 2007; Carroll 2008). Thus, unicellularity and simple cis-regulatory regions—the very features that make it meaningful to speak of a whole-genome transcriptional network in yeast—may make them poor models for understanding the evolution of genetic pathways in multicellular organisms. Finally, the low correlation between connectivity and regulatory divergence may reflect the limitations of transcriptional network data based on genome-wide ChIP analysis. By pooling or omitting information about condition-specific regulatory interactions, whole-genome networks can obscure biologically meaningful trends. On the other hand, experimental dissection of evolutionary changes in precisely defined regulatory pathways can pinpoint the loci of evolutionary change with high confidence, yet provides only anecdotal, case-by-case evidence that is difficult to generalize in statistically sound ways. Indeed, a recent meta analysis shows that reproducible biases in the genetic basis of phenotypic evolution that are evident in trait-by-trait analyses can be obscured by the pooling of data from multiple traits (Streisfeld and Rausher 2011). Perhaps the greatest promise lies in the integration of these approaches. Detailed experimental analysis will be necessary to decompose whole-genome transcriptional networks into modular genetic pathways that control specific phenotypic traits. A systematic analysis of evolutionary changes in these pathways, ideally in multiple independent lineages (Kopp 2009; Streisfeld and Rausher 2011), will show whether pathway topology exercises a consistent influence on the genetic basis of phenotypic evolution.
Supplementary Material
Supplementary tables 1 and 2 are available at Molecular Biology and Evolution online (http://www.mbe.oxfordjournals.org/).
Acknowledgments
We are grateful to Tim Connallon, Illes Farkas, Richard Jovelin, and Matt Hahn for sharing data sets and to Matt Hahn, Ondrej Podlaha, and two anonymous reviewers for comments on the manuscript. This work was supported by National Institutes of Health (grant 5-R01-GM-082843 to A.K.).
References
- Balaji S, Babu MM, Iyer LM, Luscombe NM, Aravind L. Comprehensive analysis of combinatorial regulation using the transcriptional regulatory network of yeast. J Mol Biol. 2006;360:213–227. doi: 10.1016/j.jmb.2006.04.029. [DOI] [PubMed] [Google Scholar]
- Balaji S, Iyer LM, Aravind L, Babu MM. Uncovering a hidden distributed architecture behind scale-free transcriptional regulatory networks. J Mol Biol. 2006;360:204–212. doi: 10.1016/j.jmb.2006.04.026. [DOI] [PubMed] [Google Scholar]
- Balaji S, Iyer LM, Babu MM, Aravind L. Comparison of transcription regulatory interactions inferred from high-throughput methods: what do they reveal? Trends Genet. 2008;24:319–323. doi: 10.1016/j.tig.2008.04.006. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Carroll SB. Evo-devo and an expanding evolutionary synthesis: a genetic theory of morphological evolution. Cell. 2008;134:25–36. doi: 10.1016/j.cell.2008.06.030. [DOI] [PubMed] [Google Scholar]
- Chan YF, Marks ME, Jones FC, et al. (16 co-authors) Adaptive evolution of pelvic reduction in sticklebacks by recurrent deletion of a Pitx1 enhancer. Science. 2010;327:302–305. doi: 10.1126/science.1182213. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Cowles CR, Hirschhorn JN, Altshuler D, Lander ES. Detection of regulatory variation in mouse genes. Nat Genet. 2002;32:432–437. doi: 10.1038/ng992. [DOI] [PubMed] [Google Scholar]
- Drummond DA, Bloom JD, Adami C, Wilke CO, Arnold FH. Why highly expressed proteins evolve slowly. Proc Natl Acad Sci U S A. 2005;102:14338–14343. doi: 10.1073/pnas.0504070102. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Farkas IJ, Wu C, Chennubhotla C, Bahar I, Oltvai ZN. Topological basis of signal integration in the transcriptional-regulatory network of the yeast, Saccharomyces cerevisiae. BMC Bioinformatics. 2006;7:478. doi: 10.1186/1471-2105-7-478. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Fraser HB. Modularity and evolutionary constraint on proteins. Nat Genet. 2005;37:351–352. doi: 10.1038/ng1530. [DOI] [PubMed] [Google Scholar]
- Gao F, Foat BC, Bussemaker HJ. Defining transcriptional networks through integrative modeling of mRNA expression and transcription factor binding data. BMC Bioinformatics. 2004;5:31. doi: 10.1186/1471-2105-5-31. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Gompel N, Prud'homme B. The causes of repeated genetic evolution. Dev Biol. 2009;332:36–47. doi: 10.1016/j.ydbio.2009.04.040. [DOI] [PubMed] [Google Scholar]
- Han JD, Bertin N, Hao T, et al. (11 co-authors) Evidence for dynamically organized modularity in the yeast protein-protein interaction network. Nature. 2004;430:88–93. doi: 10.1038/nature02555. [DOI] [PubMed] [Google Scholar]
- Harbison CT, Gordon DB, Lee TI, et al. Transcriptional regulatory code of a eukaryotic genome. Nature. 2004;431:99–104. doi: 10.1038/nature02800. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Holstege FC, Jennings EG, Wyrick JJ, Lee TI, Hengartner CJ, Green MR, Golub TR, Lander ES, Young RA. Dissecting the regulatory circuitry of a eukaryotic genome. Cell. 1998;95:717–728. doi: 10.1016/s0092-8674(00)81641-4. [DOI] [PubMed] [Google Scholar]
- Jothi R, Balaji S, Wuster A, Grochow JA, Gsponer J, Przytycka TM, Aravind L, Babu MM. Genomic analysis reveals a tight link between transcription factor dynamics and regulatory network architecture. Mol Syst Biol. 2009;5:294. doi: 10.1038/msb.2009.52. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Kellis M, Patterson N, Endrizzi M, Birren B, Lander ES. Sequencing and comparison of yeast species to identify genes and regulatory elements. Nature. 2003;423:241–254. doi: 10.1038/nature01644. [DOI] [PubMed] [Google Scholar]
- Koonin EV. Systemic determinants of gene evolution and function. Mol Syst Biol. 2005;1 doi: 10.1038/msb4100029. 2005 0021. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Kopp A. Metamodels and phylogenetic replication: a systematic approach to the evolution of developmental pathways. Evolution. 2009;63:2771–2789. doi: 10.1111/j.1558-5646.2009.00761.x. [DOI] [PubMed] [Google Scholar]
- Lee TI, Rinaldi NJ, Robert F, et al. (21 co-authors) Transcriptional regulatory networks in Saccharomyces cerevisiae. Science. 2002;298:799–804. doi: 10.1126/science.1075090. [DOI] [PubMed] [Google Scholar]
- Liti G, Barton DB, Louis EJ. Sequence diversity, reproductive isolation and species concepts in Saccharomyces. Genetics. 2006;174:839–850. doi: 10.1534/genetics.106.062166. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Luscombe NM, Babu MM, Yu H, Snyder M, Teichmann SA, Gerstein M. Genomic analysis of regulatory network dynamics reveals large topological changes. Nature. 2004;431:308–312. doi: 10.1038/nature02782. [DOI] [PubMed] [Google Scholar]
- MacIsaac KD, Wang T, Gordon DB, Gifford DK, Stormo GD, Fraenkel E. An improved map of conserved regulatory sites for Saccharomyces cerevisiae. BMC Bioinformatics. 2006;7:113. doi: 10.1186/1471-2105-7-113. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Naumov GI. Genetic identification of biological species in the Saccharomyces sensu stricto complex. J Ind Microbiol Biotechnol. 1996;17:295–302. [Google Scholar]
- Nuzhdin SV, Rychkova A, Hahn MW. The strength of transcription-factor binding modulates co-variation in transcriptional networks. Trends Genet. 2010;26:51–53. doi: 10.1016/j.tig.2009.12.005. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Pal C, Papp B, Lercher MJ. An integrated view of protein evolution. Nat Rev Genet. 2006;7:337–348. doi: 10.1038/nrg1838. [DOI] [PubMed] [Google Scholar]
- Protas ME, Hersey C, Kochanek D, Zhou Y, Wilkens H, Jeffery WR, Zon LI, Borowsky R, Tabin CJ. Genetic analysis of cavefish reveals molecular convergence in the evolution of albinism. Nat Genet. 2006;38:107–111. doi: 10.1038/ng1700. [DOI] [PubMed] [Google Scholar]
- Scannell DR, Butler G, Wolfe KH. Yeast genome evolution—the origin of the species. Yeast. 2007;24:929–942. doi: 10.1002/yea.1515. [DOI] [PubMed] [Google Scholar]
- Stern DL, Orgogozo V. Is genetic evolution predictable? Science. 2009;323:746–751. doi: 10.1126/science.1158997. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Streisfeld MA, Rausher MD. Genetic changes contributing to the parallel evolution of red floral pigmentation among Ipomoea species. New Phytol. 2009;183:751–763. doi: 10.1111/j.1469-8137.2009.02929.x. [DOI] [PubMed] [Google Scholar]
- Streisfeld MA, Rausher MD. Population genetics, pleiotropy, and the preferential fixation of mutations during adaptive evolution. Evolution. 2011;65:629–642. doi: 10.1111/j.1558-5646.2010.01165.x. [DOI] [PubMed] [Google Scholar]
- Sucena E, Delon I, Jones I, Payre F, Stern DL. Regulatory evolution of shavenbaby/ovo underlies multiple cases of morphological parallelism. Nature. 2003;424:935–938. doi: 10.1038/nature01768. [DOI] [PubMed] [Google Scholar]
- Tirosh I, Reikhav S, Levy AA, Barkai N. A yeast hybrid provides insight into the evolution of gene expression regulation. Science. 2009;324:659–662. doi: 10.1126/science.1169766. [DOI] [PubMed] [Google Scholar]
- Wittkopp PJ, Haerum BK, Clark AG. Evolutionary changes in cis and trans gene regulation. Nature. 2004;430:85–88. doi: 10.1038/nature02698. [DOI] [PubMed] [Google Scholar]
- Wray GA. The evolutionary significance of cis-regulatory mutations. Nat Rev Genet. 2007;8:206–216. doi: 10.1038/nrg2063. [DOI] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.


