Abstract
How to combine heterogeneous data sources for reliable prediction of transcriptional regulation is a challenge. Here we present an easy but powerful method to integrate Chromatin immunoprecipitation (ChIP)-chip and knock-out data. Since these two types of data provide complementary (physical and functional) information about transcription, the method combining them is expected to achieve high detection rates and very low false positive rates. We try to seek the optimal integration of these two data using hyper-geometric distribution. We evaluate our method on yeast data and compare our predictions with YEASTRACT, high-quality ChIP-chip data, and literature. The results show that even using low-quality ChIP-chip data, our method uncovers more relations than those inferred before from high-quality data. Furthermore our method achieves a low false positive rate. We find experimental and computational evidence in literature for most transcription factor (TF)-gene relations uncovered by our method.
Keywords: regulatory interaction, transcription factor, ChIP, knock-out data, P-value threshold, hypergeometric distribution, cooperativity
Introduction
The dynamic program that a cell utilizes in response to internal and external stimuli is carried out through coordinated action of many genes and proteins. Transcriptional regulation plays an important role in the program. Thus unraveling transcriptional interactions is critical to our understanding of the complex regulation mechanisms.
Recent advances in high-throughput DNA micro-arrays and chromatin immunoprecipitation (ChIP) assays have provided us with an unprecedented amount of information about transcriptional regulation on a genomic scale. Gene expression profiles under various conditions are the key data source for inferring transcriptional relations. Some researchers modeled gene expression data using random Boolean networks, mutual information, and probabilistic models to reconstruct regulatory networks.1–18 These approaches, although useful, provide only indirect evidence of regulatory interactions. Gene perturbation experiments (e.g. transcription factor (TF) knock-out) and ChIP-chip experiments serve as complementary data sources. Gene perturbation experiments uncover functional relations between TFs and their target genes, but they cannot distinguish those indirect relations from direct ones. Hu et al profiled expression with individual deletions of 263 transcription factors in S. cerevisiae and used directed-weighted graph modeling and regulatory epistasis analysis to remove indirect regulatory relationships.19 ChIP-chip experiments provide direct physical information of the binding between TFs and DNA regions. However, ChIP-chip binding data may not be functional in terms of transcriptional regulation. Most importantly, both types of data are insufficient independently, and depending on the chosen P-value threshold, include many false positive or false negative TF-target relationships.
Since each data source provides partial but complementary information, some research has attempted to integrate those diverse data sources for regulatory network reconstruction.20–37 A typical approach is to first find potential co-regulated genes and the genes that are further analyzed for other biological evidence, such as common binding motifs and common Gene Ontology (GO) terms. Bar-Joseph et al24 relaxed the ChIP-chip P-value threshold if there was strong evidence from expression data. Harbison et al26 combined ChIP-chip data, six motif-discovering algorithms, and phylogenetic conservation to construct an initial map of yeast’s transcriptional regulatory code. Lemmens et al30 integrated three independent data sources: ChIP-chip data, motif information, and gene expression profiles to correlate regulatory programs with regulators and corresponding motifs to a set of co-expressed genes.
Here we present a novel method to infer relations between TF and target genes by integrating the TF knock-out data and ChIP-chip binding data. Since TF knock-out data suggest functional relations, while ChIP-chip binding data provide physical interactions, the intersection of these two types of data shows strong evidence about transcriptional relations between TF and target genes. However, Hu et al19 found that the overlap is quite low, which may be caused by the low quality of the data and the stringent and arbitrary P-value threshold (p < = 0.001). In order to increase the intersection with less false positives, we range both of the P-value thresholds from 0.001 to 0.05 and try to find the optimal P-value threshold pair, at which the most significant intersection is obtained. We demonstrate the method on the yeast data, where it shows that the intersection increases quite a lot. Most inferred TF-target relations have experimental evidence or other computational evidence, which is inferred by combining ChIP-chip data, phylogenetic conservation, motif discovery, other expression data, and enrichment for genes in the same Gene Ontology. The method could be easily extended to identify cooperativity among transcription factors or combine other heterogeneous high-throughput data.
Methods
We integrated the ChIP-chip binding data from Harbison et al26 and the TF knock out expression data from Hu et al19 The overlap of these two data was low at stringent P-value thresholds (both of the P-value < = 0.001). While using lenient P-value thresholds might well improve the overlap, it might also produce many false positives. In order to improve the overlap with less false positives, we ranged both of the P-value thresholds from 0.001 to 0.05 in steps of 0.001, and tried to find for each TF the optimal P-value threshold pair, at which the most significant intersection would be obtained. The schematic diagram of the method is shown in Figure 1.
Selecting sets of target genes
Let us denote by G the common pool of genes that ChIP-chip and knock-out experiments used. Considering a specific transcription factor t, we identify two subsets of G, binding target set Bt and effectual target set Et. Bt includes genes with significant ChIP-chip binding to TF t (binding P-value < Pbt), while Et contains the genes whose mRNA expression are significantly altered in the transcription factor t knockout experiments (P-value < Pet). Pbt and Pet are P-value thresholds for binding and knock-out experiments respectively. Finally we define the intersection of these two sets Bt and Et as It = Bt ⋒ Et.
Calculating the significance of the intersection
To statistically access the significance of the intersection of the two target sets, we calculate the probability of obtaining an intersection size |It| this large or greater, given the two sets are independent. With the assumption that It is randomly picked, the size of the intersection |It| is distributed according to the hypergeometric distribution. The probability to obtain an intersection size |It| is computed by the formula, where x represents the random variable for the intersection of two target sets.
The P-value Pt as the probability of observing an intersection this large or greater can thus be computed by the formula, where x represents the random variable for the intersection of two target sets.
Searching the optimal P-value threshold pair
For each transcription factor t, we consider all possible combinations of Pbt and Pet on a scale ranging from 0.001 to 0.05 by an increment of 0.001. The significance of the intersection for each combination is obtained as Pt(Pbt, Pet). Finally, we compare all 2500 (50 × 50) combinations and find the minimum one Pt*, which is the most significant. The corresponding P-value thresholds are considered to be the optimal pair (Pbt*, Pet*). The intersection for choosing the optimal threshold pair, It* is more likely to be the truly target set of the transcription factor t.
Results
The first 30 transcription factors with statistically significant (P*t < 1e − 4) intersection between the binding target set and the effectual target set were chosen for further analysis. Overall, 631 unique TF-target gene interactions have been identified using our method, containing 5971 genes regulated by those 30 transcription factors. On the other hand, 430 of the TF-target gene interactions (430/631 = 68.15%) would not be detected if we selected the traditional stringent P-value threshold (both of the P-value < = 0.001). The targets, the optimal P-value threshold pair, and the intersection significance for all TFs are shown in Supplementary Table 1.
Comparison with YEASTRACT database
YEASTRACT database presently contains regulatory associations of the yeast genes based on more than 1000 bibliographic references.38,39 To validate our results, we compared the targets identified in our method with documented associations between a Transcription Factor and a target gene in YEASTRACT, which are supported by published data showing at least one of the experimental evidences. As a result, 440 out of the 631 associations in our results have been confirmed. (Those relations found in YEASTARCT are shown in supplementary Table 2). The number of identified targets with stringent P-value cutoff in comparison to that using our method has been shown in Figure 2. The results show that our method significantly reduces the false negatives with less false positives. As an example, RAP1 was assigned to a set of 126 regulated genes using our method, while only 70 targets were identified with stringent P-value cutoff. Out of the remaining 56 targets with our method we found other experimental evidence in YEASTRACT for 51.
Comparison with high-quality ChIP-chip data
Hu et al19 found that the overlap between the binding target set and the effectual target set improved when using the different high-quality ChIP-chip data, suggesting that data quality may be one reason for the low overlap. Our results indicated that the stringent P-value cutoff may be another reason. Even with the low-quality ChIP-chip data, our method obtained 126 common targets for RAP1 between the binding targets and effectual ones, compared with 144 shared between the binding targets from high-quality ChIP-chip and effectual ones. However, out of the 126 targets we found other experimental evidence in YEASTRACT for 121. Furthermore, 104 out of the 126 targets were proven with high-quality ChIP-chip data. In contrast, although only 70 RAP1 targets can be identified at the 0.001 P-value cutoffs, there are still 8 of them not proven. These results indicate that we have reduced 42 false negatives by using relaxed P-value for binding data at the expense of increasing 14 “false positives” even if the high-quality ChIP-chip data are treated as gold standard dataset. However, out of these 14 “false positives” we have found other experimental evidence in YEASTRACT for 9 (see Fig. 3A).
We compared our results with SWI4 high-quality ChIP-chip data (see Fig. 3B), which also suggests that our method can obtain more reliable relations even with the low-quality ChIP-chip data. Only 10 were in the intersection of the binding targets set from low-quality ChIP-chip data and effectual targets set with stringent P-value cutoffs. Also only 16 appeared in the intersection of the binding targets set from high-quality ChIP-chip data and effectual targets set. However, 48 were detected using a pair of relaxed optimal P-value cutoffs (0.04 for the binding P-value and 0.029 for the effectual P-value) even with the low-quality ChIP-chip data. 23 out of the 48 targets are proven with high-quality ChIP-chip data. Additionally, 39 out of the 48 targets have been confirmed in YEASTRACT. Out of the 9 remaining targets (HHF1 HHT1 YER158C, HSP150, BDF1, SUR7, NDE1, HOR7, RSN1) for which we cannot find evidence in YEASTARCT, HHT1 and HHF1 are histone genes. Whole-genome binding studies have suggested that the histone gene promoters are bound by MBF and/or SBF40,41 and Hess et al’s data42 showed that swi4Δ causes a mild reduction in HHT1and HHF1 mRNA levels. Furthermore MBF (Mbp1 and Swi6) and SBF (Swi4 and Swi6) cause transcriptional defects at HTA1-HTB1 and HHT1-HHF1.42 Inferred from the above information, HHT1 and HHF1 may be the novel targets of SWI4. Reinoso-Martín43 found that HSP150 mRNA levels were slightly induced by caspofungin after 1 hour in wild-type cells but increased significantly in the swi4Δ mutant, which suggests that HSP150 is one target of SWI4.
Overlap with literature
Our results have well coincided with previous biological literature. As an example, consider Leu3, a pathway-specific regulator of genes encoding enzymes involved in branched-chain amino acid biosynthesis. Using our methods, we have found that LEU3 regulates 5 additional genes (LEU4, ILV5, ILV3, ALD5 and ISU2) that would not have been identified using the stringent 0.001 P-value threshold pair. Two of them (LEU4 and ILV5) are among the seven established LEU3 targets that comprise the pathway for branched amino acid biosynthesis.44 Three of these genes (LEU4, ILV5 and ILV3) have been annotated as being involved in “branched chain family amino acid biosynthesis”. Furthermore, the other two genes (ALD5 and ISU2) have been inferred as Leu3 targets using computational methods combining ChIP and expression analyses.45
As another example, consider GCR1, which is required for maximal transcription of many genes, including genes encoding glycolytic enzymes. Tpi1p is an abundant glycolytic enzyme that makes up about 2% of the soluble cellular protein while GCR1 binding is required for activation of TPI1.46 Other glycolytic genes such as ENO2 and ADH1 are dependent on GCR1 gene function for full expression.47,48 Finally, consider transcription factors that have functions previously reported to control the cell cycle during growth. The UME6 gene of S. cerevisiae was identified as a mitotic repressor of early meiosis-specific gene expression. It provides target specificity by binding to the URS1 sequence element (TAGCCGCCGA) that is located upstream from many early meiosis-specific genes. UME6 (“Unscheduled Meiotic gene Expression”) is a key transcriptional regulator of early meiotic genes such as SPO149,50 and SPO13. 49–51 In addition to the regulation of meiosis-specific genes, UME6 has been implicated in the transcriptional regulation of genes involved in arginine catabolism. Expression of the catabolic genes CAR1 encoding arginase and omithine transaminase is repressed by nitrogen. Previous studies have indicated that the UME6 gene is involved in mediating this repression.51,52
To further validate our results, we selected some transcriptional factors whose target genes prediction showed a relatively low overlap with information from YEASTRACT, and compared them with other predictions of MacIsaac KD et al53 and Pham TH et al54 MacIsaac KD et al53 combined phylogenetic conservation-based motif discovery algorithms, PhyloCon, and Converge to create a refined regulatory map for S. cerevisiae by reanalyzing the same ChIP-chip binding data. Pham TH et al54 developed a method that combined three different expression datasets with the same ChIP-chip binding data with a relaxed threshold (P-value = 0.005) to discover target genes based on rule induction. Although our methods combined data TF knock-out data different than MacIsaac KD et al53 and Pham TH et al54 and used a different approach, the results showed that most of our predictions that were not supported by YEASTARCT could be proven by data from MacIsaac KD et al53 and Pham TH et al.54 For example, our method identified 21 additional targets of SWI6 with the optimal relaxed P-value thresholds pair. Unfortunately we could find evidence from YEASTRACT for only 5 of them. However, 13 of the 21 additional targets were also predicted by the study of MacIsaac KD et al53 and 9 of them were inferred by the study of Pham TH et al54 Combining the evidence from the above two sources and information from YEASTRACT, 14 in 21 have been convinced of genuine targets of SWI6 (see Table 1). Among the left 7 target genes, YMR144 W and YOR248 W were predicted as SWI6 targets by Harbison et al26 Other five genes (CIS3, YER079 W, FTR1, PLB3, and HTZ1) could be inferred as SWI6 targets as they showed close relationship with the SBF complex (SWI4/SWI6). CIS3, a glycoprotein-encoding gene, was reported to have conserved binding sites for SWI6-SWI4 complex.55 YER079 W, FTR1, PLB3, and HTZ1 also showed evidence to be related with SWI4.55 As another example, although all of the 10 additional targets of DIG1 could not be supported by YEASTRACT, 6 targets could be found in the results of MacIsaac KD et al53 and 5 in Pham TH et al54 (see Table 2). In the remaining 4 genes, MFA1 and AGA2 were involved in mating or pheromone response;56 they stood a good chance to be the targets of DIG1, which was also known to be involved in the regulation of mating-specific genes and the invasive growth pathway.57
Table 1.
TFs | ORF | YEASTRACT | MacIsaac KD et al53 | Pham TH et al54 | Literature evidence |
---|---|---|---|---|---|
SWI6 | YBR071W | x | x | ||
SWI6 | CHA1 | x | x | ||
SWI6 | HTA1 | x | x | x | |
SWI6 | YER079W | ||||
SWI6 | PUP3 | x | x | x | |
SWI6 | SWI4 | x | x | ||
SWI6 | FTR1 | ||||
SWI6 | CIS3 | ||||
SWI6 | RPS4A | x | x | ||
SWI6 | HMS2 | x | x | ||
SWI6 | CWP2 | x | x | x | |
SWI6 | EXG1 | x | x | x | |
SWI6 | YOX1 | x | x | x | x |
SWI6 | YMR144W | ||||
SWI6 | SCW10 | x | x | x | x |
SWI6 | PLB3 | ||||
SWI6 | HTZ1 | ||||
SWI6 | SKM1 | x | x | x | |
SWI6 | SRL1 | x | x | x | x |
SWI6 | YOR248W | ||||
SWI6 | OPY2 | x | x | x | x |
Table 2.
Gene ontology enrichment analysis
Finally, to ensure that we found biologically meaningful targets, we performed Gene ontology analyses using the Saccharomyces Genome Database web site to evaluate whether a gene set was enriched for biologically relevant targets (see Table 3). It turned out that the regulated gene sets generally identified groups of genes that functioned in a similar biological pathway and were generally accurate in assigning regulators to sets of genes whose functions were consistent with the regulators’ known roles. For example, ARG80 was well known to be a transcription factor required for specific regulation of arginine metabolism in yeast.58 Four out of the five genes (P-value ≤ 3e-15) (ARG5,6/YER069W, ARG3/YJL088 W, ARG8/YOL140W, CPA1/YOR303W) that we identified using our method were annotated as being involved in “arginine biosynthetic process”. The same situation happened to GAL80, which was a well-characterized transcription factor involved in a genetic switch. The switch, which consisted of three proteins, controlled the genes that encoded the enzymes required for galactose metabolism at the level of transcription.59 Four out of six genes that we identified as the targets of GAL80 (GAL7/YBR018C, GAL10/YBR019C, GAL1/YBR020W, GAL2/YLR081W) were involved in galactose metabolic process. For another example, GCR2 was the transcription factor affecting expression of most glycolytic genes in S. cerevisiae.60 All six of these genes were directly on the committed pathway to leucine or valine biosynthesis (PGI1/YBR196C, TPI1/YDR050C, TDH3/YGR192C, TDH2/YJR009C, FBA1/YKL060C, and GPM1/YKL152C).
Table 3.
Regulators | Functional description of regulators | # of genes | Significantly shared GO annotations | P value |
---|---|---|---|---|
RAP1 | High level transcriptional activation of genes encoding ribosomal proteins and glycolytic enzymes | 126 | (86/126) structural constituent of ribosome | 2.66E–100 |
(91/126) translation | 8.79E–83 | |||
SUM1 | Mitotic repression of middle sporulation-specific genes, general replication initiation | 45 | (16/45) sporulation | 9.93E–15 |
LEU3 | Regulates the transcription of genes encoding enzymes involved in branched-chain amino acid synthesis | 10 | (6/10) branched chain family amino acid biosynthetic process | 1.45E-13 |
UME6 | Transcriptional regulator of early meiotic genes, transcriptional regulation of genes involved in arginine catabolism | 44 | (2/43) arginine catabolic process | 0.00571 |
(7/43) meiosis | 0.00797 | |||
TEC1 | Required for full Ty1 expression, Ty1-mediated gene activation | 20 | (17/20) transposition, RNA-mediated | 3.85E-25 |
ARG81 | Involved in the regulation of arginine-responsive genes | 9 | (6/9) arginine metabolic process | 2.78E-13 |
SFP1 | Controls expression of many ribosome biogenesis genes in response to nutrients and stress, regulates G2/M transitions during mitotic cell cycle and DNA-damage response | 66 | (42/66) structural constituent of ribosome | 8.62E-45 |
(46/66) translation | 2.90E-39 | |||
RFX1 | Involved in DNA damage and replication checkpoint pathway | 6 | (3/6) deoxyribonucleotide biosynthetic process | 2.18E-07 |
GCR1 | Transcriptional activators of glycolytic genes | 50 | (10/50) glycolysis | 5.94E-14 |
BAS1 | Involved in the expression of genes encoding enzymes acting in the histidine, purine, and pyrimidine biosynthetic pathways | 12 | (4/12) purine ribonucleoside monophosphate biosynthetic process | 1.28E-07 |
ARG80 | Involved in regulation of arginine-responsive genes | 5 | (4/5) arginine biosynthetic process | 7.63E-10 |
DIG1 | Involved in the regulation of mating-specific genes, inhibits pheromone-responsive transcription | 13 | (8/13) sexual reproduction | 1.89E-09 |
(8/13) response to pheromone | 2.93E-10 | |||
HMS1 | Overexpression confers hyperfilamentous growth | 23 | (15/23) cytosolic part | 1.02E-15 |
ACE2 | Activates transcription of genes expressed in the G1 phase | 12 | (4/12) cytokinesis, completion of separation | 8.06E-08 |
YAP1 | Activates the transcription of anti-oxidant genes in response to oxidative stress | 8 | (4/8) response to oxidative stress | 5.32E-05 |
OPI1 | Negative regulation of phospholipid biosynthetic genes | 3 | (2/3) fatty acid synthase complex | 2.54E-06 |
SKO1 | Cytosolic and nuclear protein involved in osmotic and oxidative stress responses | 9 | (2/9) structural constituent of cell wall | 0.00094 |
TYE7 | transcriptional activator in Ty1-mediated gene expression, binds E-boxes of glycolytic genes and contributes to their activation | 24 | (9/24) transposition, RNA-mediated | 7.74E-08 |
(4/24) glycolysis | 9.21E-05 | |||
GAL80 | involved in transcriptional regulation in response to galactose | 6 | (4/6) galactose metabolic process | 1.86E-09 |
GCR2 | transcriptional activators of glycolytic genes | 6 | (6/6) glycolysis | 5.08E-14 |
SUT1 | involved in sterol uptake; involved in induction of hypoxic gene expression | 13 | (3/13) structural constituent of cell wall | 1.37E-05 |
INO2 | required for derepression of phospholipid biosynthetic genes in response to inositol depletion | 5 | (4/5) lipid biosynthetic process | 2.77E-05 |
Functional description of regulators is from the Saccharomyces Genome Database.
Gene Ontology analysis done using GO Term Finder in SGD in Aug 31, 2008; 5952 genes were included in the background set with P-value cut-off < 0.01.
Discussion
ChIP-chip data contain information about physically binding interactions, while TF knock-out experiments provide information about functional relations. By combining these two complementary data sources, the method is expected to uncover the TF-target relations. However, the data quality and the arbitrary P-value threshold lead to the low overlap between these two data. In this study, we developed a novel method to integrate these two data for inferring TF-target gene relations. The key aspect of our approach is to find the optimal P-value threshold pair for each TF, at which the most significant overlap is obtained. Our method is powerful because it allows the P-value threshold to be relaxed if there is supporting evidence from each of these two complementary data. Comparison of the results with the YEASTRACT and the literature shows that experimental evidence exists for most of TF-target gene relations in our results. Considering those relations between TF and target genes for which there is no direct experimental evidence, we are able to found other computational evidence. Furthermore a plausible explanation could often be inferred from the functional links between the TF and target genes.
It should be noted that although we focused on the TF-target gene relations, our method could be easily extended to discover the cooperativity among transcription factors by combining these two data from different TFs. It could also be used to combine the information from multiple ChIP-chip experiments on the same TF when these data are available. With more and more genomic data available, it will become an inevitable trend to study the complex biological systems based on computational integration of those heterogeneous data. Our work provides a simple but novel method to integrate available biological information in a principled fashion.
Acknowledgments
This work is supported by the National Basic Research Program of China (Grant Nos. 2009CB918404, 2006CB910700), International S&T Cooperation Program of China (Grant No. 2007DFA31040), the National Natural Science Foundation of China (Grant No. 30700154), and the School Youth Found of Shanghai Jiao Tong University.
Footnotes
Disclosures
The authors report no conflicts of interest.
References
- 1.Friedman N, Linial M, Nachman I, Pe’er D. Using Bayesian networks to analyze expression data. J Comput Biol. 2000;7:3–4. 601–20. doi: 10.1089/106652700750050961. [DOI] [PubMed] [Google Scholar]
- 2.Ideker TE, Thorsson V, Karp RM. Discovery of regulatory interactions through perturbation: inference and experimental design. Pac Symp Biocomput. 2000:305–16. doi: 10.1142/9789814447331_0029. [DOI] [PubMed] [Google Scholar]
- 3.Birnbaum K, Benfey PN, Shasha DE. cis element/transcription factor analysis (cis/TF): a method for discovering transcription factor/cis element relationships. Genome Res. 2001 Sep;11(9):1567–73. doi: 10.1101/gr.158301. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.Zhu Z, Pilpel Y, Church GM. Computational identification of transcription factor binding sites via a transcription-factor-centric clustering (TFCC) algorithm. J Mol Biol. 2002 Apr 19;318(1):71–81. doi: 10.1016/S0022-2836(02)00026-8. [DOI] [PubMed] [Google Scholar]
- 5.Imoto S, Goto T, Miyano S. Estimation of genetic networks and functional structures between genes by using Bayesian networks and nonparametric regression. Pac Symp Biocomput. 2002:175–86. [PubMed] [Google Scholar]
- 6.Qian J, Lin J, Luscombe NM, Yu H, Gerstein M. Prediction of regulatory networks: genome-wide identification of transcription factor targets from gene expression data. Bioinformatics. 2003 Oct 12;19(15):1917–26. doi: 10.1093/bioinformatics/btg347. [DOI] [PubMed] [Google Scholar]
- 7.Gardner TS, di Bernardo D, Lorenz D, Collins JJ. Inferring genetic networks and identifying compound mode of action via expression profiling. Science. 2003 Jul 4;301(5629):102–5. doi: 10.1126/science.1081900. [DOI] [PubMed] [Google Scholar]
- 8.Segal E, Shapira M, Regev A, et al. Module networks: identifying regulatory modules and their condition-specific regulators from gene expression data. Nat Genet. 2003 Jun;34(2):166–76. doi: 10.1038/ng1165. [DOI] [PubMed] [Google Scholar]
- 9.Nachman I, Regev A, Friedman N. Inferring quantitative models of regulatory networks from expression data. Bioinformatics. 2004 Aug 4;20(Suppl 1):i248–56. doi: 10.1093/bioinformatics/bth941. [DOI] [PubMed] [Google Scholar]
- 10.Zou M, Conzen SD. A new dynamic Bayesian network (DBN) approach for identifying gene regulatory networks from time course microarray data. Bioinformatics. 2005 Jan 1;21(1):71–9. doi: 10.1093/bioinformatics/bth463. [DOI] [PubMed] [Google Scholar]
- 11.Schafer J, Strimmer K. An empirical Bayes approach to inferring large-scale gene association networks. Bioinformatics. 2005 Mar;21(6):754–64. doi: 10.1093/bioinformatics/bti062. [DOI] [PubMed] [Google Scholar]
- 12.Basso K, Margolin AA, Stolovitzky G, Klein U, Dalla-Favera R, Califano A. Reverse engineering of regulatory networks in human B cells. Nat Genet. 2005 Apr;37(4):382–90. doi: 10.1038/ng1532. [DOI] [PubMed] [Google Scholar]
- 13.Margolin AA, Nemenman I, Basso K, et al. ARACNE: an algorithm for the reconstruction of gene regulatory networks in a mammalian cellular context. BMC Bioinformatics. 2006;7(Suppl 1):S7. doi: 10.1186/1471-2105-7-S1-S7. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Li X, Rao S, Jiang W, et al. Discovery of Time-Delayed Gene Regulatory Networks based on temporal gene expression profiling. BMC Bioinformatics. 2006;7:26. doi: 10.1186/1471-2105-7-26. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Wang Y, Joshi T, Zhang XS, Xu D, Chen L. Inferring gene regulatory networks from multiple microarray datasets. Bioinformatics. 2006 Oct 1;22(19):2413–20. doi: 10.1093/bioinformatics/btl396. [DOI] [PubMed] [Google Scholar]
- 16.Sayyed-Ahmad A, Tuncay K, Ortoleva PJ. Transcriptional regulatory network refinement and quantification through kinetic modeling, gene expression microarray data and information theory. BMC Bioinformatics. 2007;8:20. doi: 10.1186/1471-2105-8-20. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17.Vu TT, Vohradsky J. Nonlinear differential equation model for quantification of transcriptional regulation applied to microarray data of Saccharomyces cerevisiae. Nucleic Acids Res. 2007;35(1):279–87. doi: 10.1093/nar/gkl1001. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18.Faith JJ, Hayete B, Thaden JT, et al. Large-scale mapping and validation of Escherichia coli transcriptional regulation from a compendium of expression profiles. PLoS Biol. 2007 Jan;5(1):e8. doi: 10.1371/journal.pbio.0050008. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Hu Z, Killion PJ, Iyer VR. Genetic reconstruction of a functional transcriptional regulatory network. Nat Genet. 2007 May;39(5):683–7. doi: 10.1038/ng2012. [DOI] [PubMed] [Google Scholar]
- 20.Pilpel Y, Sudarsanam P, Church GM. Identifying regulatory networks by combinatorial analysis of promoter elements. Nat Genet. 2001 Oct;29(2):153–9. doi: 10.1038/ng724. [DOI] [PubMed] [Google Scholar]
- 21.Palin K, Ukkonen E, Brazma A, Vilo J. Correlating gene promoters and expression in gene disruption experiments. Bioinformatics. 2002;18(Suppl 2):S172–80. doi: 10.1093/bioinformatics/18.suppl_2.s172. [DOI] [PubMed] [Google Scholar]
- 22.Horng JT, Huang HD, Huang SL, Yan UC, Chang YC. Mining putative regulatory elements in promoter regions of Saccharomyces cerevisiae. In Silico Biol. 2002;2(3):263–73. [PubMed] [Google Scholar]
- 23.Haverty PM, Hansen U, Weng Z. Computational inference of transcriptional regulatory networks from expression profiling and transcription factor binding site identification. Nucleic Acids Res. 2004;32(1):179–88. doi: 10.1093/nar/gkh183. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24.Bar-Joseph Z, Gerber GK, Lee TI, et al. Computational discovery of gene modules and regulatory networks. Nat Biotechnol. 2003 Nov;21(11):1337–42. doi: 10.1038/nbt890. [DOI] [PubMed] [Google Scholar]
- 25.Gao F, Foat BC, Bussemaker HJ. Defining transcriptional networks through integrative modeling of mRNA expression and transcription factor binding data. BMC Bioinformatics. 2004 Mar 18;5(31) doi: 10.1186/1471-2105-5-31. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26.Harbison CT, Gordon DB, Lee TI, et al. Transcriptional regulatory code of a eukaryotic genome. Nature. 2004 Sep 2;431(7004):99–104. doi: 10.1038/nature02800. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27.Kato M, Hata N, Banerjee N, Futcher B, Zhang MQ. Identifying combinatorial regulation of transcription factors and binding motifs. Genome Biol. 2004;5(8):R56. doi: 10.1186/gb-2004-5-8-r56. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28.Das D, Banerjee N, Zhang MQ. Interacting models of cooperative gene regulation. Proc Natl Acad Sci U S A. 2004 Nov 16;101(46):16234–9. doi: 10.1073/pnas.0407365101. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29.Jin VX, Rabinovich A, Squazzo SL, Green R, Farnham PJ. A computational genomics approach to identify cis-regulatory modules from chromatin immunoprecipitation microarray data—a case study using E2F1. Genome Res. 2006 Dec;16(12):1585–95. doi: 10.1101/gr.5520206. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30.Lemmens K, Dhollander T, De Bie T, et al. Inferring transcriptional modules from ChIP-chip, motif and microarray data. Genome Biol. 2006;7(5):R37. doi: 10.1186/gb-2006-7-5-r37. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31.Geier F, Timmer J, Fleck C. Reconstructing gene-regulatory networks from time series, knock-out data, and prior knowledge. BMC Syst Biol. 2007;1:11. doi: 10.1186/1752-0509-1-11. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32.Tuncay K, Ensman L, Sun J, et al. Transcriptional regulatory networks via gene ontology and expression data. In Silico Biol. 2007;7(1):21–34. [PubMed] [Google Scholar]
- 33.Werhli AV, Husmeier D. Gene regulatory network reconstruction by Bayesian integration of prior knowledge and/or different experimental conditions. J Bioinform Comput Biol. 2008 Jun;6(3):543–72. doi: 10.1142/s0219720008003539. [DOI] [PubMed] [Google Scholar]
- 34.Zhao W, Serpedin E, Dougherty ER.Recovering genetic regulatory networks from chromatin immunoprecipitation and steady-state microarray data EURASIP J Bioinform Syst Biol 2008:248747. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 35.Zhang Y, Xuan J, de los Reyes BG, Clarke R, Ressom HW. Network motif-based identification of transcription factor-target gene relationships by integrating multi-source biological data. BMC Bioinformatics. 2008;9:203. doi: 10.1186/1471-2105-9-203. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36.Li H, Zhan M. Unraveling transcriptional regulatory programs by integrative analysis of microarray and transcription factor binding data. Bioinformatics. 2008 Sep 1;24(17):1874–80. doi: 10.1093/bioinformatics/btn332. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 37.Boden M, Bailey TL. Associating transcription factor-binding site motifs with target GO terms and target genes. Nucleic Acids Res. 2008 Jul;36(12):4108–17. doi: 10.1093/nar/gkn374. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 38.Teixeira MC, Monteiro P, Jain P, et al. The YEASTRACT database: a tool for the analysis of transcription regulatory associations in Saccharomyces cerevisiae Nucleic Acids Res 2006January134(Database issue)D446–51. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 39.Monteiro PT, Mendes ND, Teixeira MC, et al. YEASTRACT-DISCOVERER: new tools to improve the analysis of transcriptional regulatory associations in Saccharomyces cerevisiae Nucleic Acids Res 2008January36(Database issue):D132–6. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 40.Iyer VR, Horak CE, Scafe CS, Botstein D, Snyder M, Brown PO. Genomic binding sites of the yeast cell-cycle transcription factors SBF and MBF. Nature. 2001 Jan 25;409(6819):533–8. doi: 10.1038/35054095. [DOI] [PubMed] [Google Scholar]
- 41.Simon I, Barnett J, Hannett N, et al. Serial regulation of transcriptional regulators in the yeast cell cycle. Cell. 2001 Sep 21;106(6):697–708. doi: 10.1016/s0092-8674(01)00494-9. [DOI] [PubMed] [Google Scholar]
- 42.Hess D, Winston F. Evidence that Spt10 and Spt21 of Saccharomyces cerevisiae play distinct roles in vivo and functionally interact with MCB-binding factor, SCB-binding factor and Snf1. Genetics. 2005 May;170(1):87–94. doi: 10.1534/genetics.104.039214. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 43.Reinoso-Martin C, Schuller C, Schuetzer-Muehlbauer M, Kuchler K. The yeast protein kinase C cell integrity pathway mediates tolerance to the anti-fungal drug caspofungin through activation of Slt2p mitogen-activated protein kinase signaling. Eukaryot Cell. 2003 Dec;2(6):1200–10. doi: 10.1128/EC.2.6.1200-1210.2003. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 44.Friden P, Schimmel P. LEU3 of Saccharomyces cerevisiae activates multiple genes for branched-chain amino acid biosynthesis by binding to a common decanucleotide core sequence. Mol Cell Biol. 1988 Jul;8(7):2690–97. doi: 10.1128/mcb.8.7.2690. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 45.Boer VM, Daran JM, Almering MJ, de Winde JH, Pronk JT. Contribution of the Saccharomyces cerevisiae transcriptional regulator Leu3p to physiology and gene expression in nitrogen- and carbon-limited chemostat cultures. FEMS Yeast Res. 2005 Jul;5(10):885–97. doi: 10.1016/j.femsyr.2005.04.003. [DOI] [PubMed] [Google Scholar]
- 46.Scott EW, Baker HV. Concerted action of the transcriptional activators REB1, RAP1, and GCR1 in the high-level expression of the glycolytic gene TPI. Mol Cell Biol. 1993 Jan;13(1):543–50. doi: 10.1128/mcb.13.1.543. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 47.Willett CE, Gelfman CM, Holland MJ. A complex regulatory element from the yeast gene ENO2 modulates GCR1-dependent transcriptional activation. Mol Cell Biol. 1993 Apr;13(4):2623–33. doi: 10.1128/mcb.13.4.2623. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 48.Huie MA, Scott EW, Drazinic CM, et al. Characterization of the DNA-binding activity of GCR1: in vivo evidence for two GCR1-binding sites in the upstream activating sequence of TPI of Saccharomyces cerevisiae. Mol Cell Biol. 1992 Jun;12(6):2690–700. doi: 10.1128/mcb.12.6.2690. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 49.Goldmark JP, Fazzio TG, Estep PW, Church GM, Tsukiyama T. The Isw2 chromatin remodeling complex represses early meiotic genes upon recruitment by Ume6p. Cell. 2000 Oct 27;103(3):423–33. doi: 10.1016/s0092-8674(00)00134-3. [DOI] [PubMed] [Google Scholar]
- 50.Steber CM, Esposito RE. UME6 is a central component of a developmental regulatory switch controlling meiosis-specific gene expression. Proc Natl Acad Sci U S A. 1995 Dec 19;92(26):12490–4. doi: 10.1073/pnas.92.26.12490. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 51.Strich R, Surosky RT, Steber C, Dubois E, Messenguy F, Esposito RE. UME6 is a key regulator of nitrogen repression and meiotic development. Genes Dev. 1994 Apr 1;8(7):796–810. doi: 10.1101/gad.8.7.796. [DOI] [PubMed] [Google Scholar]
- 52.Park HD, Luche RM, Cooper TG. The yeast UME6 gene product is required for transcriptional repression mediated by the CAR1 URS1 repressor binding site. Nucleic Acids Res. 1992 Apr 25;20(8):1909–15. doi: 10.1093/nar/20.8.1909. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 53.MacIsaac KD, Wang T, Gordon DB, Gifford DK, Stormo GD, Fraenkel E. An improved map of conserved regulatory sites for Saccharomyces cerevisiae. BMC Bioinformatics. 2006;7:113. doi: 10.1186/1471-2105-7-113. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 54.Pham TH, Clemente JC, Satou K, Ho TB. Computational discovery of transcriptional regulatory rules. Bioinformatics. 2005 Sep 1;21(Suppl 2):ii101–7. doi: 10.1093/bioinformatics/bti1117. [DOI] [PubMed] [Google Scholar]
- 55.Lee HJ, Manke T, Bringas R, Vingron M. Prioritization of gene regulatory interactions from large-scale modules in yeast. BMC Bioinformatics. 2008;9:32. doi: 10.1186/1471-2105-9-32. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 56.Kim PM, Tidor B. Subsystem identification through dimensionality reduction of large-scale gene expression data. Genome Res. 2003 Jul;13(7):1706–18. doi: 10.1101/gr.903503. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 57.Cook JG, Bardwell L, Kron SJ, Thorner J. Two novel targets of the MAP kinase Kss1 are negative regulators of invasive growth in the yeast Saccharomyces cerevisiae. Genes Dev. 1996 Nov 15;10(22):2831–48. doi: 10.1101/gad.10.22.2831. [DOI] [PubMed] [Google Scholar]
- 58.Dubois E, Bercy J, Messenguy F. Characterization of two genes, ARGRI and ARGRIII required for specific regulation of arginine metabolism in yeast. Mol Gen Genet. 1987 Apr;207(1):142–8. doi: 10.1007/BF00331501. [DOI] [PubMed] [Google Scholar]
- 59.Timson DJ, Ross HC, Reece RJ.Gal3p and Gal1p interact with the transcriptional repressor Gal80p to form a complex of 1:1 stoichiometry Biochem J 2002May1363(Pt 3)515–20. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 60.Uemura H, Jigami Y. Role of GCR2 in transcriptional activation of yeast glycolytic genes. Mol Cell Biol. 1992 Sep;12(9):3834–42. doi: 10.1128/mcb.12.9.3834. [DOI] [PMC free article] [PubMed] [Google Scholar]