. 2005 Jun;15(6):856–866. doi: 10.1101/gr.3760605

Table 2.

Evaluation of the induced rules using the binding data from Lee et al. (2002)

			Binding data evaluation (significant fractions P < 0.01)
				Random tests
Expression data	Expression similarity thresholds	No. rules unique/all	Rules (P-value)	Similar expression	Common motifs	Random
Cell cycle	0.250	39/109	0.54 (0.000)	0.11	0.17	0.02
Sporulation	0.250	45/81	0.13 (0.708)	0.09	0.18	0.02
Diauxic shift	0.200	150/428	0.29 (0.000)	0.06	0.18	0.02
Heat and cold shock	0.125	52/123	0.52 (0.000)	0.18	0.18	0.02
Pheromone	0.150	53/91	0.39 (0.001)	0.14	0.17	0.02
DNA-damaging agents	0.200	59/116	0.35 (0.000)	0.10	0.17	0.02

A rule is said to be significant if at least one transcription factor binding any of the matching genes obtained a Bonferroni-corrected P-value of <0.01 (only experimental bindings at P < 0.01 from Lee et al. 2002 were considered). The table gives the fraction of significant rules for each data set, and compares these values to what is observed when randomly selecting corresponding sets of genes with only similar expression, common binding sites, or neither. All three random tests produce a P-value that is the probability of observing a higher value than the one reported for the rules. We show the highest of these P-values in parentheses and mark the corresponding random test in bold if this P-value is >0. The table also gives the Euclidean distance threshold (normalized by the number of measurement points) used to define similar expression profiles and the number of rules induced for each expression data set (number of rules unique to that data set/all rules derived for that data set). Additional statistics on the standard deviation of the random test scores and comparisons to a new binding data set (Harbison et al. 2004) may be found at our Web site.