Table 5.
Comparing the data structures to compute the goProfiles test and the one based on enrichment contingency tables.
| Non-enriched in both lists | Enriched only in list 1 | Enriched only in list 2 | Enriched in both lists | |||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| GO term number | 1 | |||||||||||
| Annotation frequency in gene list 1 | ||||||||||||
| Annotation frequency in gene list 2 | ||||||||||||
| Enrichment in list 1 | 0 | 0 | 1 | 1 | 0 | 0 | 1 | 1 | ||||
| Enrichment in list 2 | 0 | 0 | 0 | 0 | 1 | 1 | 1 | 1 | ||||
In the latter test, the annotation frequencies are substituted by 0 and 1 (i.e., “non-enriched” and “enriched” GO term.) and if the test is based on the Sorensen–Dice similarity, the first set of GO terms (non-enriched in both lists) is ignored. The GO terms are arbitrarily ordered: from left to right, first there are all those non-enriched in both lists ( in total), next those enriched in the first list but not in the second one (), then those enriched in the second list but not in the first () and finally those GO terms enriched in both lists ()