Skip to main content
. 2022 May 31;23:207. doi: 10.1186/s12859-022-04739-2

Table 5.

Comparing the data structures to compute the goProfiles test and the one based on enrichment contingency tables.

Non-enriched in both lists Enriched only in list 1 Enriched only in list 2 Enriched in both lists
GO term number 1 a=n00 a+1 b=a+n10 =n.0 b+1 c=b+n01 c+1 c+n11 =n
Annotation frequency in gene list 1 F11 F1a F1(a+1) F1b F1(b+1) F1c F1(c+1) F1n
Annotation frequency in gene list 2 F21 F2a F2(a+1) F2b F2(b+1) F2c F2(c+1) F2n
Enrichment in list 1 0 0 1 1 0 0 1 1
Enrichment in list 2 0 0 0 0 1 1 1 1

In the latter test, the annotation frequencies are substituted by 0 and 1 (i.e., “non-enriched” and “enriched” GO term.) and if the test is based on the Sorensen–Dice similarity, the first set of GO terms (non-enriched in both lists) is ignored. The GO terms are arbitrarily ordered: from left to right, first there are all those non-enriched in both lists (n00 in total), next those enriched in the first list but not in the second one (n10), then those enriched in the second list but not in the first (n01) and finally those GO terms enriched in both lists (n11)