Input: k maximum number of selected subgroups, |
set of ranked generated subgroups, with complexities ranging from to
|
minimum delta to consider that a subgroup has a higher effect size11
|
splitByComplexity # split by subgroup complexity ( corresponds to complexity 1, to complexity 2, …) |
# Initialize , the set of top candidate subgroups |
For
to
do
|
For
g in
do # g: candidate subgroup |
If
p-value(g) max(p-values()) and size
then continue to next c |
For
s in
do # s: subgroup in the top-k |
If redundant() then
|
If complexity(g) complexity(s) then
|
continue to next g
|
If complexity(g) complexity(s) then
|
If EffectSize(g) EffectSize(
then
|
continue to next g
|
For
s in
do
|
If redundant() and complexity(g) complexity(s) and
|
EffectSize(g) EffectSize(
and
p-value(g)
p-value(s) then
|
|
|
while size( > k
do
|
subgroup from with the highest p-value
|
Output: # top-k best candidate subgroups |