Skip to main content
. 2018 Jul 17;19:270. doi: 10.1186/s12859-018-2264-5

Fig. 4.

Fig. 4

Influence of n and p: subsampling experiment based on dataset ID=310. Top: Boxplot of the performance (acc) of RF (dark) and LR (white) for N=50 sub-datasets extracted from the OpenML dataset with ID=310 by randomly picking nn observations and p<p features. Bottom: Boxplot of the differences in performances Δacc=AccRFAccLR between RF and LR. p∈{1,2,3,4,5,6}. n∈{5e2,1e3,5e3,1e4}. Performance is evaluated through 5-fold-cross-validation repeated 2 times