FIGURE 3.

Applied sampling methods. Imbalanced datasets consist of healthy patients (), special subsets of healthy patients (), and sick patients (). Oversampling adds synthetically created sick patients to the dataset. Undersampling methods reduce the number of data points: random sampling randomly excludes healthy patients, informed sampling excludes only specific subsets of healthy patients. In each box the studies applying the respective method are given