Skip to main content
. 2022 Nov 19;28(1):7. doi: 10.1007/s10664-022-10229-z

Table 6.

Top-3 frequently appearing label combinations: a set of 1-label, 2-label, 3-label, and 4-label in the dataset

Combination of Labels % of co-occurrence % appearance as a set
1-label
(data_exploration) 39.37% 27.76%
(data_preprocessing) 24.29% 16.74%
(modelling) 18.24% 9.53%
2-label
(data_exploration, data_preprocessing) 5.25% 4.59%
(evaluation, modelling) 4.27% 2.18%
(data_exploration, load_data) 3.3% 2.39%
3-label
(evaluation, modelling, prediction) 1.16% 0.53%
(evaluation, helper_functions, modelling) 0.85% 0.42%
(data_exploration, helper_functions, load_data) 0.63% 0.52%
4-label
(evaluation, helper_functions, modelling, prediction) 0.32% 0.27%
(evaluation, modelling, prediction, result_visualization) 0.17% 0.13%
(evaluation, helper_functions, load_data, modelling) 0.11% 0.08%

% of co-occurrence indicates how frequent the combination of labels appears (may have other labels in the same cell) in the dataset. % appearance as a set indicates how frequent the combination of labels appears as a set (without other labels in the cell) in the dataset