Table 6.
Top-3 most frequent label combinations of size 1, 2, 3, and 4 in the dataset
Combination of Labels | % of co-occurrence | % appearance as a set |
---|---|---|
1-label | | |
(data_exploration) | 39.37% | 27.76% |
(data_preprocessing) | 24.29% | 16.74% |
(modelling) | 18.24% | 9.53% |
2-label | | |
(data_exploration, data_preprocessing) | 5.25% | 4.59% |
(evaluation, modelling) | 4.27% | 2.18% |
(data_exploration, load_data) | 3.30% | 2.39% |
3-label | | |
(evaluation, modelling, prediction) | 1.16% | 0.53% |
(evaluation, helper_functions, modelling) | 0.85% | 0.42% |
(data_exploration, helper_functions, load_data) | 0.63% | 0.52% |
4-label | | |
(evaluation, helper_functions, modelling, prediction) | 0.32% | 0.27% |
(evaluation, modelling, prediction, result_visualization) | 0.17% | 0.13% |
(evaluation, helper_functions, load_data, modelling) | 0.11% | 0.08% |
% of co-occurrence indicates how frequently the combination of labels appears in a cell in the dataset, possibly alongside other labels in the same cell. % appearance as a set indicates how frequently the combination appears as the cell's complete label set, with no other labels in the cell.
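
The difference between the two columns can be illustrated with a minimal sketch. It assumes that each cell is represented as a set of labels and that both percentages are taken over the total number of cells; the function name `combination_frequencies` and the toy `cells` list are hypothetical and are not part of the dataset or its tooling.

```python
from collections import Counter
from itertools import combinations


def combination_frequencies(cells, size):
    """Map each label combination of the given size to
    (% of co-occurrence, % appearance as a set).

    Assumption: both percentages use the total number of cells
    as the denominator.
    """
    total = len(cells)
    co_occurrence = Counter()  # combination is a subset of the cell's labels
    exact = Counter()          # combination equals the cell's labels
    for labels in cells:
        ordered = tuple(sorted(set(labels)))
        # Every size-k subset of the cell's labels co-occurs in this cell.
        for combo in combinations(ordered, size):
            co_occurrence[combo] += 1
        # The cell counts "as a set" only when it carries no other labels.
        if len(ordered) == size:
            exact[ordered] += 1
    return {
        combo: (100 * count / total, 100 * exact[combo] / total)
        for combo, count in co_occurrence.items()
    }


# Hypothetical toy data: four cells with their labels.
cells = [
    {"data_exploration"},
    {"data_exploration", "data_preprocessing"},
    {"modelling", "evaluation"},
    {"modelling", "evaluation", "prediction"},
]

pairs = combination_frequencies(cells, 2)
singles = combination_frequencies(cells, 1)
```

In this toy data, the pair (evaluation, modelling) co-occurs in two of the four cells but forms the complete label set of only one, so its co-occurrence percentage is higher than its appearance-as-a-set percentage, which is the same pattern seen in every row of the table.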