Table 6:
Word Classification: the best model for word classification for each approach: (1) ALL CATEGORIES BINARY: binary classification ignorance or not, (2) AN ENSEMBLE OF BINARY CLASSIFIERS: binary classification for each class (reported) combined to create the ensemble, and (3) ALL CATEGORIES COMBINED: one multi-classifier to all categories.
| Ignorance Category | Model | testing F1 score | testing support |
|---|---|---|---|
| ALL CATEGORIES BINARY | BioBERT | 0.89 | 7601 |
| answered question | BioBERT | 0.89 | 320 |
| unknown/novel | CRF | 0.98 | 155 |
| explicit inquiry | BioBERT | 0.97 | 43 |
| incompletely understood | BioBERT | 0.93 | 2809 |
| indefinite relationship | BioBERT | 0.97 | 1205 |
| largely understood | BioBERT | 0.94 | 618 |
| anomalous/curious | BioBERT | 0.96 | 399 |
| alternative/controversy | BioBERT | 0.91 | 598 |
| difficult | CRF | 0.93 | 128 |
| problem/complication | BioBERT | 0.9 | 238 |
| future work | BioBERT | 0.89 | 391 |
| future prediction | BioBERT | 0.94 | 100 |
| important consideration | BioBERT | 0.93 | 608 |
| ALL CATEGORIES COMBINED* | BioBERT | 0.82 | 6239 |
Reporting the average F1 score of all the categories for one multi-classifier.