Table 3.
Characteristics (sentence, word, and entity counts; words per sentence; entities per sentence; and entity density) of the five folds of the dataset and of the pool of querying data
Subset | Sentence count | Word count | Entity count | Words per sentence | Entities per sentence | Entity density^a |
---|---|---|---|---|---|---|
Fold 1 | 4,085 | 44,403 | 5,395 | 10.87 | 1.32 | 0.25 |
Fold 2 | 4,085 | 45,588 | 5,183 | 11.16 | 1.27 | 0.24 |
Fold 3 | 4,084 | 45,355 | 5,201 | 11.11 | 1.27 | 0.24 |
Fold 4 | 4,085 | 45,141 | 5,263 | 11.05 | 1.29 | 0.25 |
Fold 5 | 4,084 | 44,834 | 5,177 | 10.98 | 1.27 | 0.24 |
Pool (Fold 2 + 3 + 4 + 5) | 16,338 | 180,918 | 20,824 | 11.07 | 1.27 | 0.24 |
^a Entity density is the number of words belonging to entities divided by the total number of words.
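
The derived columns can be recomputed from the raw counts, which is a quick way to check the pool row. The following is a minimal sketch, not the authors' code: the counts are copied from the table, while the names `FOLDS`, `per_sentence`, `pool`, and `entity_density` are hypothetical. The table does not report how many words belong to entities, so `entity_density` assumes token-level BIO-style tags in which `"O"` marks a word outside any entity.

```python
# Raw counts copied from Table 3: (sentences, words, entities) per fold.
FOLDS = {
    "Fold 1": (4085, 44403, 5395),
    "Fold 2": (4085, 45588, 5183),
    "Fold 3": (4084, 45355, 5201),
    "Fold 4": (4085, 45141, 5263),
    "Fold 5": (4084, 44834, 5177),
}


def per_sentence(sentences, words, entities):
    """Return (words per sentence, entities per sentence), rounded to 2 places."""
    return round(words / sentences, 2), round(entities / sentences, 2)


def pool(names):
    """Sum the (sentences, words, entities) counts over the named folds."""
    return tuple(sum(FOLDS[name][i] for name in names) for i in range(3))


def entity_density(tags):
    """Fraction of words that belong to an entity.

    Assumption: `tags` holds one BIO-style label per word and "O" marks
    a word outside any entity. The tagging scheme is hypothetical here.
    """
    if not tags:
        return 0.0
    return sum(tag != "O" for tag in tags) / len(tags)


if __name__ == "__main__":
    pool_counts = pool(["Fold 2", "Fold 3", "Fold 4", "Fold 5"])
    print(pool_counts, per_sentence(*pool_counts))
```

Summing Folds 2 to 5 this way gives the counts in the pool row, and dividing by the sentence count gives its two per-sentence ratios.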