Skip to main content
. 2017 Jul 5;17(Suppl 2):82. doi: 10.1186/s12911-017-0466-9

Table 3.

Characteristics (counts of sentences, words, and entities, words per sentence, entities per sentence, and entity density) in five folds of the dataset and the pool of querying data

Sentence count Word count Entity Count Words per sentence Entities per sentence Entity densitya
Fold 1 4,085 44,403 5,395 10.87 1.32 0.25
Fold 2 4,085 45,588 5,183 11.16 1.27 0.24
Fold 3 4,084 45,355 5,201 11.11 1.27 0.24
Fold 4 4,085 45,141 5,263 11.05 1.29 0.25
Fold 5 4,084 44,834 5,177 10.98 1.27 0.24
Pool (Fold 2 + 3 + 4 + 5) 16,338 180,918 20,824 11.07 1.27 0.24

aEntity density is the number of words of the entities divided by the total number of words