Skip to main content
. 2017 May 13;25(1):72–80. doi: 10.1093/jamia/ocx045

Table 2.

Statistics of annotated data

Statistics Training Set Test Set
No. of sentences 4000 1000
No. of words 84 830 21 036
No. of unique words 8108 3405
No. of body part mentions 1002 258
No. of chem mentions 2181 538
No. of device mentions 1545 406
No. of event mentions 1784 545
No. of flavor mentions 176 13
No. of num mentions 1353 389
No. of people mentions 7607 1888
No. of time mentions 1019 221
No. of unit mentions 227 201