Table 1.
Statistics of the entity types in the 2 electronic medical record data sets.
| Data sets and entity type | Training set, n | Test set, n |
|---|---|---|
| Yidu-S4K | | |
| Disease | 4212 | 1323 |
| Anatomy | 8426 | 3094 |
| Laboratory | 1195 | 590 |
| Image | 969 | 348 |
| Medicine | 1822 | 485 |
| Operation | 1029 | 162 |
| All entities | 17,653 | 6002 |
| Self-annotated | | |
| Disease | 9470 | 4504 |
| Symptoms | 26,334 | 11,065 |
| Anatomy | 17,877 | 7588 |
| Examination | 19,664 | 8746 |
| Instrument | 1244 | 560 |
| Medicine | 5314 | 2566 |
| Operation | 2578 | 1133 |
| All entities | 82,481 | 36,162 |