Table 1.
Descriptive statistics of the QUAERO French Medical Corpus
EMEA | MEDLINE | |||||
---|---|---|---|---|---|---|
| ||||||
Training | Development | Test | Training | Development | Test | |
Documents | 3 | 3 | 4 | 833 | 832 | 833 |
Tokens | 14,944 | 13,271 | 12,042 | 10,552 | 10,503 | 10,871 |
Entities | 2,695 | 2,260 | 2,204 | 2,994 | 2,977 | 3,103 |
Unique Entities | 923 | 756 | 658 | 2,296 | 2,288 | 2,390 |
Unique CUIs | 648 | 523 | 474 | 1,860 | 1,848 | 1,909 |