Table 1.
Descriptive statistics of the QUAERO French Medical Corpus
| EMEA | MEDLINE | |||||
|---|---|---|---|---|---|---|
|
| ||||||
| Training | Development | Test | Training | Development | Test | |
| Documents | 3 | 3 | 4 | 833 | 832 | 833 |
| Tokens | 14,944 | 13,271 | 12,042 | 10,552 | 10,503 | 10,871 |
| Entities | 2,695 | 2,260 | 2,204 | 2,994 | 2,977 | 3,103 |
| Unique Entities | 923 | 756 | 658 | 2,296 | 2,288 | 2,390 |
| Unique CUIs | 648 | 523 | 474 | 1,860 | 1,848 | 1,909 |