Skip to main content
. Author manuscript; available in PMC: 2018 Jan 5.
Published in final edited form as: CEUR Workshop Proc. 2016 Sep;1609:28–42.

Table 1.

Descriptive statistics of the QUAERO French Medical Corpus

EMEA MEDLINE

Training Development Test Training Development Test
Documents 3 3 4 833 832 833
Tokens 14,944 13,271 12,042 10,552 10,503 10,871
Entities 2,695 2,260 2,204 2,994 2,977 3,103
Unique Entities 923 756 658 2,296 2,288 2,390
Unique CUIs 648 523 474 1,860 1,848 1,909