Bioinformatics. 2023 Nov 24;39(12):btad716. doi: 10.1093/bioinformatics/btad716

Table 4.

Typo corpora statistics.

Statistic                                     GSC+     EHR
Number of documents                           228      100
Unique HPO concepts                           461      252
Annotations containing typographical errors   61,902   57,789
Total typographical error tokens              95,191   37,242