Skip to main content
. 2020 Jan 30;21:35. doi: 10.1186/s12859-020-3375-3

Table 2.

Statistics of name variations for the Bio-ID corpus

Properties Training set Test set
# IDs 5282 1980
# Single Var. 4133 1689
# Multiple Var. / Synonymy Rate 1149 / 2.46 291 / 2.26

The left column tabulates four types of attributes, which are the number of unique entity IDs (#IDs), the number of #IDs with only one variant (#Single Var.), the number of #IDs with two or more variants (#Multiple Var.), and the average number of variants that a multiple var. target ID has (Synonymy Rate)