Table 1B.
Corpus | No. articles with N codes | Total articles | |||
---|---|---|---|---|---|
1 | 2 | 3 | 4 | ||
Training | 15444 | 888 | 60 | 9 | 16401 |
Test 2000 | 2682 | 231 | 27 | 1 | 2941 |
Test 2001 | 184 | 22 | 2 | 0 | 208 |
Some of the articles within the training set were obtained in more than one of the queries; thus these articles have more than a single relevant GO classification. This table lists the number of abstracts in each data set and the number of abstracts with one, two, three, and four relevant codes.