Skip to main content
. 2021 Jul 15;12:12. doi: 10.1186/s13326-021-00247-z

Table 5.

Summary statistics of the Hallmarks of Cancer (HOC) and the Chemical Exposure Assessment (CEA) datasets

HOC CEA
Document Sentence Document Sentence
Train 1,303 12,279 2,555 25,307
Dev 183 1,775 384 3,770
Test 366 3,410 722 7,100
Total 1,852 17,464 3,661 36,177