. 2022 Mar 31;23:111. doi: 10.1186/s12859-022-04646-6

Table 3.

Global statistics comparison between TBGA, BioRel [24], and DTI [10] datasets

Dataset	Split	Instances	Bags	Inst.s/bag	Relations
BioRel	Train	534,277	39,969	13.37	125
	Validation	114,506	20,675	5.54
	Test	114,565	20,756	5.52
DTI	Train	604,303	472,033	1.28	6
	Validation	6133	4769	1.29
	Test	6312	4817	1.31
TBGA	Train	178,264	85,047	2.10	4
	Validation	20,193	10,491	1.92
	Test	20,516	10,494	1.96

Statistics are reported separately for each data split. Columns represent, from left to right, the considered granularity level, the data split, the total number of instances and bags, the average number of instances per bag, as well as the total number of relations