Figure 8.

Average F1 score for each group for the substances in the external (hold-out) group as a function of the group size. The groups were partitioned into five pools according to the group size with the inner bin boundaries corresponding to the 20th, 40th, 60th, and 80th percentiles (the outer boundaries, 10 and 57, correspond to the smallest and largest group).