Skip to main content
. Author manuscript; available in PMC: 2025 Sep 30.
Published in final edited form as: Proc Mach Learn Res. 2025 Jun;287:527–542.

Table 7:

Agreement statistics between annotators for each category, using pairwise F1 scores. The “Support” column shows the number of instances in each category.

Category Pairwise F1 Support

Vitals _Hema 0.33 40
RESP 0.36 34
Lab_Image 0.55 400
LYMPH 0.0 2
DERM 0.56 136
History 0.57 148
EENT 0.53 88
Neuro 0.51 245
Pregnancy 0.54 44
GI 0.42 103
CVS 0.50 81
MSK 0.36 82
GU 0.33 79
ENDO 0.18 38