Skip to main content
. 2021 Aug 5;3(3):lqab066. doi: 10.1093/nargab/lqab066

Table 1.

The numbers of sequences for each class in the COALA90 dataset

ARG Class Whole dataset Training data for component methods Training data for LTR Validation data Test data
MULTIDRUG 382 263 34 37 48
AMINOGLYCOSIDE 1189 844 110 122 113
MACROLIDE 756 563 64 66 63
BETA-LACTAM 5845 4051 606 586 602
GLYCOPEPTIDE 2304 1638 193 243 230
TRIMETHOPRIM 666 424 91 71 80
FOLATE-SYNTHESIS- 2448 1730 249 249 220
INHIBITOR (FSI)
TETRACYCLINE 2056 1448 205 185 218
SULFONAMIDE 315 217 32 36 30
FOSFOMYCIN 138 102 15 10 11
PHENICOL 460 318 50 46 46
QUINOLONE 229 154 27 23 25
STREPTOGRAMIN 19 11 2 3 3
BACITRACIN 127 90 16 11 10
RIFAMYCIN 23 15 3 3 2
MACROLIDE/LINCOSAMIDE/ 66 48 5 11 2
STREPTOGRAMIN (MLS)