Author manuscript; available in PMC: 2016 Jan 1.
Published in final edited form as: Pattern Recognit. 2014 Aug 6;48(1):276–287. doi: 10.1016/j.patcog.2014.07.025

Table 1.

Characteristics of the 34 UCI datasets employed in this study. In the class label columns, “rest” indicates that the dataset was originally a multi-class problem and that all remaining classes were combined into a single class.

| Dataset | Name | # Instances | # Attr | Class label (+1) | Class label (−1) |
|---|---|---|---|---|---|
| 1 | Abalone | 4177 | 8 | Female | Male, infant |
| 2 | Arcene_train | 100 | 10,000 | Positive | Negative |
| 3 | Blood | 748 | 5 | Donated blood | Did not donate |
| 4 | Breast | 106 | 9 | Car, fad, mas | gla, con, adi |
| 5 | Bupaliver | 345 | 6 | >5 drinks | <5 drinks |
| 6 | Cancer_wbc | 669 | 9 | Malignant | Benign |
| 7 | Cardio | 74 | 10 | Alive after 1 year | Died before 1 year |
| 8 | cmc | 1473 | 9 | Long/use of contraceptives | No contraceptive use |
| 9 | cnae_9 | 1080 | 856 | Category: range 1–5 | Category: range 6–9 |
| 10 | Credit_g | 1000 | 20 | Good credit | Bad credit |
| 11 | Derm | 366 | 34 | 4,5,6 | 1,2,3 |
| 12 | E.coli | 336 | 6 | pp | rest |
| 13 | Glass | 214 | 9 | 7th type | Rest |
| 14 | Heart | 270 | 14 | Absence | Presence |
| 15 | Hepatitis | 155 | 19 | Die | Live |
| 16 | House | 435 | 16 | Democrat | Republican |
| 17 | Ionosphere | 351 | 34 | Bad | Good |
| 18 | Iris | 150 | 4 | setosa | Versicolor, virginica |
| 19 | Kidney_inflam | 120 | 6 | Bladder inflammation | No inflammation |
| 20 | Kr vs. kp | 3196 | 36 | White wins | White loses |
| 21 | Mushroom | 8124 | 21 | Edible | Poisonous |
| 22 | Parkinsons | 197 | 23 | Parkinson’s | Healthy |
| 23 | pima | 768 | 8 | Positive for diabetes | No diabetes |
| 24 | Post_op | 90 | 8 | Patient discharged (s) | Rest |
| 25 | sonar | 208 | 60 | Rock | Mine |
| 26 | Spectf | 267 | 45 | 1 | 0 |
| 27 | Statlog | 690 | 14 | Credit approved | Not approved |
| 28 | Survival | 306 | 3 | Survived 5+ years | Died within 5 years |
| 29 | Teach | 151 | 5 | Low | Medium, high |
| 30 | Tictactoe | 958 | 9 | x wins | x loses |
| 31 | Vehicle | 846 | 18 | van, bus | saab, opel |
| 32 | Weight | 625 | 4 | Right-leaning | Balanced/left-leaning |
| 33 | Wine | 178 | 12 | Cultivar 3 | Cultivar 1 and 2 |
| 34 | Zoo | 101 | 17 | Aquatic animals | Not aquatic |
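
The relabeling described in the Table 1 caption, where a chosen set of classes becomes +1 and all remaining classes are merged into −1, can be sketched as follows. This is a minimal illustration only: the function name `binarize_labels` and the sample label lists are hypothetical, and the study's actual preprocessing code is not given in the source.

```python
def binarize_labels(labels, positive):
    """Map every label in `positive` to +1 and every other label to -1.

    `labels` is any iterable of class labels; `positive` is the set of
    original classes assigned to the +1 column (hypothetical helper,
    not taken from the paper).
    """
    positive = set(positive)
    return [1 if y in positive else -1 for y in labels]


# Iris row: setosa is +1; versicolor and virginica are merged into -1.
iris_labels = ["setosa", "versicolor", "virginica", "setosa"]
print(binarize_labels(iris_labels, {"setosa"}))  # [1, -1, -1, 1]

# Derm row: classes 4, 5, 6 are +1; classes 1, 2, 3 are -1.
derm_labels = [4, 1, 6, 3]
print(binarize_labels(derm_labels, {4, 5, 6}))  # [1, -1, 1, -1]
```

Passing the positive classes as a set keeps the same helper usable for rows where several original classes share the +1 label (for example Derm or Vehicle) as well as for the "rest" rows.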