Table 1.
Dataset Name and Source | No. Observations (per class) | No. Features | No. Classes | Dimensionality * |
---|---|---|---|---|
Jasmine | 2984 (1492/1492) | 145 | 2 | 0.048592 |
Spectrometer | 531 (476/55) | 103 | 2 | 0.193974 |
Image | 2000 (1420/580) | 140 | 2 | 0.07 |
Fri | 1000 (564/436) | 101 | 2 | 0.101 |
Scene | 2407 (1976/431) | 295 | 2 | 0.122559 |
Musk | 6598 (5581/1017) | 170 | 2 | 0.025765 |
Philippine | 5832 (2916/2916) | 309 | 2 | 0.052984 |
Ionosphere | 351 (126/225) | 34 | 2 | 0.096866 |
Optdigits | 5620 (572/5048) | 64 | 2 | 0.011388 |
Satellite | 5100 (75/5025) | 37 | 2 | 0.007255 |
Ada | 4147 (1029/3118) | 49 | 2 | 0.011816 |
Splice | 3190 (1535/1655) | 62 | 2 | 0.019436 |
HIVA | 4229 (149/4080) | 1617 | 2 | 0.382360 |
* Dimensionality is the ratio of the number of features to the number of observations. Superscripts indicate the data sources as follows: 1 automl.chalearn.org, 2 www.openml.org, 3 mulan.sourceforge.net, 4 archive.ics.uci.edu.
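
The dimensionality column can be reproduced from the observation and feature counts in Table 1. The following is a minimal Python sketch, not code from the original work: the helper name `dimensionality` and the `datasets` dictionary are illustrative assumptions, and only three rows are hard-coded here as examples.

```python
# Illustrative sketch: recompute the "Dimensionality" column of Table 1.
# The counts are copied from the table; the names below are hypothetical.

# dataset name -> (number of observations, number of features)
datasets = {
    "Jasmine": (2984, 145),
    "Spectrometer": (531, 103),
    "Image": (2000, 140),
}


def dimensionality(n_observations: int, n_features: int) -> float:
    """Return the ratio of the number of features to the number of observations."""
    if n_observations <= 0:
        raise ValueError("n_observations must be positive")
    return n_features / n_observations


for name, (n_obs, n_feat) in datasets.items():
    # Six decimal places, matching the precision used in the table.
    print(f"{name}: {dimensionality(n_obs, n_feat):.6f}")
```

Under this definition a value near 0 means many observations per feature, while larger values (such as Spectrometer's) mean fewer observations are available per feature.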