Skip to main content
. 2024 Jan 8;6:1296508. doi: 10.3389/fdata.2023.1296508

Table 2.

Datasets description.

Dataset Problem Train/test split Target variable Continuous Binary Multi-class Mixed-type Long-tail General transform
Adult Classification 39k/9k “income” 3 2 7 2 0 1
Covertype Classification 40k/10k “Cover_Type” 10 44 1 0 0 47
Credit Classification 40k/10k “Class” 30 1 0 0 1 29
Intrusion Classification 40k/10k “Class” 22 6 14 0 2 6
Loan Classification 4k/1k “PersonalLoan” 5 5 2 1 0 9
Insurance Regression 1k/300 “charges” 3 2 2 0 0 1
King Regression 17.3k/4.3k 'price' 11 2 5 2 0 7