Table 2.
The parameters used for support vector machine and random forest classifiers; all other parameters are kept as default.
| Parameters | Value |
| --- | --- |
| Support vector machines | |
| C | 100 |
| Gamma | 1 |
| Kernel | linear |
| Norm | l1 |
| Use-idf^a | TRUE |
| Max-df^b | 1 |
| N-gram range | (1,1) |
| Random forests | |
| N-estimators | 10 |
| Criterion | Gini |
| Min-impurity-split | 1.00E-07 |
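^a Use-idf: when true, term weights are scaled by inverse document frequency, so terms that appear in many documents receive lower weight.
^b Max-df: when set to 1, no upper document-frequency cutoff applies, so words that appear in every document are not removed.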
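
The settings in Table 2 can be written out as a configuration sketch. The keyword names below follow scikit-learn conventions (`TfidfVectorizer`, `SVC`, `RandomForestClassifier`), which is an assumption: the table gives only display names and does not name the library. Reading Max-df as the float `1.0` (100% of documents) is likewise an assumption, chosen because it matches footnote b. Whether the random forest used the same text features is not stated, so only the SVM is paired with the vectorizer here. The helper `build_models` is hypothetical.

```python
# Table 2 as plain dictionaries. Keyword names are assumed to map onto
# scikit-learn arguments; the source lists display names only.
TFIDF_PARAMS = {
    "norm": "l1",           # Norm
    "use_idf": True,        # Use-idf (footnote a)
    "max_df": 1.0,          # Max-df, read as 100% of documents (footnote b)
    "ngram_range": (1, 1),  # unigrams only
}
SVM_PARAMS = {
    "C": 100,
    "gamma": 1,             # has no effect with a linear kernel
    "kernel": "linear",
}
RF_PARAMS = {
    "n_estimators": 10,
    "criterion": "gini",
}
# Min-impurity-split (1e-7) is kept separate: the matching scikit-learn
# argument, min_impurity_split, is no longer accepted by recent releases.
RF_MIN_IMPURITY_SPLIT = 1e-7


def build_models():
    """Hypothetical helper: build the two classifiers from the dictionaries.

    scikit-learn is imported lazily so the dictionaries above remain usable
    without it installed.
    """
    from sklearn.ensemble import RandomForestClassifier
    from sklearn.feature_extraction.text import TfidfVectorizer
    from sklearn.pipeline import make_pipeline
    from sklearn.svm import SVC

    svm = make_pipeline(TfidfVectorizer(**TFIDF_PARAMS), SVC(**SVM_PARAMS))
    forest = RandomForestClassifier(**RF_PARAMS)
    return svm, forest
```

Calling `build_models()` returns an SVM text pipeline and an unfitted random forest; every argument not listed in the dictionaries stays at the library default, as the caption specifies.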