Table 2. Important Parameters Used To Establish the Models.
models | descriptors | methodsa | parameters |
---|---|---|---|
model 1A | MACCS fingerprint | k-NN | the number of the closet neighbors k = 3. |
model 1B | PubChem fingerprint | k-NN | the number of the closet neighbors k = 3 |
model 1C | CORINA Symphony | k-NN | the number of the closet neighbors k = 10 |
model 2A | MACCS fingerprint | DT | the maximum depth of the tree d = 0; the minimum number of samples s = 2; the number of features to consider when looking for the best split n = 13 |
model 2B | PubChem fingerprint | DT | the maximum depth of the tree d = 0; the minimum number of samples s = 4; the number of features to consider when looking for the best split n = 30 |
model 2C | CORINA Symphony | DT | the maximum depth of the tree d = 8; the minimum number of samples s = 2; the number of features to consider when looking for the best split n = 7 |
model 3A | MACCS fingerprint | RF | the number of trees n = 100; the max depth of the tree d = 10; the number of features to consider when looking for the best split n = 13 |
model 3B | PubChem fingerprint | RF | the number of trees n = 70; the max depth of the tree d = 10; the number of features to consider when looking for the best split n = 30 |
model 3C | CORINA Symphony | RF | the number of trees n = 110; the max depth of the tree d = 6; the number of features to consider when looking for the best split n = 7 |
model 4A | MACCS fingerprint | SVM | the optimum loss parameter C = 4.0; the kernel function parameter γ = 0.0625 |
model 4B | PubChem fingerprint | SVM | the optimum loss parameter C = 4.0; the kernel function parameter γ = 0.0015625 |
model 4C | CORINA Symphony | SVM | the optimum loss parameter C = 4.0; the kernel function parameter γ = 2.0 |
model 5A | MACCS fingerprint | DNN | the number of hidden layer = 1; the number of neuron = 16 |
model 5B | PubChem fingerprint | DNN | the number of hidden layer = 2; the number of neuron in first hidden layer = 16, the number of neuron in second hidden layer = 16 |
model 5C | CORINA Symphony | DNN | the number of hidden layer = 1; the number of neuron = 64 |
NB: naive bayes; k-NN: k-nearest neighbors; DT: decision tree; RF: random forest; SVM: support vector machine; DNN: deep neural net.