Table 5.
Performance of SVM-based models developed using binary profiles of atoms obtained from the terminals of the SMILES format.
| Feature (parameters) | Main: Sen | Main: Spc | Main: Acc | Main: MCC | Main: AUROC | Validation: Sen | Validation: Spc | Validation: Acc | Validation: MCC | Validation: AUROC |
|---|---|---|---|---|---|---|---|---|---|---|
| N25 (g = 0.05, c = 8, j = 2) | 77.63 | 75.68 | 76.67 | 0.53 | 0.83 | 79.59 | 85.71 | 82.42 | 0.65 | 0.91 |
| N50 (g = 0.01, c = 3, j = 3) | 83.17 | 79.31 | 81.27 | 0.63 | 0.88 | 90.58 | 86.40 | 88.59 | 0.77 | 0.93 |
| N100 (g = 0.005, c = 6, j = 2) | 85.71 | 84.18 | 84.90 | 0.70 | 0.93 | 85.04 | 84.93 | 84.98 | 0.70 | 0.93 |
| C25 (g = 0.01, c = 5, j = 4) | 79.11 | 70.43 | 74.70 | 0.50 | 0.79 | 89.19 | 74.51 | 82.16 | 0.65 | 0.83 |
| C50 (g = 0.1, c = 1, j = 1) | 83.47 | 72.08 | 77.94 | 0.56 | 0.85 | 88.31 | 74.83 | 81.73 | 0.64 | 0.91 |
| C100 (g = 0.001, c = 3, j = 2) | 82.97 | 81.85 | 82.38 | 0.65 | 0.89 | 89.55 | 77.55 | 83.27 | 0.67 | 0.92 |
| N25C25 (g = 0.01, c = 5, j = 2) | 85.69 | 84.82 | 85.27 | 0.71 | 0.91 | 84.71 | 82.31 | 83.55 | 0.67 | 0.92 |
| N50C50 (g = 0.05, c = 2, j = 1) | 89.79 | 87.16 | 88.47 | 0.77 | 0.95 | 87.43 | 85.63 | 86.53 | 0.73 | 0.95 |
| N100C100 (g = 0.01, c = 6, j = 1) | 90.15 | 89.58 | 89.84 | 0.80 | 0.96 | 90.51 | 84.62 | 87.37 | 0.75 | 0.96 |
Sen, Sensitivity; Spc, Specificity; Acc, Accuracy; MCC, Matthews Correlation Coefficient; AUROC, Area Under the Receiver Operating Characteristic curve; Main and Validation, results on the main dataset and the validation dataset; N25/N50/N100, first 25/50/100 elements from the N-terminal; C25/C50/C100, first 25/50/100 elements from the C-terminal; N25C25/N50C50/N100C100, first 25/50/100 elements from the N-terminal joined with the first 25/50/100 elements from the C-terminal.
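
For readers who want to recompute the threshold-dependent measures in the table, the following is a minimal sketch of the standard definitions of Sen, Spc, Acc and MCC from confusion-matrix counts. The function name `binary_metrics` and the example counts are hypothetical and are not taken from the study. Sen, Spc and Acc are returned as percentages, which is how the table values appear to be reported, while MCC lies in the range -1 to 1. AUROC is omitted because it requires the full set of prediction scores rather than a single confusion matrix.

```python
import math


def binary_metrics(tp: int, tn: int, fp: int, fn: int) -> dict:
    """Compute Sen, Spc, Acc (in percent) and MCC from confusion-matrix counts.

    tp/fn: positives predicted correctly/incorrectly.
    tn/fp: negatives predicted correctly/incorrectly.
    """
    sen = 100.0 * tp / (tp + fn)                    # sensitivity (recall on positives)
    spc = 100.0 * tn / (tn + fp)                    # specificity (recall on negatives)
    acc = 100.0 * (tp + tn) / (tp + tn + fp + fn)   # overall accuracy

    # Matthews correlation coefficient; defined as 0 when the denominator vanishes.
    denom = math.sqrt((tp + fp) * (tp + fn) * (tn + fp) * (tn + fn))
    mcc = (tp * tn - fp * fn) / denom if denom else 0.0

    return {"Sen": sen, "Spc": spc, "Acc": acc, "MCC": mcc}


# Hypothetical counts, chosen only to illustrate the calculation.
example = binary_metrics(tp=90, tn=80, fp=20, fn=10)
print({name: round(value, 2) for name, value in example.items()})
```

Because MCC uses all four cells of the confusion matrix, it is a more conservative summary than accuracy when sensitivity and specificity differ, as they do for the C25 and C50 rows.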