Table 3.
Comparison between different kNN-Dragon QSAR models generated with or without variable selection.
| Model | Num. Modelsa | Confusion Matrix | Statistics for the Models | ||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| N(1)b | N(0)c | TP | TN | FP | FN | SE | SP | En(1) | En(0) | CCRevsd | Coveragee | ||
| Af | 908 | 26 | 34 | 20 | 23 | 11 | 6 | 0.77 | 0.68 | 1.41 | 1.49 | 0.72 | 100% | 
| Bg | 1 | 26 | 34 | 10 | 22 | 10 | 8 | 0.38 | 0.65 | 1.13 | 1.36 | 0.52 | 83% | 
| Ch | 1 | 26 | 34 | 14 | 15 | 19 | 9 | 0.54 | 0.44 | 0.98 | 1.12 | 0.49 | 95% | 
| Di | 1 | 26 | 34 | 14 | 15 | 19 | 9 | 0.54 | 0.44 | 0.98 | 1.12 | 0.49 | 95% | 
Num. models, number of models with CCRtrain and CCRtest ≥ 0.70
N(1), number of actives
N(0), number of inactives
CCRevs, correct classification rate of the consensus models using the external validation set
Coverage: percentage of predicted compounds, and coverage = % of the external set compounds predicted by the models
A, kNN-Dragon
B, kNN-Dragon-NVS where kNN model was generated using all 387 Dragon descriptors with no variable selection and 1 nearest neighbor (NN)
C, kNN-Dragon-MFD where the kNN model was generated with top 20 most frequent Dragon descriptors and 1NN
D, kNN-Dragon-DWD where the kNN model was generated with top 20 highly weighted Dragon descriptors by DWD and 1 NN.