Skip to main content
. 2019 Jul 11;14(7):e0219774. doi: 10.1371/journal.pone.0219774

Fig 3. Mean performance metrics for random forest binary classification models trained on data sets containing imputed IC50 values.

Fig 3

IC50 values were imputed by the k-nearest neighbors (k = 9) algorithm (left) and the logistic regression with lasso (‘lassoC’) algorithm (right). Percentages of imputed values range from 10% to 40%. An IC50 activity cutoff of 1 μM was applied. The kappa statistic (κ) has been multiplied by 100 for scaling.