Skip to main content
. 2016 Jan 11;56(2):275–285. doi: 10.1021/acs.jcim.5b00555

Figure 5.

Figure 5

Unsupervised model building based on 1843 data sets extracted from ChEMBL v20. Results are divided into bin sizes (columns). Each point corresponds to the ratio of correctly predicted bins versus chance of random guessing (enrichment), with a purple line indicating the null hypothesis. The average and standard deviation are marked on the Y-axis. Training set size is shown on the X-axis. The testing sets were made up of 10% of each total data set.