Skip to main content
. 2019 Nov 21;11:71. doi: 10.1186/s13321-019-0393-0

Fig. 4.

Fig. 4

Histograms of different statistics from the randomized SMILES models. a Kernel Density Estimates (KDEs) of the number of randomized SMILES per molecule from a sample of 1 million molecules from GDB-13. The plot has the x axis cut at 5000, but the unrestricted randomized variant plot has outliers until 15,000. b KDEs of the molecule negative log-likelihood (NLL) for each molecule (summing the probabilities for each randomized SMILES) for the same sample of 1 million molecules from GDB-13. The plot is also cropped between range 19,25. c Histograms between the NLL of all the restricted randomized SMILES of two molecules from GDB-13