Skip to main content
. 2020 Nov 4;11:5575. doi: 10.1038/s41467-020-19266-y

Fig. 1. Character and exact sequence-based accuracies calculated for the monitoring set.

Fig. 1

The transformer memorized the target sequences if the target sequences were all canonical SMILES (red dots). It also reasonably predicted the sequence composition for randomized target SMILES (cyan rectangle, dashed) but its performance decreased for prediction of exact full SMILES (cyan circle). The performance normalized by the percentage of canonical sequences increased with the number of augmentations, N, since some of the random sequences were canonical ones.