. 2022 Apr 1;14:20. doi: 10.1186/s13321-022-00601-y

Fig. 1.

Experimental setup described by Renz et al. [12]. The initial dataset is split into two sets. The first split is used as the training set for the optimization model and the model-control model, and the second split is used as the training set for the data-control model. For a given molecule, the optimization (resp. model-control, data-control) score Sopt (resp. Smc, Sdc) is the probability of being active predicted by the optimization (resp. model-control, data-control) model. The optimization score is used to guide goal-directed generation, and the evolution of the control scores is also tracked during optimization. While the optimization score Sopt grows throughout training, the control scores Smc and Sdc stagnate and reach much lower values.
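
The setup in this caption can be sketched in a few lines of Python. This is only a minimal illustration under stated assumptions, not the authors' implementation: the dataset is synthetic, the four-dimensional feature vectors stand in for molecular descriptors, and a small logistic regression stands in for whatever classifier is actually used. The names `train_classifier`, `predict_active`, and `scores` are hypothetical. The sketch also assumes that the optimization and model-control models differ only in their random seed.

```python
import math
import random


def sigmoid(z):
    # Clamp z so that math.exp never overflows.
    z = max(-30.0, min(30.0, z))
    return 1.0 / (1.0 + math.exp(-z))


def train_classifier(X, y, seed, epochs=200, lr=0.1):
    """Fit a tiny logistic regression with SGD (a stand-in for any activity classifier)."""
    rng = random.Random(seed)
    weights = [rng.uniform(-0.1, 0.1) for _ in X[0]]
    bias = 0.0
    order = list(range(len(X)))
    for _ in range(epochs):
        rng.shuffle(order)
        for i in order:
            p = sigmoid(bias + sum(w * x for w, x in zip(weights, X[i])))
            grad = p - y[i]
            weights = [w - lr * grad * x for w, x in zip(weights, X[i])]
            bias -= lr * grad
    return weights, bias


def predict_active(model, x):
    """Predicted probability that the molecule with features x is active."""
    weights, bias = model
    return sigmoid(bias + sum(w * xi for w, xi in zip(weights, x)))


# Synthetic data (assumption): 200 "molecules", labelled active when x0 + x1 > 1.
data_rng = random.Random(42)
X = [[data_rng.random() for _ in range(4)] for _ in range(200)]
y = [1 if x[0] + x[1] > 1.0 else 0 for x in X]

# Split the initial dataset into two sets.
X1, y1 = X[:100], y[:100]
X2, y2 = X[100:], y[100:]

# Split 1 trains the optimization and model-control models (different seeds);
# split 2 trains the data-control model.
models = {
    "S_opt": train_classifier(X1, y1, seed=0),
    "S_mc": train_classifier(X1, y1, seed=1),
    "S_dc": train_classifier(X2, y2, seed=2),
}


def scores(x):
    """Return the optimization score and both control scores for one molecule."""
    return {name: predict_active(model, x) for name, model in models.items()}


print(scores([0.9, 0.9, 0.5, 0.5]))
```

In a goal-directed generation run, only `S_opt` would be passed to the generator as the objective, while `S_mc` and `S_dc` would be logged at each step to track how the control scores evolve.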