Skip to main content
. 2020 Sep 4;12:53. doi: 10.1186/s13321-020-00454-3

Fig. 1.

Fig. 1

Block diagram of our basic system. A molecule is generated by the Reinforcement Learning (RL) pathway using a Graph Convolutional Policy Networks. This molecule is then used as an input for the property prediction module which outputs the property score as predicted by the module. This score is then used as the reward feedback for the RL pathway and the cycle restarts