2018 Apr 25;34(18):3169–3177. doi: 10.1093/bioinformatics/bty323

Fig. 2.

The reinforcement learning framework. A cell interacts with the embryo. At each time step t, the cell receives a state S_t, selects an action A_t, gets a reward R_t and enters the next state S_{t+1}. The cell's objective is to maximize the total reward it receives.
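
The interaction loop described in the caption can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the paper's model: `ToyEmbryo`, `greedy_policy` and `run_episode` are hypothetical names, the "embryo" is reduced to a one-dimensional line with a target position, and a fixed rule stands in for a learned policy. The reward values are arbitrary.

```python
from dataclasses import dataclass
from typing import Tuple


@dataclass
class ToyEmbryo:
    """Hypothetical 1-D stand-in for the embryo environment (an assumption)."""
    target: int = 3    # position the cell should reach
    position: int = 0  # current cell position, used as the state S_t

    def step(self, action: int) -> Tuple[int, float, bool]:
        """Apply action A_t (-1 or +1) and return (S_{t+1}, R_t, done)."""
        self.position += action
        done = self.position == self.target
        # Arbitrary illustrative rewards: small cost per step, bonus at the target.
        reward = 1.0 if done else -0.1
        return self.position, reward, done


def greedy_policy(state: int, target: int) -> int:
    """Placeholder policy: move toward the target. A learned policy would go here."""
    return 1 if state < target else -1


def run_episode(env: ToyEmbryo, max_steps: int = 20) -> float:
    """Run one episode and return the total reward the cell accumulated."""
    state = env.position
    total_reward = 0.0
    for _ in range(max_steps):
        action = greedy_policy(state, env.target)  # select A_t given S_t
        state, reward, done = env.step(action)     # get R_t, enter S_{t+1}
        total_reward += reward
        if done:
            break
    return total_reward


total = run_episode(ToyEmbryo())
print(f"total reward: {total:.2f}")
```

In an actual reinforcement learning setup, the fixed rule in `greedy_policy` would be replaced by a policy that is updated from the observed rewards so that the total reward increases over episodes.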