Skip to main content
. 2019 Jun 28;13:40. doi: 10.3389/fnbot.2019.00040

Figure 2.

Figure 2

Sequence updates in the recurrent network. Only the scores of the actions taken in states 5, 6, and 7 will be updated. The first four states provide a more accurate hidden state to the LSTM, while the last state provides a target for state 7.