. 2020 Aug 10;20(16):4468. doi: 10.3390/s20164468
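Algorithm 1 Updating $R_t^{\lambda}$
1: For transition $(S_t, a_t, r_t, R_t^{\lambda}, S_{t+1})$, from back to front do
2:   If $S_{t+1}$ is terminal
3:     Update $R_t^{\lambda} \leftarrow r_t$
4:   Else
5:     Obtain the next transition $(S_{t+1}, a_{t+1}, r_{t+1}, R_{t+1}^{\lambda}, S_{t+2})$ from T
6:     Update $R_t^{\lambda} \leftarrow r_t + \gamma \left[ \lambda R_{t+1}^{\lambda} + (1-\lambda) \max_{a \in A} Q(S_{t+1}, a) \right]$
7:   End If
8: End For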
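
The backward pass in Algorithm 1 can be illustrated with a minimal Python sketch. It is not the authors' implementation: the `Transition` dataclass, the function name `update_lambda_returns`, and the `q_values` callable (assumed to return one action value per action for a given state) are hypothetical names introduced here. The sketch also assumes that T is stored as a list in time order, and that an episode cut off before a terminal state falls back to a pure bootstrap from Q for its last transition, a case the algorithm does not address.

```python
from dataclasses import dataclass
from typing import Callable, Hashable, List, Sequence


@dataclass
class Transition:
    """One stored transition (S_t, a_t, r_t, R_t^lambda, S_{t+1})."""
    state: Hashable
    action: int
    reward: float
    lambda_return: float   # R_t^lambda, overwritten by the update
    next_state: Hashable
    terminal: bool         # True if S_{t+1} is terminal


def update_lambda_returns(
    episode: List[Transition],
    q_values: Callable[[Hashable], Sequence[float]],
    gamma: float,
    lam: float,
) -> None:
    """Recompute R_t^lambda in place, sweeping the episode from back to front."""
    for t in reversed(range(len(episode))):
        tr = episode[t]
        if tr.terminal:
            # Step 3: no future return beyond a terminal state.
            tr.lambda_return = tr.reward
            continue
        # max_a Q(S_{t+1}, a)
        bootstrap = max(q_values(tr.next_state))
        if t + 1 < len(episode):
            # Step 5: the next transition was already updated in this sweep.
            next_return = episode[t + 1].lambda_return
        else:
            # Assumption (not in Algorithm 1): truncated episode,
            # so bootstrap from Q alone.
            next_return = bootstrap
        # Step 6: blend the recursive return with the one-step bootstrap.
        tr.lambda_return = tr.reward + gamma * (
            lam * next_return + (1.0 - lam) * bootstrap
        )


# Usage with a toy Q-function whose greedy value is 10.0 in every state.
toy_q = lambda state: [0.0, 10.0]
episode = [
    Transition("s0", 0, 1.0, 0.0, "s1", False),
    Transition("s1", 1, 2.0, 0.0, "s2", False),
    Transition("s2", 0, 3.0, 0.0, "end", True),
]
update_lambda_returns(episode, toy_q, gamma=0.5, lam=0.5)
returns = [tr.lambda_return for tr in episode]  # [4.8125, 5.25, 3.0]
```

Sweeping from back to front means each $R_{t+1}^{\lambda}$ is already up to date when $R_t^{\lambda}$ is computed, so one pass suffices. With `lam=1.0` the update reduces to the discounted Monte Carlo return, and with `lam=0.0` it reduces to the one-step Q-learning target.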