Table - PMC

Skip to main content

View full-text article in PMC

. 2023 Oct 4;25(10):1415. doi: 10.3390/e25101415

Algorithm 1 Offline Training for Domain Theory

Require:

S

,

A

: State and action spaces in the continuous domain
Require:

m a p

: A map used for navigation tasks in the continuous domain

1:
Initializing DRL (deep reinforcement learning) parameters $γ$
2:
Initializing the reward function $r (s, a)$
3:
Initializing training iterations T
4:
for t = 1 to T do
5:
Randomly generating goal g in the $m a p$
6:
$\forall a \forall s \neq g, r (s, a) \leftarrow 0$
7:
$\forall a, s = g, r (g, a) \leftarrow C$
8:
$Q_{π_{g}} \leftarrow D R L (S, A, γ)$
9:
end for
10:
return $T_{π} (G)$