Skip to main content
. 2023 Oct 4;25(10):1415. doi: 10.3390/e25101415
Algorithm 1 Offline Training for Domain Theory
Require: S, A: State and action spaces in the continuous domain
Require: map: A map used for navigation tasks in the continuous domain
  •  1:

    Initializing DRL (deep reinforcement learning) parameters γ

  •  2:

    Initializing the reward function r(s,a)

  •  3:

    Initializing training iterations T

  •  4:

    for t = 1 to T do

  •  5:

       Randomly generating goal g in the map

  •  6:

       asg,r(s,a)0

  •  7:

       a,s=g,r(g,a)C

  •  8:

       QπgDRL(S,A,γ)

  •  9:

    end for

  • 10:

    return Tπ(G)