Algorithm 1 Modified HER Algorithm for Retaining Experience with Adjusted Rewards
1: Initialize environment and agent
2:
3:
4: while task not finished do
5:
6:
7:     Append to
8:     if then
9:
10:
11:     end if
12:
13: end while
14: if then
15:
16:
17:     Add a reward of 10 to the reward of
18:     Process for further learning as successful attempts
19:     Store in experience pool without additional reward
20: end if
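The symbols in the listing above did not survive extraction, so the following is only a minimal Python sketch of the visible control flow, not the authors' implementation. Everything not named in the listing is an assumption made for illustration: the toy 1-D environment, the `Transition` record, the helper names `run_episode` and `retain`, the condition "the episode ended without reaching its goal", the HER-style relabelling of the goal to the state actually reached, and the choice to add the bonus of 10 to the final relabelled transition.

```python
from dataclasses import dataclass, replace
from typing import List

BONUS = 10.0  # the "reward of 10" named in the listing


@dataclass(frozen=True)
class Transition:
    """Hypothetical record for one environment step."""
    state: int
    action: int
    reward: float
    next_state: int
    goal: int
    success: bool = False


def run_episode(start: int, goal: int, actions: List[int]) -> List[Transition]:
    """Roll out a fixed action list on a toy 1-D line (assumed environment)."""
    trajectory: List[Transition] = []
    state = start
    for action in actions:  # stands in for "while task not finished"
        next_state = state + action
        reward = 0.0 if next_state == goal else -1.0
        # Stands in for the "Append ... to ..." step of the listing.
        trajectory.append(Transition(state, action, reward, next_state, goal))
        state = next_state
        if state == goal:
            break
    return trajectory


def retain(trajectory: List[Transition], pool: List[Transition],
           bonus: float = BONUS) -> None:
    """Post-episode step. The branch condition is an assumption."""
    if not trajectory:
        return
    reached = trajectory[-1].next_state
    if reached != trajectory[-1].goal:  # assumed condition: goal was missed
        # HER-style relabelling (assumption): treat the reached state as the goal
        # and recompute the sparse reward against it.
        relabelled = [
            replace(t, goal=reached, success=True,
                    reward=0.0 if t.next_state == reached else -1.0)
            for t in trajectory
        ]
        # Add the bonus to the final relabelled transition (assumed target).
        last = relabelled[-1]
        relabelled[-1] = replace(last, reward=last.reward + bonus)
        pool.extend(relabelled)  # processed as a successful attempt
    pool.extend(trajectory)  # original stored without additional reward


# Usage: a failed episode yields a relabelled copy plus the untouched original.
failed = run_episode(start=0, goal=5, actions=[1, 1, 1])
pool: List[Transition] = []
retain(failed, pool)

# A successful episode is stored once, unchanged.
solved = run_episode(start=0, goal=2, actions=[1, 1, 1])
pool_solved: List[Transition] = []
retain(solved, pool_solved)
```

Storing both the relabelled copy and the original is one reading of steps 17 to 19; if the source instead places the plain store in a separate branch, only the final `pool.extend(trajectory)` line needs to move.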