Algorithm 1 Reinforcement learning with evolution strategies.
1: Given
2: Parent NN with weight matrix
3: number of children m
4: learning_rate
5: Start
6: for iteration in a predefined range do
7: for h in range m do
8: Child h: Parent NN + random noise
9: Evaluate child h
10: Calculate
11:
12: Parent NN → Parent NN +
13: Evaluate Parent NN
14: End
Return the highest performing Parent NN
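The loop in Algorithm 1 can be sketched in Python as follows. This is a minimal illustration under stated assumptions, not the authors' implementation: the listing does not spell out how rewards are processed or what term is added to the parent, so the sketch assumes a common evolution-strategies form (standardized rewards, parent moved along the reward-weighted noise). The names `evolve`, `evaluate`, `sigma`, and `toy_reward` are hypothetical, and a flat list of weights stands in for the network's weight matrix.

```python
import random
import statistics


def evolve(parent, evaluate, m=50, learning_rate=0.05, sigma=0.1,
           iterations=200, seed=0):
    """Illustrative evolution-strategies loop over a flat list of weights."""
    rng = random.Random(seed)
    parent = list(parent)
    best_parent, best_score = list(parent), evaluate(parent)

    for _ in range(iterations):
        # One Gaussian noise vector per child; child h = parent + sigma * noise[h].
        noise = [[rng.gauss(0.0, 1.0) for _ in parent] for _ in range(m)]
        rewards = [evaluate([w + sigma * e for w, e in zip(parent, eps)])
                   for eps in noise]

        # Assumption: standardize rewards to zero mean and unit variance.
        mean = statistics.fmean(rewards)
        spread = statistics.pstdev(rewards)
        if spread > 0:
            advantages = [(r - mean) / spread for r in rewards]
        else:
            advantages = [0.0] * m

        # Assumption: move the parent along the reward-weighted sum of the noise.
        scale = learning_rate / (m * sigma)
        parent = [w + scale * sum(a * eps[i] for a, eps in zip(advantages, noise))
                  for i, w in enumerate(parent)]

        # Evaluate the updated parent and remember the best one seen so far.
        score = evaluate(parent)
        if score > best_score:
            best_parent, best_score = list(parent), score

    return best_parent, best_score


# Hypothetical toy task: the reward is highest when the weights match a fixed target.
target = [0.5, -0.3, 0.8]


def toy_reward(weights):
    return -sum((w - t) ** 2 for w, t in zip(weights, target))


start = [0.0, 0.0, 0.0]
best, score = evolve(start, toy_reward)
```

In a reinforcement-learning setting, `evaluate` would run one or more episodes with the given weights and return the accumulated reward. Tracking `best_parent` separately mirrors the final step of the listing, which returns the highest performing parent instead of the last one.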