Real-Time Task Assignment Approach Leveraging Reinforcement Learning with Evolution Strategies for Long-Term Latency Minimization in Fog Computing

2018 Aug 27;18(9):2830. doi: 10.3390/s18092830

Algorithm 1 Reinforcement learning with evolution strategies.
1: Given
2:     Parent NN with weight matrices $W^{(i)}$, $i = 1, 2$
3:     number of children $m$
4:     learning_rate $\eta$
5: Start
6: for iteration in a predefined range do
7:     for $h = 1, \dots, m$ do
8:         $\mathrm{Child}^{(h)} =$ Parent NN + random noise ($W^{(i)(h)} = W^{(i)} + \mathrm{noise}$)
9:         Evaluate $\mathrm{Child}^{(h)} \to \mathrm{Reward}^{(h)}$
10:     Calculate $\mathrm{Mean\_reward}$
11:     $\mathrm{Gain}^{(h)} = \mathrm{Reward}^{(h)} - \mathrm{Mean\_reward}$, $h = 1, \dots, m$
12:     Parent NN → Parent NN + $\eta \times \sum_{h=1}^{m} \mathrm{Gain}^{(h)} \times \mathrm{Child}^{(h)}$ ($W^{(i)} \to W^{(i)} + \eta \times \sum_{h=1}^{m} \mathrm{Gain}^{(h)} \times W^{(i)(h)}$)
13:     Evaluate Parent NN
14: End

Return the highest-performing Parent NN
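
The loop in Algorithm 1 can be illustrated with a minimal Python sketch. It is not the authors' implementation, and several parts are assumptions made only for illustration: the two weight matrices are flattened into one list, the noise is Gaussian with a hypothetical scale `sigma` (Algorithm 1 does not specify the noise distribution), and `toy_reward` with its `TARGET` stands in for evaluating the network on the task assignment problem. The names `es_step` and `train` and all default values are likewise hypothetical. Because the gains are rewards minus their mean, they sum to zero, so weighting the child weights by the gains in step 12 moves the parent by the same amount as weighting the noise alone.

```python
import random

TARGET = [1.0, -2.0, 0.5]  # hypothetical optimum for the toy reward below


def toy_reward(weights):
    """Stand-in for 'Evaluate': higher is better, maximum 0 at TARGET."""
    return -sum((w - t) ** 2 for w, t in zip(weights, TARGET))


def es_step(parent, reward_fn, m, eta, sigma, rng):
    """One pass of steps 7-12 on a flat list of parent weights."""
    # Steps 7-9: build m noisy children and evaluate each one.
    children = [[w + rng.gauss(0.0, sigma) for w in parent] for _ in range(m)]
    rewards = [reward_fn(child) for child in children]
    # Steps 10-11: centre the rewards so that the gains sum to zero.
    mean_reward = sum(rewards) / m
    gains = [r - mean_reward for r in rewards]
    # Step 12: move the parent along the gain-weighted sum of child weights.
    return [
        w + eta * sum(g * child[j] for g, child in zip(gains, children))
        for j, w in enumerate(parent)
    ]


def train(parent, reward_fn, iterations=200, m=20, eta=0.05, sigma=0.1, seed=0):
    """Run the outer loop and return the best parent seen and its reward."""
    rng = random.Random(seed)
    best, best_reward = list(parent), reward_fn(parent)
    for _ in range(iterations):
        parent = es_step(parent, reward_fn, m, eta, sigma, rng)
        reward = reward_fn(parent)  # step 13: evaluate the updated parent
        if reward > best_reward:
            best, best_reward = list(parent), reward
    return best, best_reward


if __name__ == "__main__":
    best, best_reward = train([0.0, 0.0, 0.0], toy_reward)
    print(best, best_reward)
```

Keeping a copy of the best parent across iterations mirrors the final line of the algorithm, since an individual update is noisy and can lower the parent's reward.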