|
Algorithm
1
: Q-learning extended Kalman filter
|
| 1: , Initialization |
| 2: k 0 |
| 3:
|
| 4: for each period, do
|
|
5:
for all , do
|
|
6:
,
|
|
7:
for , do
|
|
8:
|
|
9:
Benchmark filter |
|
10:
Exploring filter |
|
11:
Main filter |
|
12:
|
|
13:
|
|
14:
end for
|
|
15:
Calculation of Reward |
|
16:
Update of weight |
|
17:
, Resetting of exploring filter |
|
18:
end for
|
|
19:
Selection of the best action |
| 20: end for
|
| 21: return and Result of state estimation |