Skip to main content
. 2015 Apr 28;15(5):10026–10047. doi: 10.3390/s150510026

Algorithm 1. Learning progress of sensor vi

1 Initialise Q value for each available action arbitrarily;
2 for k = 0 to a predefined integer do;
3  calculate π;
4 for each available action aAi do;
5   Qk+1(a) = Qk(a) + π (a)α1(∑a Inline graphic(a)π(a) − Qk(a));
6 end for
7 end for
8 aoptiargMax(Q);
9 vi takes the action aopti;