Skip to main content
. 2020 Aug 30;22(9):957. doi: 10.3390/e22090957
Algorithm 4: The ABS ratio α and the CRE bias β are given to optimize the number of SBS activation S.
Require: at=(α,β,S).
Ensure: Optimized ABS ratio α.
  1: Initialize QS(s,a), state s and n = 0;
  2: Setting learning rate λ3, greedy probability ε3, discount factor γ3, and threshold3
  3: while n <= threshold3 do
  4:  In state s, select the optimal action a with greedy probability ε3;
  5:  Observe r;
  6:  randomly transfer from s to s;
  7:  Update QS(s,a) according to Formula (20);
  8:  ss;
  9:  n=n+1;
  10: end while
  11: Output: S=a;