Skip to main content
. 2020 Aug 30;22(9):957. doi: 10.3390/e22090957
Algorithm 2: The CRE bias β and the number of SBSs activation S are given to optimize the ABS ratio α.
Require: At=(α,β,S).
Ensure: Optimized ABS ratio α.
  1: Initialize Qα(s,a), state s and n = 0;
  2: Setting learning rate λ1, greedy probability ε1, discount factor γ1 and threshold1;
  3: while n <= threshold1 do
  4:  In state s, select the optimal action a with greedy probability ε1;
  5:  Observe r;
  6:  randomly transfer from s to s;
  7:  Update Qα(s,a) according to Formula (18);
  8:  ss;
  9:  n=n+1;
  10: end while
  11: Output: α=a;