Algorithm 1. BENS Training Algorithm (BENS-T).
Input: BENS framework, scenario emulator, and all applications’ QoS requirements.
For every episode i = 1, 2, …, M, do:
Initialize each agent's Q-network Q(p, b), rule-based policy φ(p, b), load β, and experience W;
For every step s = 0, 1, 2, …, F, do:
Each agent observes its current state Ts;
With probability λ, select a random action Ac;
Otherwise, select the greedy action Ac = arg max a∈A Qt(ps, bs, µs);
Execute action Ac and collect the reward W(ps, bs);
Observe the new state Ts+1;
Store the experience W = (ps, bs, W(ps, bs), Ts+1) in replay memory O;
For every agent, do:
Randomly sample a mini-batch ks from O;
Update the online network parameters µs;
Periodically update the target network parameters µs+1;
Update the policy φ toward the maximum Q-value;
Act according to φ;
End for;
End for;
End for;
Return: the trained BENS models.
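The per-agent loop above follows the standard DQN recipe: ε-greedy exploration, an experience replay memory, and a periodically synced target network. The sketch below illustrates that recipe in minimal tabular form, assuming a toy stand-in for the scenario emulator; all identifiers (`env_step`, `Q_target`, the state and action counts) are illustrative and do not come from the BENS paper.

```python
import random
from collections import deque

# Minimal tabular sketch of the per-agent BENS-T loop (illustrative names):
# epsilon-greedy exploration, experience replay, and a periodically synced
# target table standing in for the target-network parameters µs+1.

N_STATES, N_ACTIONS = 4, 2
GAMMA, ALPHA, EPSILON = 0.9, 0.1, 0.2   # discount, step size, exploration λ
SYNC_EVERY, BATCH = 10, 8               # target-sync period, mini-batch size

Q = [[0.0] * N_ACTIONS for _ in range(N_STATES)]   # online Q-values (µs)
Q_target = [row[:] for row in Q]                   # target Q-values (µs+1)
memory = deque(maxlen=100)                         # replay memory O

def env_step(state, action):
    """Toy stand-in for the scenario emulator: action 0 pays reward 1."""
    reward = 1.0 if action == 0 else 0.0
    return random.randrange(N_STATES), reward

random.seed(0)
state = 0
for step in range(1, 201):
    # epsilon-greedy: random action with probability λ, else greedy on Q
    if random.random() < EPSILON:
        action = random.randrange(N_ACTIONS)
    else:
        action = max(range(N_ACTIONS), key=lambda a: Q[state][a])
    next_state, reward = env_step(state, action)
    memory.append((state, action, reward, next_state))  # store experience

    if len(memory) >= BATCH:
        # sample a mini-batch and move the online table toward the
        # bootstrapped target r + γ · max_a Q_target(s', a)
        for s, a, r, s2 in random.sample(list(memory), BATCH):
            td_target = r + GAMMA * max(Q_target[s2])
            Q[s][a] += ALPHA * (td_target - Q[s][a])
    if step % SYNC_EVERY == 0:
        Q_target = [row[:] for row in Q]               # periodic target sync
    state = next_state

best_action = max(range(N_ACTIONS), key=lambda a: Q[0][a])
print(best_action)
```

Freezing `Q_target` between syncs keeps the bootstrap target stable while the online table changes, which is the role the periodically updated µs+1 plays in the listing above; in the full system each agent would run this loop against the shared replay memory.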