2021 Mar 25;21(7):2308. doi: 10.3390/s21072308
Lb Index of the number N of BSs.
Li Index of the number Ki of information users.
Le Index of the number Ke of active energy users (EUs).
Le[idle] Index of the number Ke[idle] of idle EUs.
Pn[Tx] Total transmit power at the n-th RRH.
Pn[circ] Hardware circuit power consumption at the n-th RRH.
PCU[circ] Hardware circuit power consumption at the CU.
Pn[Tmax] Maximum transmit power allowance of the n-th RRH.
PCU[max] Maximum power provision by the grid at the CU.
T Index of the number T of time slots.
F Index of the number F of frames within a time slot.
K Index of the number K of trials within a frame.
Bn[ahead] Amount of energy purchased from the day-ahead market (arm).
Bn[spot] Amount of energy to be purchased from the spot-market.
Sn Amount of excessive energy to be sold back to the grid.
En Amount of renewable energy generation at the n-th RRH.
Bn[total](k) Total energy cost of the n-th RRH at the k-th trial.
E[total]={E1,...,EJ} All energy packages (arms) offered by the grid in the day-ahead market.
μn,p[k,f,t]=R(Bn[ahead](k)) Reward associated with arm Bn[ahead] at the k-th trial of the f-th frame at the t-th time slot.
Ak[set]={B1[ahead](k),...,BN[ahead](k)} N energy packages purchased a day ahead at the k-th trial (super arm).
R(Ak[set]) Reward for the super arm Ak[set] at the k-th trial.
μn[k,f,t]=(μn,1[k,f,t],μn,2[k,f,t],...,μn,J[k,f,t]) Reward vector for the n-th RRH.
μ^n[f,t]=(μ^n,1[f,t],μ^n,2[f,t],...,μ^n,J[f,t]) Estimated mean reward vector.
μ¯n[f,t]=(μ¯n,1[f,t],μ¯n,2[f,t],...,μ¯n,J[f,t]) Adjusted reward vector of individual arms.
Rt[acc] Accumulated reward at time slot t.
Qt Regret at time slot t.
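To make the bandit quantities in the table more concrete, the following is a minimal Python sketch of how an estimated mean reward (μ^), an adjusted reward (μ¯), and a regret (Qt) are commonly computed in combinatorial bandit algorithms. This section does not state the paper's formulas, so everything here is an assumption: the adjustment term sqrt(3 ln t / (2 c)) is the one used in the generic combinatorial UCB algorithm, the regret is the usual gap to an always-optimal policy, and the function names are hypothetical.

```python
import math


def update_mean(mean, count, reward):
    """Incrementally update one arm's estimated mean reward (one entry of mu-hat)."""
    count += 1
    mean += (reward - mean) / count
    return mean, count


def adjusted_rewards(means, counts, t):
    """Assumed UCB-style adjustment (mu-bar): mean + sqrt(3 ln t / (2 * count)).

    Arms that were never played get +inf so that they are tried first.
    """
    return [
        m + math.sqrt(3.0 * math.log(t) / (2.0 * c)) if c > 0 else float("inf")
        for m, c in zip(means, counts)
    ]


def regret(optimal_reward_per_trial, accumulated_reward, trials):
    """Assumed regret: reward of always playing the best super arm minus the reward collected."""
    return optimal_reward_per_trial * trials - accumulated_reward
```

In this sketch, each RRH would keep one (mean, count) pair per energy package, and the super arm would be formed by picking, per RRH, the package with the largest adjusted reward.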