|
Algorithm 4 The two-layer hybrid MRA training algorithm with the pricing strategy. |
(Input), ⋯;
⋯
for episode = 1 to do
for time to do
⋯
Observe next state ;
Obtain , by using (50)
for each link i do
for iteration to do
Update by using (47) with;
Update by using (48) with;
⋯
end for
end for
⋯
end for
end for
|