Sensors. 2022 Dec 7;22(24):9595. doi: 10.3390/s22249595
Algorithm 1 FL-DDPG-based Offloading Method
Initialize actor network μ and critic network Q
Initialize target networks μ′ and Q′ with the weights of μ and Q
Initialize experience replay memory
for each episode e ∈ E do
 Initialize parameters for simulation in the VEC environment
 Randomly generate an initial state s_n^t for each vehicle agent n ∈ N
 for each time slot t ∈ T do
  for each agent n ∈ N do
   Determine the transmission power for offloading by selecting an action
   a_n^t = μ(s_n^t) + Δμ, where Δμ is sampled noise
   Execute action a_n^t, receive reward r_n^t, and observe the new state s_n^{t+1}
   Store the tuple (s_n^t, a_n^t, r_n^t, s_n^{t+1}) in the experience replay memory
   Randomly sample a mini-batch from the experience replay memory
   Update the critic and actor networks via Equations (14) and (16)
   Update the target networks via Equations (17) and (18)
 if e == e_aggregation then
  Upload the parameters to the VECS controller according to Equation (19)
  Download the parameters from the VECS controller to each vehicle
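The loop above can be sketched end to end. The following is a minimal NumPy illustration under stated assumptions, not the paper's implementation: Equations (14)–(19) are replaced by generic single-sample DDPG updates and plain federated averaging, the VEC environment is a random toy stand-in, and all names and hyperparameters (STATE_DIM, TAU, AGG_EVERY, the linear actor/critic, etc.) are invented for the sketch.

```python
import numpy as np

rng = np.random.default_rng(0)

STATE_DIM, ACTION_DIM = 4, 1           # toy dimensions, not from the paper
TAU, GAMMA, LR = 0.01, 0.95, 1e-3      # soft-update rate, discount, learning rate
N_AGENTS, EPISODES, SLOTS, AGG_EVERY = 3, 4, 10, 2

def init_params():
    # Linear stand-ins: actor maps state -> action, critic maps [state, action] -> Q
    return {"actor": rng.normal(0, 0.1, (ACTION_DIM, STATE_DIM)),
            "critic": rng.normal(0, 0.1, (STATE_DIM + ACTION_DIM,))}

class Agent:
    def __init__(self):
        self.net = init_params()
        # Target networks start as copies of the online networks
        self.target = {k: v.copy() for k, v in self.net.items()}
        self.memory = []               # experience replay memory

    def act(self, s):
        # a_n^t = mu(s_n^t) + sampled exploration noise
        return self.net["actor"] @ s + rng.normal(0, 0.1, ACTION_DIM)

    def learn(self, batch=8):
        if len(self.memory) < batch:
            return
        for i in rng.choice(len(self.memory), batch, replace=False):
            s, a, r, s2 = self.memory[i]
            # TD target from the target networks
            a2 = self.target["actor"] @ s2
            y = r + GAMMA * self.target["critic"] @ np.concatenate([s2, a2])
            q = self.net["critic"] @ np.concatenate([s, a])
            # Critic: gradient step on (y - Q)^2; actor: ascend Q w.r.t. the action
            self.net["critic"] += LR * (y - q) * np.concatenate([s, a])
            grad_a = self.net["critic"][STATE_DIM:]
            self.net["actor"] += LR * np.outer(grad_a, s)
        # Soft target update: theta' <- tau * theta + (1 - tau) * theta'
        for k in self.net:
            self.target[k] = TAU * self.net[k] + (1 - TAU) * self.target[k]

def fed_avg(agents):
    # VECS controller averages the uploaded weights, then pushes them back
    for k in agents[0].net:
        mean = np.mean([ag.net[k] for ag in agents], axis=0)
        for ag in agents:
            ag.net[k] = mean.copy()

agents = [Agent() for _ in range(N_AGENTS)]
for e in range(EPISODES):
    states = [rng.normal(size=STATE_DIM) for _ in range(N_AGENTS)]
    for t in range(SLOTS):
        for n, ag in enumerate(agents):
            a = ag.act(states[n])
            # Toy environment: reward penalizes large transmit power; random next state
            r = -float(np.abs(a).sum())
            s2 = rng.normal(size=STATE_DIM)
            ag.memory.append((states[n], a, r, s2))
            ag.learn()
            states[n] = s2
    if (e + 1) % AGG_EVERY == 0:       # periodic federated aggregation round
        fed_avg(agents)
```

After an aggregation round every agent holds identical actor and critic weights, which is the federated-averaging step that distinguishes FL-DDPG from running independent DDPG learners per vehicle.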