Data-Driven Self-Triggered Control for Networked Motor Control Systems Using RNNs and Pre-Training: A Hierarchical Reinforcement Learning Framework

2024 Mar 20;24(6):1986. doi: 10.3390/s24061986

Algorithm 2 Pre-training method for control actor network.

Initialize $μ, K$
for $ep = 0, 1, \dots, N$ do
$t \leftarrow 0, x_{0} \leftarrow \mathrm{random}(x)$
for $t \leq t_{\mathrm{final}}$ do
Choose $τ_{t} = H_{σ} (x_{t})$
Choose $u_{t} = U_{λ} (x_{t}, τ_{t})$
Apply $u_{t}$ and hold it for the sampling interval $τ_{t}$
Add ( $x_{t}$ , $τ_{t}$ , $u_{t}$ ) to the data buffer $D$
if buffer size $\geq$ mini-batch size then
for data in $D$ do
Generate ${\hat{u}}_{t} = U_{μ} (x_{t}, τ_{t})$ with the control strategy network
Compute $L (μ) = \frac{1}{n} \sum_{i = 1}^{n} {({\hat{u}}_{t_{i}} - u_{t_{i}})}^{2}$
Update the network parameters $μ$ with the Adam optimizer
end for
end if
end for
end for
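
The loop in Algorithm 2 can be illustrated with a minimal Python sketch. Everything concrete here is an assumption made for illustration and is not taken from the paper: the scalar plant, the stand-in functions `H` and `U_expert` (playing the roles of $H_{σ}$ and $U_{λ}$), the gains `K` and `c`, and the linear student that replaces the control actor network. Adam is written out by hand so that the example needs only the standard library.

```python
import random


def pretrain_control_actor(episodes=200, t_final=1.0, batch_size=32,
                           lr=0.02, seed=0):
    """Supervised pre-training of a toy control actor (all models hypothetical)."""
    rng = random.Random(seed)
    K, c = 2.0, 0.5   # hypothetical expert gains
    a, b = 1.0, 1.0   # hypothetical scalar plant: dx/dt = a*x + b*u

    def H(x):
        # Stand-in for H_sigma: shorter sampling interval when |x| is large.
        return 0.05 + 0.10 / (1.0 + abs(x))

    def U_expert(x, tau):
        # Stand-in for U_lambda: the policy whose outputs are imitated.
        return -K * x + c * tau

    # Student parameters (mu in Algorithm 2): u_hat = w0*x + w1*tau + w2.
    w = [0.0, 0.0, 0.0]
    m = [0.0, 0.0, 0.0]   # Adam first-moment estimates
    v = [0.0, 0.0, 0.0]   # Adam second-moment estimates
    beta1, beta2, eps = 0.9, 0.999, 1e-8
    step = 0
    buffer, losses = [], []

    for _ in range(episodes):
        t, x = 0.0, rng.uniform(-1.0, 1.0)
        while t <= t_final:
            tau = H(x)
            u = U_expert(x, tau)
            buffer.append((x, tau, u))
            # Hold u over the interval tau (forward-Euler plant step).
            x += tau * (a * x + b * u)
            t += tau

            if len(buffer) < batch_size:
                continue

            # One mini-batch update per environment step (an assumption).
            batch = rng.sample(buffer, batch_size)
            grad = [0.0, 0.0, 0.0]
            loss = 0.0
            for xs, ts, us in batch:
                feats = (xs, ts, 1.0)
                err = sum(wi * fi for wi, fi in zip(w, feats)) - us
                loss += err * err / batch_size
                for j in range(3):
                    grad[j] += 2.0 * err * feats[j] / batch_size

            # Adam update with bias-corrected moment estimates.
            step += 1
            for j in range(3):
                m[j] = beta1 * m[j] + (1.0 - beta1) * grad[j]
                v[j] = beta2 * v[j] + (1.0 - beta2) * grad[j] ** 2
                m_hat = m[j] / (1.0 - beta1 ** step)
                v_hat = v[j] / (1.0 - beta2 ** step)
                w[j] -= lr * m_hat / (v_hat ** 0.5 + eps)
            losses.append(loss)

    return w, losses
```

Calling `w, losses = pretrain_control_actor()` returns the fitted student parameters and the per-update mini-batch losses. In this sketch the student is linear so that the mean-squared-error gradient can be written explicitly; a neural-network actor would replace the feature dot product and the hand-written gradient with a forward and backward pass.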