2024 Mar 20;24(6):1986. doi: 10.3390/s24061986
Algorithm 1 Model-free self-triggered control based on HRL.
  • Initialize $W$, $\theta$, $\beta$

  • for $ep = 0, 1, \ldots, N$ do

  •     $t \leftarrow 0$, $x_0 \leftarrow \mathrm{random}(x)$

  •     for $t \le t_{\mathrm{final}}$ do

  •         Choose $\bar{u}_l$ and $\tau_t$ using the actor network with an added noise signal

  •         Apply $\bar{u}_l$ for the duration $\tau_t$ and read $x_t$, $r_t$ at each sampling time unit $dt$

  •         Add $(x_t, \bar{u}_l, n \cdot dt, r_t)$ to the experience replay buffer $D$

  •         if buffer size $\ge$ mini-batch size then

  •            for data in buffer do

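  •                Compute $L(W) := \frac{1}{n}\sum_{i=1}^{n}\left( G(x_{t_i}, \bar{u}_{l_i}, \tau_{t_i} \mid W) - \sum_{t=t_l}^{t_l+\tau_{t_i}} \left( x_{t_i}^{T} Q x_{t_i} + \bar{u}_{l_i}^{T} R \bar{u}_{l_i} \right) \right)^{2}$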

  •                Update the critic network parameters $W$ using the Adam optimizer

  •                Compute the performance evaluation metrics $J$ and $\eta$

  •                Update the actor network parameters $\theta$ and $\mu$ using Equation (24)

  •            end for

  •         end if

  •     end for

  • end for
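
The data-collection and critic-update steps of Algorithm 1 can be illustrated with a minimal Python sketch. Everything specific in it is an assumption made for illustration and does not come from the paper: the toy linear plant (`A`, `B`), the weights `Q` and `R`, the quadratic-feature linear critic standing in for the critic network, the uniformly random policy standing in for the noisy actor, and plain gradient descent in place of Adam. The buffer here stores the cost accumulated over each hold interval rather than the per-sample reward $r_t$, and the actor update of Equation (24) is omitted.

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical toy plant (assumption, not from the source): a stable
# discrete-time linear system sampled every dt.
A = np.array([[0.9, 0.1], [0.0, 0.9]])
B = np.array([[0.0], [0.1]])
Q = np.eye(2)            # state weighting matrix
R = 0.1 * np.eye(1)      # input weighting matrix
MAX_HOLD = 5             # longest hold interval, in sampling steps
BATCH = 32               # mini-batch size


def hold_and_accumulate(x, u, n_steps):
    """Hold u for n_steps samples; return final state and summed stage cost."""
    cost = 0.0
    for _ in range(n_steps):
        cost += float(x @ Q @ x + u @ R @ u)
        x = A @ x + B @ u
    return x, cost


def features(x, u, n_steps):
    """Quadratic features of (x, u, tau) for a linear critic (assumption)."""
    z = np.concatenate([x, u, [n_steps / MAX_HOLD]])
    quad = np.outer(z, z)[np.triu_indices(z.size)]
    return np.concatenate([quad, z, [1.0]])


def critic_loss_and_grad(W, batch):
    """L(W) = mean_i (G(x_i, u_i, tau_i | W) - accumulated cost_i)^2."""
    Phi = np.stack([features(x, u, n) for x, u, n, _ in batch])
    target = np.array([c for _, _, _, c in batch])
    err = Phi @ W - target
    return float(np.mean(err ** 2)), 2.0 * Phi.T @ err / len(batch)


# Collect experience with a random stand-in for the noisy actor.
buffer = []
for ep in range(50):
    x = rng.uniform(-1.0, 1.0, size=2)
    t, t_final = 0, 20
    while t <= t_final:
        u = rng.uniform(-1.0, 1.0, size=1)          # held control input
        n = int(rng.integers(1, MAX_HOLD + 1))      # hold interval in steps
        x_next, cost = hold_and_accumulate(x, u, n)
        buffer.append((x, u, n, cost))
        x, t = x_next, t + n

# Critic update on random mini-batches (plain gradient descent, not Adam).
W = np.zeros(features(np.zeros(2), np.zeros(1), 1).size)
loss_before, _ = critic_loss_and_grad(W, buffer)
for _ in range(500):
    idx = rng.choice(len(buffer), size=BATCH, replace=False)
    _, grad = critic_loss_and_grad(W, [buffer[i] for i in idx])
    W -= 0.01 * grad
loss_after, _ = critic_loss_and_grad(W, buffer)
print(f"critic loss: {loss_before:.3f} -> {loss_after:.3f}")
```

The hold interval is normalized by `MAX_HOLD` inside `features` so that all critic inputs share a similar scale, which keeps the fixed step size of 0.01 stable in this toy setting. A neural critic trained with Adam, as in Algorithm 1, would not need this particular choice.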