2024 Mar 20;24(6):1986. doi: 10.3390/s24061986
Algorithm 2. Pre-training method for the control actor network.
  • Initialize $\mu$, $K$

  • for $ep = 0, 1, \ldots, N$ do

  •     $t \leftarrow 0$, $x_0 \leftarrow \mathrm{random}(x)$

  •     for $t \le t_{\mathrm{final}}$ do

  •         Choose $\tau_t = H_\sigma(x_t)$

  •         Choose $u_t = U_\lambda(x_t, \tau_t)$

  •         Apply $u_t$ after the sampling time unit $\tau_t$

  •         Add $(x_t, \tau_t, u_t)$ to the data buffer $D$

  •         if buffer size $\ge$ mini-batch size then

  •            for data in $D$ do

  •                Generate $\hat{u}_t = U_\eta(x_t, \tau_t)$ with the control strategy network

  •                Compute $L(\mu) = \frac{1}{n} \sum_{i=1}^{n} \left\lVert \hat{u}_t^{i} - u_t^{i} \right\rVert^{2}$

  •                Update the network parameters $\mu$ using the Adam optimizer

  •            end for

  •         end if

  •     end for

  • end for
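
The pre-training loop in Algorithm 2 can be sketched in plain Python as supervised regression of the actor onto the reference controller's outputs. This is a minimal illustration under stated assumptions, not the authors' implementation: `h_sigma`, `u_lambda`, the scalar toy plant, the linear actor, and all hyperparameters are hypothetical stand-ins for $H_\sigma$, $U_\lambda$, the real system, and the neural actor network. The inner "for data in $D$" step is interpreted here as one Adam update on a random mini-batch drawn from the buffer.

```python
import math
import random

def h_sigma(x):
    """Hypothetical stand-in for H_sigma: shorter intervals for larger |x|."""
    return 0.05 + 0.1 / (1.0 + abs(x))

def u_lambda(x, tau):
    """Hypothetical stand-in for U_lambda: a fixed linear feedback law."""
    return -2.0 * x + 0.5 * tau

def actor(mu, x, tau):
    """Assumed linear actor: u_hat = mu[0]*x + mu[1]*tau + mu[2]."""
    return mu[0] * x + mu[1] * tau + mu[2]

def mse(mu, batch):
    """L(mu) = (1/n) * sum_i (u_hat_i - u_i)^2 over (x, tau, u) tuples."""
    return sum((actor(mu, x, tau) - u) ** 2 for x, tau, u in batch) / len(batch)

def mse_grad(mu, batch):
    """Gradient of the mean squared error with respect to mu."""
    grad = [0.0, 0.0, 0.0]
    for x, tau, u in batch:
        err = actor(mu, x, tau) - u
        for j, feat in enumerate((x, tau, 1.0)):
            grad[j] += 2.0 * err * feat / len(batch)
    return grad

def adam_step(mu, grad, m, v, k, lr=0.02, b1=0.9, b2=0.999, eps=1e-8):
    """One in-place Adam update; k is the 1-based step counter."""
    for j in range(len(mu)):
        m[j] = b1 * m[j] + (1 - b1) * grad[j]
        v[j] = b2 * v[j] + (1 - b2) * grad[j] ** 2
        m_hat = m[j] / (1 - b1 ** k)
        v_hat = v[j] / (1 - b2 ** k)
        mu[j] -= lr * m_hat / (math.sqrt(v_hat) + eps)

def pretrain(n_episodes=100, t_final=2.0, batch_size=32, seed=0):
    rng = random.Random(seed)
    mu, m, v = [0.0] * 3, [0.0] * 3, [0.0] * 3   # actor parameters and Adam moments
    buffer, k = [], 0
    for _ in range(n_episodes):
        t, x = 0.0, rng.uniform(-1.0, 1.0)       # t <- 0, random initial state
        while t <= t_final:
            tau = h_sigma(x)                      # choose sampling interval
            u = u_lambda(x, tau)                  # choose reference control
            buffer.append((x, tau, u))            # store (x_t, tau_t, u_t)
            x += tau * (x + u)                    # toy plant x_dot = x + u, Euler step over tau
            t += tau
            if len(buffer) >= batch_size:
                batch = rng.sample(buffer, batch_size)
                k += 1
                adam_step(mu, mse_grad(mu, batch), m, v, k)
    return mu, buffer

if __name__ == "__main__":
    mu, buffer = pretrain()
    print("actor parameters:", mu)
    print("imitation loss on buffer:", mse(mu, buffer))
```

Because the toy reference law is itself linear, the linear actor can represent it exactly, so the imitation loss on the buffer should fall well below its value at the zero initialization. With a neural actor the same loop would apply, with the hand-written gradient and Adam step replaced by a deep learning framework's optimizer.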