Skip to main content
. 2026 Apr 17;13(4):474. doi: 10.3390/bioengineering13040474
Algorithm A1 PDC-KD Training Procedure
  • 1:

     Input: Dataset D, teacher model T, student model S

  • 2:

     Initialize student parameters θS; set Cholesky factor L=I

  • 3:

     Load precomputed Teacher Fisher matrix FT and projection operator PT

  • 4:

     for epoch =1 to 200 do

  • 5:

        if epoch <10 then

  • 6:

            Apply linear warm-up

  • 7:

        end if

  • 8:

        for batch (x,y)D do

  • 9:

             Compute teacher output τT=T(xcwt)

  • 10:

            Compute student output τS=S(xraw)

  • 11:

            Apply SG filtering to obtain (ω˙filt,ωfilt)

  • 12:

            Compute equivalent inertia tensor Ieff=LL

  • 13:

            Compute Ldata,LKD,LFisher,Lsubspace,Lphy

  • 14:

            Aggregate total loss Ltotal (Equation (10))

  • 15:

            Perform backpropagation with gradient clipping to update θS and L

  • 16:

        end for

  • 17:

     end for

  • 18:

     Output: Trained student model S