Skip to main content
. 2022 Nov 28;11(23):3845. doi: 10.3390/foods11233845

Table 6.

Training parameters setting of DDPG.

Training Parameters Values
Size of experience pool 5000
Number of cycles in the outer layer 2000
Time step of inner layer 7
Adjustment time of inner action space 60
Threshold of discarding the sample 0.5
Discount factor 0.9
Learning rates of actual network 0.0001
Learning rates of target network 0.01
Soft update rate 0.001
Exploration noise 0.01