Skip to main content
. 2022 Oct 7;13(10):1688. doi: 10.3390/mi13101688

Table A3.

Training Hyper Parameters.

Parameter Value
Parallel Environment 150
Maximum time steps per epoch (Stage One) 2000
Learning Rate 1 × 104
Discount Factor 0.996
Curriculum Reward Threshold 2300