2025 Feb 18;11:e2690. doi: 10.7717/peerj-cs.2690

Table 5. Parameter configuration of the actor-critic neural networks and agent.

Parameter	Actor network	Critic network
$X_{i}$ Input layer size	19	19
Hidden layer $L_{1}$	150	150
Activation function $L_{1}$	Tanh	Tanh
Hidden layer $L_{2}$	150	150
Activation function $L_{2}$	Tanh	Tanh
Hidden layer $L_{3}$	100	100
Activation function $L_{3}$	ReLU	ReLU
$Y_{i}$ Output layer size	27	1
AC agent parameters
Number of steps to look ahead	70	70
Learning rate	0.001	0.001
Entropy loss weight	0.25	0.25
Gradient threshold	1	1
Discount factor	0.91	0.91
Max number of episodes	5,000	5,000
Max steps per episode	4,000	4,000
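
To make the layer sizes in Table 5 concrete, the following is a minimal sketch of the two networks in plain Python. Only the layer widths and activation functions come from the table. The weight initialisation, the linear output layers, the softmax over the 27 actor outputs (which assumes they represent discrete action preferences), and all function names are assumptions made for illustration, not the authors' implementation.

```python
import math
import random

# Layer sizes and activations from Table 5. Initialisation, the softmax head
# and every helper name below are illustrative assumptions.
INPUT_SIZE = 19
HIDDEN = [(150, "tanh"), (150, "tanh"), (100, "relu")]
ACTOR_OUTPUTS = 27
CRITIC_OUTPUTS = 1


def make_layer(n_in, n_out, rng):
    """Return (weights, biases) with small uniform random weights."""
    bound = 1.0 / math.sqrt(n_in)
    weights = [[rng.uniform(-bound, bound) for _ in range(n_in)]
               for _ in range(n_out)]
    return weights, [0.0] * n_out


def build_network(n_outputs, rng):
    """Stack input -> L1 -> L2 -> L3 -> output with a linear final layer."""
    sizes = [INPUT_SIZE] + [n for n, _ in HIDDEN] + [n_outputs]
    activations = [a for _, a in HIDDEN] + ["linear"]
    layers = []
    for n_in, n_out, act in zip(sizes[:-1], sizes[1:], activations):
        weights, biases = make_layer(n_in, n_out, rng)
        layers.append((weights, biases, act))
    return layers


def apply_activation(values, name):
    if name == "tanh":
        return [math.tanh(v) for v in values]
    if name == "relu":
        return [max(0.0, v) for v in values]
    return values  # linear


def forward(layers, x):
    """Propagate one input vector through all layers."""
    for weights, biases, act in layers:
        pre = [sum(w * xi for w, xi in zip(row, x)) + b
               for row, b in zip(weights, biases)]
        x = apply_activation(pre, act)
    return x


def softmax(logits):
    peak = max(logits)
    exps = [math.exp(v - peak) for v in logits]
    total = sum(exps)
    return [e / total for e in exps]


def count_parameters(layers):
    """Number of weights plus biases across all layers."""
    return sum(len(w) * len(w[0]) + len(b) for w, b, _ in layers)


rng = random.Random(0)
actor = build_network(ACTOR_OUTPUTS, rng)
critic = build_network(CRITIC_OUTPUTS, rng)

state = [rng.uniform(-1.0, 1.0) for _ in range(INPUT_SIZE)]
action_probs = softmax(forward(actor, state))  # 27 probabilities (assumed head)
state_value = forward(critic, state)[0]        # scalar value estimate

print(len(action_probs), count_parameters(actor), count_parameters(critic))
```

With these sizes the actor has 43,477 trainable parameters and the critic 40,851. The two networks differ only in the final layer (100 × 27 + 27 versus 100 × 1 + 1).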
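
The look-ahead and discount settings in Table 5 interact, and a short sketch may help show how. The code below computes a standard n-step bootstrapped return using the table's discount factor (0.91) and look-ahead horizon (70). It assumes the agent uses this textbook form of the return, which the table itself does not state. The function names and the layout of the `values` list are hypothetical.

```python
GAMMA = 0.91   # discount factor from Table 5
N_STEPS = 70   # number of steps to look ahead from Table 5


def n_step_return(rewards, bootstrap_value, gamma=GAMMA):
    """Discounted sum of rewards plus the discounted bootstrap value."""
    g = bootstrap_value
    for r in reversed(rewards):
        g = r + gamma * g
    return g


def lookahead_targets(rewards, values, n_steps=N_STEPS, gamma=GAMMA):
    """Critic targets for every step of one trajectory.

    `values` must hold len(rewards) + 1 entries: the critic's estimate for
    each visited state plus the final state (0.0 if that state is terminal).
    """
    horizon = len(rewards)
    targets = []
    for t in range(horizon):
        end = min(t + n_steps, horizon)
        targets.append(n_step_return(rewards[t:end], values[end], gamma))
    return targets


# Weight carried by the bootstrap value after a full 70-step look-ahead.
bootstrap_weight = GAMMA ** N_STEPS
```

Because 0.91 raised to the 70th power is on the order of 0.001, a full 70-step look-ahead leaves the bootstrapped critic estimate with almost no weight, so under this assumption the targets are dominated by the observed rewards.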