Skip to main content
. 2023 Feb 23;14:1040. doi: 10.1038/s41467-023-36583-0
Actor layer widths 250, 150, 50
Representation width 50
Critic stimulus layer widths 10, 5
Critic action layer widths 1
Critic shared layer widths 5
Batch size 200
Training epochs 50119
Initial batches 20000
Reward/punishment threshold ±0.33