2021 May 6;15(3):034101. doi: 10.1063/5.0032377

TABLE II.

Q-values when converged at γ = 0.90.

State       Action (next state)
            0        1        2        3        4        5        6        7
            (0,0,0)  (0,0,1)  (0,1,0)  (0,1,1)  (1,0,0)  (1,0,1)  (1,1,0)  (1,1,1)
0 (0,0,0)   840.6    783.3    830.4    797.6    922.4    828.5    932.3    856.8
1 (0,0,1)   798.7    718.1    617.3    696.3    717.0    714.8    682.8    676.9
2 (0,1,0)   810.7    863.3    774.8    842.0    735.0    860.7    785.6    824.0
3 (0,1,1)   776.4    720.2    607.9    698.3    555.8    716.6    620.5    678.4
4 (1,0,0)   807.0    835.5    896.3    938.5    843.2    831.6    907.7    925.4
5 (1,0,1)   779.9    720.2    657.1    698.8    728.7    717.0    796.9    748.6
6 (1,1,0)   768.4    866.0    778.9    844.5    718.9    864.0    779.1    824.9
7 (1,1,1)   756.4    721.8    643.8    699.9    596.7    718.4    662.8    680.2
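
One common way to read a converged Q-table is to pick, for each state, the action with the largest Q-value (the greedy policy). The sketch below does this for the values in Table II. It is an illustration only: the names `Q` and `greedy_policy` are hypothetical and not taken from the paper, and it assumes that rows are indexed by the current state and columns by the action (next state), as the table header indicates.

```python
# Q-values from Table II (gamma = 0.90); rows = current state 0..7,
# columns = action (next state) 0..7. Layout is assumed from the table header.
Q = [
    [840.6, 783.3, 830.4, 797.6, 922.4, 828.5, 932.3, 856.8],
    [798.7, 718.1, 617.3, 696.3, 717.0, 714.8, 682.8, 676.9],
    [810.7, 863.3, 774.8, 842.0, 735.0, 860.7, 785.6, 824.0],
    [776.4, 720.2, 607.9, 698.3, 555.8, 716.6, 620.5, 678.4],
    [807.0, 835.5, 896.3, 938.5, 843.2, 831.6, 907.7, 925.4],
    [779.9, 720.2, 657.1, 698.8, 728.7, 717.0, 796.9, 748.6],
    [768.4, 866.0, 778.9, 844.5, 718.9, 864.0, 779.1, 824.9],
    [756.4, 721.8, 643.8, 699.9, 596.7, 718.4, 662.8, 680.2],
]


def greedy_policy(q_table):
    """Return, for each state (row), the index of the action with the largest Q-value."""
    return [max(range(len(row)), key=lambda a: row[a]) for row in q_table]


def as_bits(index):
    """Format a state/action index 0..7 as the three-bit tuple used in the table."""
    return "({},{},{})".format((index >> 2) & 1, (index >> 1) & 1, index & 1)


if __name__ == "__main__":
    for state, action in enumerate(greedy_policy(Q)):
        print(f"state {state} {as_bits(state)} -> action {action} {as_bits(action)}"
              f"  Q = {Q[state][action]}")
```

Under these assumptions the greedy action for state 0 is action 6, i.e. (1,1,0), with Q = 932.3, and the largest entry in the whole table is 938.5 for state 4 taking action 3.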