TABLE II.
Q-values when converged at γ = 0.90.
State | Action (next state) | ||||||||
---|---|---|---|---|---|---|---|---|---|
0 | 1 | 2 | 3 | 4 | 5 | 6 | 7 | ||
(0,0,0) | (0,0,1) | (0,1,0) | (0,1,1) | (1,0,0) | (1,0,1) | (1,1,0) | (1,1,1) | ||
0 | (0,0,0) | 840.6 | 783.3 | 830.4 | 797.6 | 922.4 | 828.5 | 932.3 | 856.8 |
1 | (0,0,1) | 798.7 | 718.1 | 617.3 | 696.3 | 717.0 | 714.8 | 682.8 | 676.9 |
2 | (0,1,0) | 810.7 | 863.3 | 774.8 | 842.0 | 735.0 | 860.7 | 785.6 | 824.0 |
3 | (0,1,1) | 776.4 | 720.2 | 607.9 | 698.3 | 555.8 | 716.6 | 620.5 | 678.4 |
4 | (1,0,0) | 807.0 | 835.5 | 896.3 | 938.5 | 843.2 | 831.6 | 907.7 | 925.4 |
5 | (1,0,1) | 779.9 | 720.2 | 657.1 | 698.8 | 728.7 | 717.0 | 796.9 | 748.6 |
6 | (1,1,0) | 768.4 | 866.0 | 778.9 | 844.5 | 718.9 | 864.0 | 779.1 | 824.9 |
7 | (1,1,1) | 756.4 | 721.8 | 643.8 | 699.9 | 596.7 | 718.4 | 662.8 | 680.2 |