Table 2.
Reward matrix.
R | 1 | 2 | 3 | 4 | 5 | 6 | 7 |
---|---|---|---|---|---|---|---|
1 | −1 | 3.6 | 3.1 | 3.6 | −1 | −1 | −1 |
2 | 0 | −1 | 4 | −1 | −1 | −1 | −1 |
3 | 0 | 0 | −1 | 7.6 | −1 | −1 | −1 |
3 | 0 | 0 | −1 | 7.2 | −1 | −1 | −1 |
4 | 0 | −1 | 0 | −1 | 8.1 | 8.1 | −1 |
5 | −1 | −1 | −1 | 0 | −1 | −1 | 10 |
6 | −1 | −1 | −1 | 0 | −1 | −1 | 10 |
7 | −1 | −1 | −1 | −1 | 0 | 0 | −1 |