Figure 5. Demonstration of single-photon decision maker (1).
(a) The correct selection rate, or the rate of making an accurate decision (i.e., selecting the slot machine with higher reward probability) as a function of time. The correct selection rate increases with time even after the sudden inversion of the reward probability that is induced in the system every 150 cycles. (b) Evolution of the associated PA values.
