Table 2.
Primary study | Replication study | t-value | |
---|---|---|---|
Exploration | 0.52 (0.09) | 0.48 (0.10) | 2.29* |
Strategic exploration | 0.03 (0.10) | 0.03 (0.14) | 0.08 |
Reaction time | 1.44 (0.58) | 1.50 (1.00) | −0.38 |
Reward maximization | 0.82 (0.12) | 0.79 (0.12) | 1.90 |
Mean reward gained | 55.16 (1.90) | 54.02 (1.80) | 2.35* |
Note: Exploration was the percentage of choosing the more informative options at the first free choice averaging across trials. Strategic exploration was the difference of exploration rate in long-horizon versus short-horizon games. Reaction time was the reaction time in seconds at the first free choice. Reward maximization was the percentage of choosing the bandit with a higher mean payout history at the last trial in long-horizon games. Mean reward gained was rewards gained in all free choices averaging across trials. Two-sample t tests were used to compare task performance between the Primary and Replication studies. *P < 0.05, **P < 0.01, ***P < 0.001.