2023 Nov 27;120(49):e2303869120. doi: 10.1073/pnas.2303869120

Table 2.

Task performance (Mean and SD) by study

Measure	Primary study	Replication study	t-value
Exploration	0.52 (0.09)	0.48 (0.10)	2.29*
Strategic exploration	0.03 (0.10)	0.03 (0.14)	0.08
Reaction time	1.44 (0.58)	1.50 (1.00)	−0.38
Reward maximization	0.82 (0.12)	0.79 (0.12)	1.90
Mean reward gained	55.16 (1.90)	54.02 (1.80)	2.35*

Note: Exploration was the proportion of first free choices on which the more informative option was chosen, averaged across trials. Strategic exploration was the difference in exploration rate between long-horizon and short-horizon games. Reaction time was the response time, in seconds, at the first free choice. Reward maximization was the proportion of choices of the bandit with the higher mean payout history on the last trial of long-horizon games. Mean reward gained was the reward gained across all free choices, averaged across trials. Two-sample t tests were used to compare task performance between the Primary and Replication studies. *P < 0.05, **P < 0.01, ***P < 0.001.
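
As an illustration of the comparison described in the note, a two-sample t statistic can be computed from summary statistics alone. The sketch below assumes the equal-variance (Student) form of the test; the note does not say whether the pooled or the Welch variant was used, and the table does not report sample sizes. The function name `pooled_t` and the example inputs are hypothetical and are not values from either study.

```python
import math


def pooled_t(mean1, sd1, n1, mean2, sd2, n2):
    """Student's two-sample t statistic from group means, SDs, and sizes.

    Assumes equal variances in the two groups (an assumption of this
    sketch, not something stated in the table note).
    Returns the t statistic and its degrees of freedom.
    """
    df = n1 + n2 - 2
    # Pooled variance: each group's variance weighted by its degrees of freedom.
    pooled_var = ((n1 - 1) * sd1 ** 2 + (n2 - 1) * sd2 ** 2) / df
    # Standard error of the difference between the two group means.
    se = math.sqrt(pooled_var * (1 / n1 + 1 / n2))
    return (mean1 - mean2) / se, df


# Hypothetical inputs chosen only to exercise the function.
t, df = pooled_t(mean1=1.0, sd1=1.0, n1=10, mean2=0.0, sd2=1.0, n2=10)
print(f"t = {t:.3f}, df = {df}")
```

Reproducing the t-values in the table would additionally require the per-study sample sizes, which are not given here.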