Children’s strategies. Children in Experiment 1 appeared to follow one of two distinct strategies, well captured by the Value and Lag models, respectively. These strategies differed in a few key ways. Value-based participants (N=15) exploited the best option often, explored little, and otherwise showed no particular pattern in their sequences of choices. Lag-based participants (N=17) chose all options equally often and switched responses at extremely high rates, almost never picking the same option twice in a row. Darker colors represent higher transition probabilities. The predictions for each model are based on simulations using the best-fitting parameters of the children who were best fit by that model.
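As an illustration of the quantity shaded in the figure, the following minimal sketch estimates first-order transition probabilities from a sequence of choices. It is not the authors' analysis code. The function names `transition_matrix` and `repeat_rate`, the use of four options, and both example sequences are assumptions made for illustration.

```python
from collections import Counter


def transition_matrix(choices, n_options):
    """Estimate first-order transition probabilities from a choice sequence.

    Entry [i][j] is the proportion of times option j was chosen
    immediately after option i. Rows for options that were never
    followed by another choice are left as all zeros.
    """
    counts = Counter(zip(choices, choices[1:]))
    matrix = []
    for i in range(n_options):
        row_total = sum(counts[(i, j)] for j in range(n_options))
        matrix.append([
            counts[(i, j)] / row_total if row_total else 0.0
            for j in range(n_options)
        ])
    return matrix


def repeat_rate(choices):
    """Proportion of consecutive choice pairs that repeat the same option."""
    pairs = list(zip(choices, choices[1:]))
    return sum(a == b for a, b in pairs) / len(pairs) if pairs else 0.0


# Hypothetical sequences (not data from the experiment), options coded 0-3.
value_like = [2, 2, 2, 0, 2, 2, 1, 2, 2, 2]  # mostly exploits option 2
lag_like = [0, 1, 2, 3, 0, 1, 2, 3, 0, 1]    # never repeats an option

value_matrix = transition_matrix(value_like, n_options=4)
lag_matrix = transition_matrix(lag_like, n_options=4)
```

In this sketch, a Value-like sequence concentrates probability mass in the column of the preferred option, whereas a Lag-like sequence yields a near-empty diagonal, since the same option is almost never chosen twice in a row.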