To measure the similarity in choices across mice within a lab, we computed
within-lab choice consistency. For each lab and each stimulus, we computed the variance across mice in the fraction of rightward choices. We then computed the inverse (consistency) and averaged the result across stimuli. (
a) Within-lab choice consistency for the basic task (same data as in
Figure 3) for each lab (
dots) and averaged across labs (
line). This averaged consistency was not significantly higher (p=0.73) than a null distribution generated by randomly shuffling lab assignments between mice and computing the average within-lab choice variability 10,000 times (
violin plot). Therefore, choices were no more consistent within labs than across labs. (
b) Same analysis, for the full task (same data as in
Figure 4). Within-lab choice consistency on the full task was not higher than expected by chance, p=0.25. In this analysis we computed consistency separately for each stimulus and prior block before averaging across them. Choice consistency was higher on the full task than the basic task; this likely reflects both increased training on the task, and a stronger constraint on choice behavior through the full task’s block structure. (
c) As in a, b, but measuring the within-lab consistency of ‘bias shift’ between the 20:80 and 80:20 blocks (as in
Figure 4d,e). Within-lab consistency in bias shift was not higher than expected by chance (p=0.31).