Inter-reader scoring variability for low-, moderate-, and high-activity cages. Three independent readers scored 7 cages that had been pre-selected to exemplify different stages of behavior, as detailed in Table 2. Scores for the same cage did not differ significantly between readers (ANOVA, p>0.05). Readers reliably distinguished low-, moderate-, and high-activity cages (Student’s t-test, p<0.05).
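As an illustration of the kind of between-reader comparison described in the caption, the following is a minimal sketch of a one-way ANOVA F statistic computed across readers. It is not the authors' analysis code: the function name `one_way_anova_f`, the variable names, and all score values are hypothetical placeholders, and the scoring scale is assumed. Only the F statistic and its degrees of freedom are computed; obtaining a p-value would additionally require the F distribution from a statistics package.

```python
from statistics import mean


def one_way_anova_f(groups):
    """Return (F, df_between, df_within) for a one-way ANOVA.

    `groups` is a list of lists, one list of scores per reader.
    """
    k = len(groups)
    n_total = sum(len(g) for g in groups)
    grand_mean = mean(x for g in groups for x in g)

    # Variation of each reader's mean around the grand mean.
    ss_between = sum(len(g) * (mean(g) - grand_mean) ** 2 for g in groups)
    # Variation of individual scores around their own reader's mean.
    ss_within = sum((x - mean(g)) ** 2 for g in groups for x in g)

    df_between = k - 1
    df_within = n_total - k
    f_stat = (ss_between / df_between) / (ss_within / df_within)
    return f_stat, df_between, df_within


# HYPOTHETICAL scores: 3 readers x 7 cages. These values are placeholders
# for illustration only and are not data from the study.
reader_a = [1, 1, 2, 3, 3, 5, 5]
reader_b = [1, 2, 2, 3, 4, 5, 5]
reader_c = [1, 1, 2, 3, 3, 4, 5]

f_stat, df_between, df_within = one_way_anova_f([reader_a, reader_b, reader_c])
print(f"F({df_between}, {df_within}) = {f_stat:.3f}")
```

A small F statistic relative to the critical value for the given degrees of freedom would be consistent with no significant difference between readers. Because every reader scores the same cages, a repeated-measures design may be the more appropriate model; the one-way form is shown here only because it is the simplest case.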