(A) The striatum was imaged during conditioning sessions in which tones predicted reward delivery. (B) Two tones (4 and 11 kHz) were played (1.5 s duration) and were associated with distinct reward probabilities (80% or 20%). After a 0.5 s delay, reward could be delivered. Inter-trial interval durations varied from 5.25 to 8.75 s. (C) Frequency of licking after training, averaged over five mice (±SEM). Anticipatory licking was significantly higher during the presentation of the high-probability tone (blue) than during the presentation of the low-probability tone (green). After reward delivery, licking rates were elevated for several seconds (solid lines: rewarded trials; dotted lines: unrewarded trials). (D) Licking during the tone and reward delay, shown as z-scores calculated relative to the 2 s baseline period preceding the tone, during training sessions (average of 3 mice). Mice began to exhibit differences in levels of anticipatory licking between the two cues after 11–12 sessions. Animals were considered to be trained when they exhibited significantly higher anticipatory licking during the high-probability tone (blue) than during the low-probability tone (green) in 2 out of 3 consecutive sessions. Shading represents SEM.