a) Timeline of trials. Illustrated is the task from the perspective of the Responder: firstly a blank screen is presented for 500-1500msec (mean 1000msec) in the colour of the Proposer group (L, M or H); second, a panel containing the photographs of the Proposer group was added for 1500 msec; and third the proposal was then shown for 3000msec (denoted both numerically and visually by the height of the coin stacks) along with the instruction to accept or reject (side counterbalanced between subjects). Subjects understood that the silhouette represented the other subject attending that session, who had been placed in one of the three Proposer groups. During the 3000msec in which the proposal was shown, subjects had to decide by a button press whether to accept or reject the offer. Subjects saw a brief screen with “REST” displayed every 8-9 contiguous trials before an introductory screen announced the group or groups whose offers would be presented next (i.e. M group only; M and H groups; M and L groups). b) Order of conditions. Initially, in a reputation learning session performed outside the scanner subjects responded to the full set of 25 offers from the M Proposer group alone (grey in panel b), the 25 H offers alone (white in panel b) and the 25 L offers alone (black in panel b). Subjects then underwent the main testing session in the scanner or behaviourally, which comprised 3 runs. In each run of 125 trials, subjects responded to the M set of offers in 3 contexts: alone (“neutral”); interleaved with the L set (M-in-L; “more fair”); and interleaved with the H set (M-in-H; “less fair”).