Models 1–4 were defined using different combinations of parameters for reward sensitivity and the mapping of expected values to the drift rate. A ‘static’ reward sensitivity means that pain increase and pain decrease were defined as –1 and 1, respectively (see Equation 4). A ‘scaled’ outcome sensitivity means that pain decrease was defined as and pain decrease as (see Equation 5). A ‘linear’ drift rate mapping means that the drift rate for each trial was defined as the difference of expected values multiplied by (see Equation 7). A sigmoid mapping of the drift rate means that was defined by a sigmoid function bounded at . (see Equation 8 und Equation 9). All models included two learning rates ( , ), the non-decision time , the boundary separation , and the a priori bias .