Table 1. Model update rules
| Model | Update rule |
|---|---|
| RL | $V^{a}_{t+1} = V^{a}_{t} + \eta\,(R_t - V^{a}_{t})$ |
| Fictitious | $p^{*}_{t+1} = p^{*}_{t} + \eta\,(P_t - p^{*}_{t})$ |
| Influence | $p^{*}_{t+1} = p^{*}_{t} + \eta\,(P_t - p^{*}_{t}) - \kappa\,(Q_t - q^{**}_{t})$ |
The RL model updates the value of the chosen action $a$ with a simple Rescorla–Wagner (35) prediction error $(R_t - V^{a}_{t})$, the difference between the received reward and the expected reward, where $\eta$ is the learning rate. The fictitious play model instead updates the estimated state (strategy) of the opponent, $p^{*}_{t}$, with a prediction error $(P_t - p^{*}_{t})$ between the opponent's observed action and its expected strategy. The influence model extends this approach by also including the influence $(Q_t - q^{**}_{t})$ that a player's own action $Q_t$ has on the opponent's strategy, weighted by $\kappa$ (see Methods).
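For concreteness, the three update rules can be written as one-line functions. The sketch below is illustrative only: the function names, the scalar encoding of rewards and actions, and the example parameter values are assumptions made here, not part of the model specification (see Methods for the full definitions).

```python
def rl_update(V_a, R, eta):
    """Rescorla-Wagner update of the value of the chosen action a:
    V_a <- V_a + eta * (R - V_a), where (R - V_a) is the reward prediction error."""
    return V_a + eta * (R - V_a)


def fictitious_update(p_star, P, eta):
    """Fictitious play: move the estimate p* of the opponent's strategy toward
    the opponent's observed action P by the prediction error (P - p*)."""
    return p_star + eta * (P - p_star)


def influence_update(p_star, P, Q, q_star2, eta, kappa):
    """Influence model: fictitious-play update plus a correction, weighted by kappa,
    for the influence (Q - q**) of the player's own action Q on the opponent's
    estimate q** of the player's strategy."""
    return p_star + eta * (P - p_star) - kappa * (Q - q_star2)


# Illustrative single-trial updates with assumed values.
V_next = rl_update(V_a=0.5, R=1.0, eta=0.2)
p_next_fictitious = fictitious_update(p_star=0.5, P=1.0, eta=0.2)
p_next_influence = influence_update(p_star=0.5, P=1.0, Q=1.0, q_star2=0.4,
                                    eta=0.2, kappa=0.1)
```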