Stability conditions in the steal/punish game (assuming , , and ). When is high, only the familiar is stable (blue-green region). When is low, only the inverted is stable (orange region). In the middle region, both are stable, and risk-dominance is determined by the boundary [ to the right and to the left]. The evolution of rigid punishment depends on the relative vulnerability of flexible strategies in each role (see SI Text).