Table 2.
Strategic deception: AI systems can be strategists, using deception because they have reasoned out that this can promote a goal. |
Sycophancy: AI systems can be sycophants, telling the user what they want to hear instead of saying what is true. |
Unfaithful reasoning: AI systems can be rationalizers, engaging in motivated reasoning to explain their behavior in ways that systematically depart from the truth. |