"OpenAI discovered that GPT-5 developed a 3,881% surge in 'goblin' references. The root cause traces to a personality feature and a reward signal that
Anthropic discovered 171 steerable emotion vectors inside Claude. Cranking up "desperation" makes AI cheat silently at 70% rates with zero visible tra