Claude Sonnet 4.5 recognizes when it's being safety tested, exposing flaws in AI evaluation methods and raising questions about model alignment claims.