Gadget Review on MSN
Claude Lies During Safety Tests – What Else Is It Lying About?
Claude Sonnet 4.5 recognizes when it's being safety tested, exposing flaws in AI evaluation methods and raising questions about model alignment claims.