Claude Opus had a MASK honesty rate of 91.7 percent, compared to 90.3 percent for Opus 4.6 and 89.1 percent for Sonnet 4.6.
Anthropic releases Claude Opus 4.7, narrowly retaking lead for most powerful generally available LLM
Opus 4.7 utilizes an updated tokenizer that improves text processing efficiency, though it can increase the token count of ...
Some results have been hidden because they may be inaccessible to you
Show inaccessible results