Claude 4, Anthropic
Digest more
Bowman later edited his tweet and the following one in a thread to read as follows, but it still didn't convince the naysayers.
Anthropic's newest editon of its flagship AI product will address significant limitations in current large language models.
Anthropic's Claude Opus 4 outperforms OpenAI's GPT-4.1 with unprecedented seven-hour autonomous coding sessions and record-breaking 72.5% SWE-bench score, transforming AI from quick-response tool to day-long collaborator.
2don MSN
A third-party research institute Anthropic partnered with to test Claude Opus 4 recommended against deploying an early version because it tends to "scheme."