News

Anthropic launched Claude Opus 4.1. The model exceeds the predecessor's performance on complex tasks. It is available to paid Claude users, Claude Code, API, Amazon Bedrock, and Google Cloud's ...
Claude Opus 4.1 scores 74.5% on the SWE-bench Verified benchmark, indicating major improvements in real-world programming, bug detection, and agent-like problem solving.
In May, AI firm Anthropic introduced its Claude 4 family of models with a focus on improvements to coding, reasoning, and following instructions. Three months later, Anthropic is back with Claude ...