Anthropic's Claude Sonnet 4.5 now scores 77% on a key software engineering benchmark and can work autonomously for over 30 ...
One of the hottest markets in the artificial intelligence industry is selling chatbots that write computer code.
Now, Claude Sonnet 4.5 has lapped that last model, outperforming it on the SWE-bench Verified evaluation, a human-filtered subset of SWE-bench. Claude Sonnet 4.5 also outperformed leading models ...
The user interface of Codex CLI is less intuitive, but IDE extensions like Open Agents and Codeexia can enhance usability, ...
Hands on with GitHub’s open-source toolkit for steering AI coding agents by combining detailed specifications and a human in ...
Another key competitor is Graphite, which secured $52 million in funding in March. Graphite benefits from a close partnership ...
The new prime editors are about as efficient as their predecessors but make up to 60-fold fewer ‘indel’ mistakes.