Claude Sonnet 4.5 achieved top scores on the SWE-bench Verified evaluation, which tests real-world software coding skills.
Google DeepMind unveils CodeMender, an AI agent that autonomously patches software vulnerabilities - SiliconANGLE ...