Tech/SaaS September 29, 2025

Anthropic says its new AI model “maintained focus” for 30 hours on multistep tasks

Benj Edwards
1 views
Anthropic says its new AI model “maintained focus” for 30 hours on multistep tasks

On Monday, Anthropic released Claude Sonnet 4.5, a new AI language model the company calls its "most capable model to date," with improved coding and computer use capabilities. The company also revealed Claude Code 2.0, a command-line AI agent for developers, and the Claude Agent SDK, which is a tool developers can use to build their own AI coding agents. Anthropic says it has witnessed Sonnet 4.5 working continuously on the same project "for more than 30 hours on complex, multi-step tasks," though the company did not provide specific details about the tasks. In the past, agentic models have been known to typically lose coherence over long periods of time as errors accumulate and context windows (a type of short-term memory for the model) fill up. In the past, Anthropic has mentioned that previous Claude 4.0 models have played Pokémon for over 24 hours or refactored code for seven hours. To understand why Sonnet exists, you need to know a bit about how AI language models work. Traditionally, Anthropic has produced three differently sized AI models in the Claude family: Haiku (the smallest), Sonnet (mid-range), and Opus (the largest). Anthropic last updated Haiku in November 2024 (to 3.5), Sonnet this past May (to 4.0), and Opus in August (to 4.1). Model size in parameters, which are values stored in its neural network, is roughly proportional to overall contextual depth (the number of multidimensional connections between concepts, which you might call "knowledge") and better problem-solving capability, but larger models are also slower and more expensive to run. So AI companies always seek a sweet spot in the middle with reasonable performance-cost trade-offs. Claude Sonnet has filled that role for Anthropic quite well for several years now.Read full article Comments

Advertisement

Related Articles
AI tools I wish existed

Article URL: https://sharif.io/28-ideas-2025 Comments URL: https://news.ycombinator.com/item?id=45421812 Points: 6 # Comments: 0

1 day, 20 hours ago 2
Notion Capital raises $130M growth fund to tackle …

The growth fund is nearly twice the size of its previous one.

1 day, 20 hours ago 2
Hiring only senior engineers is killing companies

Article URL: https://workweave.dev/blog/hiring-only-senior-engineers-is-killing-companies Comments URL: https://news.ycombinator.com/item?id=45421564 Points: 104 # Comments: 102

1 day, 21 hours ago 2
Show HN: Devbox – Containers for better dev …

I've been frustrated with dependency hell and clutter on my VPS from dev, so I …

1 day, 22 hours ago 2
Awakening Bell

Article URL: https://awakeningbell.org/ Comments URL: https://news.ycombinator.com/item?id=45421067 Points: 12 # Comments: 0

1 day, 22 hours ago 2
FAA decides it trusts Boeing enough to certify …

Article URL: https://www.theregister.com/2025/09/29/faa_decides_it_trusts_boeing/ Comments URL: https://news.ycombinator.com/item?id=45420327 Points: 113 # Comments: 54

2 days ago 2
Tech/SaaS Stats

493

Total Articles

1

Views

Advertisement