Anthropic Admits It Dumbed Down Claude when Trying to Make It Smarter

The Register – AI/ML (data-related)
Apr 23, 2026

Why It Matters

The missteps reduced user productivity and highlighted the fragility of rapid AI model tweaks, underscoring the need for rigorous testing in generative AI deployments. Restoring performance helps retain enterprise confidence in Claude as a coding assistant and agent platform.

Key Takeaways

  • Default effort level lowered, causing shallower reasoning
  • Cache bug cleared session data each turn, making output repetitive
  • System-prompt limit reduced verbosity, dropping performance by three percent
  • Reversions restored high effort, fixed cache, and removed prompt limits
  • Anthropic pledges stricter testing and transparent user communication

Pulse Analysis

Anthropic’s recent admission sheds light on the challenges of iterating on large language models in production. In early March the company reduced Claude Code’s default reasoning effort from high to medium, aiming to cut latency. The trade‑off backfired: developers reported shallower analyses and more superficial code suggestions. By April 7 Anthropic had reset the default to “xhigh,” acknowledging that users value depth over speed for complex programming tasks.

A second issue stemmed from a cache‑optimization update intended to free idle session data after an hour. The patch mistakenly cleared cached tokens after every turn, causing Claude to lose context and repeat earlier statements. This regression was fixed on April 10 for both Opus 4.6 and Sonnet 4.6, but not before customers experienced a noticeable dip in coherence, especially in multi‑step coding workflows.

The third change introduced stricter length limits in the system prompt, trimming tool‑call text to 25 words and final answers to 100 words. Internal tests initially missed the resulting three‑percent performance drop; once it surfaced, the change was rolled back on April 20.
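The cache regression is easier to picture with a toy model. The sketch below is purely illustrative and is not Anthropic's implementation: the `SessionCache` class, the two eviction functions, and `run_session` are hypothetical names, and only the one-hour idle window comes from the report. It contrasts an eviction rule that fires after an hour of inactivity with one that fires on every turn.

```python
from dataclasses import dataclass, field

# One hour, as described in the article. Everything else here is hypothetical.
IDLE_TTL_SECONDS = 3600


@dataclass
class SessionCache:
    tokens: list = field(default_factory=list)
    last_used: float = 0.0

    def add_turn(self, text: str, now: float) -> None:
        # Store the turn's words as stand-ins for cached tokens.
        self.tokens.extend(text.split())
        self.last_used = now


def evict_if_idle(cache: SessionCache, now: float) -> None:
    """Intended rule: drop cached context only after an hour of inactivity."""
    if now - cache.last_used >= IDLE_TTL_SECONDS:
        cache.tokens.clear()


def evict_every_turn(cache: SessionCache, now: float) -> None:
    """Buggy rule: ignores idle time and clears the cache on every call."""
    cache.tokens.clear()


def run_session(evict, turn_times) -> int:
    """Replay turns at the given timestamps; return how many tokens remain cached."""
    cache = SessionCache()
    for i, t in enumerate(turn_times):
        evict(cache, t)
        cache.add_turn(f"turn{i} context", t)
    return len(cache.tokens)


if __name__ == "__main__":
    active = [0, 60, 120]  # three turns, one minute apart
    print(run_session(evict_if_idle, active))     # context accumulates
    print(run_session(evict_every_turn, active))  # only the latest turn survives
```

With the buggy rule, only the most recent turn is ever cached, which matches the reported symptom of lost context and repeated statements in multi‑step sessions.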

The episode illustrates a broader industry tension between rapid feature rollout and model reliability. Enterprises relying on AI‑driven development tools expect consistent output quality; any perceived degradation can erode trust and slow adoption. Anthropic’s commitment to more rigorous A/B testing, a dedicated @ClaudeDevs channel, and clearer user communication aims to rebuild confidence. As generative AI becomes integral to software engineering pipelines, transparent governance and robust validation will be critical differentiators for providers competing in this fast‑moving market.
