THE DECODER

News, business insights, and research updates on artificial intelligence

ElevenLabs and Google Dominate Artificial Analysis' Updated Speech-to-Text Benchmark

Artificial Analysis released version 2.0 of its AA‑WER speech‑to‑text benchmark, ranking ElevenLabs' Scribe v2 as the most accurate model with a 2.3% word error rate. Google’s Gemini 3 Pro follows at 2.9% and Mistral’s Voxtral Small at 3.0%, while OpenAI’s Whisper Large v3 sits at 4.2%. In the specialized AA‑AgentTalk test for voice‑assistant queries, Scribe v2 and Gemini 3 Pro again lead with error rates of 1.6% and 1.7% respectively. The results highlight rapid gains in multimodal AI transcription without dedicated training.

By THE DECODER

News•Mar 1, 2026

Moltbook's Alleged AI Civilization Is Just a Massive Void of Bloated Bot Traffic

Researchers from the University of Maryland and MBZUAI conducted the first large‑scale study of Moltbook, a Reddit‑style platform populated solely by over 2.6 million autonomous LLM agents. Analyzing 290 000 posts and 1.8 million comments, they found the AI community to be socially...

By THE DECODER

News•Mar 1, 2026

The Pentagon-OpenAI-Anthropic Fallout Comes Down to Three Words: "All Lawful Use"

OpenAI signed a Pentagon contract within hours of Anthropic being barred from federal use, agreeing to provide its models for “all lawful purposes” while drawing three red‑line restrictions on domestic mass surveillance, autonomous weapons, and high‑risk automated decisions. The agreement’s...

By THE DECODER

News•Feb 28, 2026

Even Frontier LLMs From GPT-5 Onward Lose up to 33% Accuracy when You Chat Too Long

Researchers led by Philippe Laban evaluated frontier large language models from GPT‑5 onward across six diverse tasks and found that spreading a request over multiple conversation turns reduces accuracy by up to 33%. While newer models shrink the degradation from...

By THE DECODER

News•Feb 28, 2026

Current Language Model Training Leaves Large Parts of the Internet on the Table

Researchers from Apple, Stanford, and the University of Washington discovered that the choice of HTML extraction tool dramatically influences which web pages enter large language model training sets. Their analysis of three popular extractors—Resiliparse, Trafilatura, and JusText—found that only 39%...

By THE DECODER

News•Feb 27, 2026

Claude Code Now Remembers Your Fixes, Your Preferences, and Your Project Quirks on Its Own

Claude Code introduced an auto‑memory feature that automatically records debugging patterns, project context, and user preferences in a per‑project MEMORY.md file. The system recalls these details in subsequent sessions, eliminating the need for manual logging or the /init command. The...

By THE DECODER

News•Feb 26, 2026

Suno Investor Admits She Ditched Spotify for AI Music, Accidentally Undermining the Company's Fair Use Defense

Suno, the AI‑generated music platform, has reached $300 million in annualized revenue and 2 million paying subscribers in under two years. Investor C.C. Gong publicly said she shifted most of her listening from Spotify to Suno, claiming AI music offers a personalized, infinite...

By THE DECODER

News•Feb 24, 2026

Claude Can Now Jump Between Excel and PowerPoint on Its Own

Anthropic announced that Claude can now switch autonomously between Excel and PowerPoint, allowing users to run data analyses and instantly generate presentation decks. The capability is released as a research preview on all paid plans. At the same time, Anthropic...

By THE DECODER

News•Feb 24, 2026

Inception Launches Mercury 2, the First Diffusion-Based Language Reasoning Model

Inception Labs unveiled Mercury 2, the first diffusion‑based language reasoning model, claiming dramatic speed and cost advantages over leading models. The model generates 1,009 tokens per second with 1.7‑second end‑to‑end latency, beating Gemini 3 Flash and Claude Haiku on latency while delivering comparable benchmark...

By THE DECODER

News•Feb 24, 2026

Deepmind Suggests AI Should Occasionally Assign Humans Busywork so We Do Not Forget How to Do Our Jobs

DeepMind researchers propose an "intelligent AI delegation" framework to govern how autonomous AI agents assign tasks to each other and to humans. The model adapts organizational theory, treating AI delegation as a principal‑agent problem and emphasizing verifiable outcomes, decentralized smart‑contract...

By THE DECODER

News•Feb 24, 2026

OpenAI Ships API Upgrades Targeting Voice Reliability and Agent Speed for Developers

OpenAI released two API upgrades for developers: the gpt‑realtime‑1.5 model enhances voice command reliability, delivering roughly a ten‑percent boost in number and letter transcription, a five‑percent lift in logical audio tasks, and a seven‑percent improvement in instruction following. The audio...

By THE DECODER

News•Feb 23, 2026

Anthropic Accuses Deepseek, Moonshot, and MiniMax of Stealing Claude's AI Data Through 16 Million Queries

Anthropic has uncovered a coordinated distillation attack by three Chinese AI labs—Deepseek, Moonshot AI, and MiniMax—targeting its Claude model. Over 24,000 fabricated accounts generated more than 16 million queries to extract reasoning, programming, and tool‑usage capabilities. The labs employed proxy services...

By THE DECODER

News•Feb 23, 2026

OpenAI Wants to Retire the AI Coding Benchmark that Everyone Has Been Competing On

OpenAI announced that the SWE‑bench Verified coding benchmark has lost its credibility, citing that roughly 59.4% of its tasks are flawed and enforce overly specific implementation details. The company also highlighted data contamination, noting that leading models such as GPT‑5.2,...

By THE DECODER

News•Feb 22, 2026

ChatGPT and Gemini Voice Bots Are Easy to Trick Into Spreading Falsehoods

Newsguard evaluated the audio output of OpenAI’s ChatGPT Voice, Google’s Gemini Live, and Amazon’s Alexa+ by feeding each bot 20 false claims across health, politics, and world news. In neutral prompts, ChatGPT and Gemini reproduced falsehoods about 22‑23 percent of the...

By THE DECODER

Deals•Jan 30, 2026

OpenAI Announces Plans for Late‑2026 IPO

OpenAI disclosed that it is preparing for an initial public offering in the fourth quarter of 2026, targeting a valuation of $830 billion and a raise of over $100 billion. The startup is in informal talks with Wall Street banks and has...

THE DECODER

Deals•Jan 1, 2026

Baidu's Kunlunxin Files Confidential Hong Kong IPO, Valued at $3B

Baidu's AI chip division Kunlunxin has confidentially filed for an IPO in Hong Kong, submitting its application on Jan 1. A recent financing round values the unit at roughly $3 billion, though the final offering size remains undetermined. The filing adds Kunlunxin...

THE DECODER

Deals•Jan 1, 2026

Moonshot AI Secures $500M Series C Funding Led by IDG

Chinese AI startup Moonshot AI announced a $500 million Series C round that values the company at $4.3 billion. The round was led by IDG with $150 million and included Alibaba, Tencent and individual investor Wang Huiwen. The capital will fund Kimi‑K3 development and...

THE DECODER
