
Claude Code's New Ultraplan Feature Moves Task Planning to the Cloud
Anthropic introduced Ultraplan, a new feature for Claude Code that moves the programming task planning phase to the cloud. Developers launch a planning job in the terminal while the plan is generated on the Claude Code web interface, allowing the terminal to stay free for other work. The web UI adds inline comments, emoji reactions, revision requests, and lets users execute the finished plan either in the browser or back in the terminal. Ultraplan requires a Claude Code web account, a linked GitHub repository, and version 2.1.91 or later, and it is offered as a preview to activated users.

Deepmind CEO Hassabis Says AGI Will Hit Like Ten Industrial Revolutions Compressed Into a Single Decade
DeepMind CEO Demis Hassabis told the 20VC podcast that artificial general intelligence could arrive within the next five years, delivering an impact equivalent to ten industrial revolutions compressed into a single decade. He described current systems as “jagged intelligences” that...

LLMs Crush Coding and Math but Choke on Casual Questions, and That's Not a Contradiction
Andrej Karpathy notes a stark split in large language model performance: free‑tier ChatGPT often falters on trivial everyday queries, while premium models such as OpenAI's GPT‑5.4 Thinking and Claude Opus 4.6 excel at complex coding and math tasks, even autonomously restructuring...

OpenAI Is Building a Cybersecurity Product for a Select Group of Companies
OpenAI is developing a cybersecurity product that will be offered only to a select group of companies through its Trusted Access for Cyber pilot. The offering, tied to the GPT‑5.3‑Codex model, provides highly capable AI tools for defensive security tasks...

OpenAI Halves Its Pro Price to $100 for Heavy Codex Users, Undercuts Anthropic and Google
OpenAI introduced a new Pro subscription at $100 per month, halving the price of its previous $200 tier and targeting heavy users of its Codex programming tool. The plan delivers up to five times the Codex usage allowance of the...

Google Gemini Now Generates Interactive Visualizations You Can Tweak and Explore Right in the Chat
Google Gemini now generates interactive visualizations directly within its chat interface, letting users tweak variables, rotate 3D models, and explore data on the fly. The feature is accessible through the Gemini Pro model on gemini.google, triggered by prompts such as...

New Stanford Study Reveals when Teaming up AI Agents Is Worth the Compute
A new Stanford study challenges the prevailing belief that multi‑agent AI systems are inherently superior. By matching compute budgets, the researchers found that a single, well‑scaled model performs as well as—or better than—team configurations across two multi‑step reasoning benchmarks. The...

Zhipu AI's GLM-5.1 Can Rethink Its Own Coding Strategy Across Hundreds of Iterations
Zhipu AI released GLM-5.1 under an MIT license, an open‑weight model that can self‑revise its coding strategy across hundreds of iterations. In internal tests it generated 21,500 queries per second on a vector‑database benchmark—a six‑fold improvement over Claude Opus 4.6—and delivered...

Meta's Muse Spark Is Its First Frontier Model and Its First without Open Weights
Meta’s Superintelligence Labs unveiled Muse Spark, the company’s first frontier‑scale AI model that is not open‑weight. The multimodal reasoning system delivers top‑5 benchmark scores, rivaling OpenAI’s GPT‑5.4, Google’s Gemini 3.1 and Anthropic’s Claude Opus. Meta claims a new pretraining stack provides more...

Stability AI Launches Brand Studio for Brand-Consistent Image Generation
Stability AI, the creator of the open‑source Stable Diffusion model, is pivoting toward commercial offerings with the launch of Brand Studio. The platform provides a "Brand Central" hub where creative teams can train custom, brand‑aligned image models and build reusable...

One in Four Quotes in AI Chatbot Responses Comes From Journalism, Muckrack Study Finds
Muckrack analyzed 15 million AI‑generated quotes from Gemini, Perplexity, Claude and ChatGPT and found that roughly one‑quarter of the citations come from journalistic sources. Reuters, Forbes and The Guardian are the most frequently referenced outlets, while former Business Insider editor Henry Blodget...

Nudifying Bots, Deepfakes, and Automated Archives: How AI Powers a Monetized Abuse Ecosystem on Telegram
A new AI Forensics report examined 2.8 million Telegram messages from Italy and Spain, revealing a thriving ecosystem that uses AI‑powered nudifying bots to create synthetic non‑consensual intimate images. The analysis found the term “bot” 16,232 times, with nearly half of...

Microsoft's Bing Team Open-Sources "Harrier" Embedding Model
Microsoft’s Bing team has open‑sourced an embedding model called Harrier, available in three sizes up to 27 billion parameters. The model supports more than 100 languages, offers a 32,000‑token context window, and was trained on over two billion examples plus synthetic GPT‑5...

China Actively Targeting Taiwan's Chip Talent and Technology, Security Report Says
Taiwan’s National Security Bureau warned that Beijing is intensifying efforts to lure semiconductor engineers and intellectual property from the island. The campaign targets senior chip designers, researchers, and supply‑chain specialists with attractive salaries and research grants. By siphoning talent, China...

Bezos' Project Prometheus Hires xAI Co-Founder From OpenAI
Jeff Bezos' AI venture Project Prometheus has recruited Kyle Kosic, co‑founder of Elon Musk’s xAI and former OpenAI infrastructure lead. Kosic will head AI infrastructure, bringing experience from xAI’s Colossus supercomputer. The startup, founded by Bezos and ex‑Google executive Vikram...

Meta Plans to Open-Source Parts of Its New AI Models
Meta announced it will open‑source portions of its next‑generation AI models, the first under CEO Alexandr Wang, who arrived via a roughly $15 billion Scale AI partnership. While smaller variants will be released to the public, the largest models remain proprietary...

Meta Employees Compete for Token Consumption on an Internal AI Leaderboard
Meta has created an internal “Claudeonomics” leaderboard that records AI token consumption for over 85,000 employees. In the first month, staff collectively burned roughly 60 trillion tokens, with the top user averaging 281 billion tokens daily. The gamified titles such as “Token...

Anthropic Signs Multi-Gigawatt TPU Deal with Google and Broadcom
Anthropic has struck a multi‑gigawatt TPU agreement with Google and Broadcom, with the hardware slated to be deployed in the United States beginning in 2027. The deal reflects surging demand, as the company’s annualized revenue now tops $30 billion, up from...

OpenAI's Safety Brain Drain Finally Gets an Explanation and It's Just Sam Altman's Vibes
OpenAI has dismantled its dedicated AI‑safety teams, prompting a wave of departures that helped spawn rival Anthropic. In a New Yorker profile, CEO Sam Altman attributes the exodus to a cultural mismatch, emphasizing rapid product development over traditional safety caution....

Less Work, Equal Pay: OpenAI Lays Out Its Vision for a World Reshaped by Superintelligence
OpenAI released a policy paper titled "Industrial Policy for the Intelligence Age" outlining early proposals for governments to manage the transition to superintelligence. The document suggests a public wealth fund that distributes AI‑driven returns to all citizens, a four‑day workweek...

OpenAI Reveals 600,000 Weekly Health Queries From Hospital Deserts as Seven in Ten Come After Hours
OpenAI disclosed that roughly 600,000 weekly health‑related queries come from U.S. residents living in “hospital deserts,” where the nearest hospital is at least a 30‑minute drive away. Overall, Americans send about two million messages per week to ChatGPT about health insurance,...

AI Benchmarks Systematically Ignore How Humans Disagree, Google Study Finds
Google Research and Rochester Institute of Technology examined how AI benchmarks handle human disagreement. Their study shows the common practice of using three to five annotators per test item often fails to produce reproducible model comparisons. By simulating thousands of...

AI Chatbot Traffic Grows Seven Times Faster than Social Media but Still Trails by a Factor of Four
Similarweb’s latest analysis shows AI chatbot platforms attracted 9.3 billion visits, a figure that is four times lower than social media’s 41 billion. However, chatbot traffic grew 44.4% year‑over‑year, outpacing social media’s 6.3% growth by a factor of seven. The audience demographics...

Alibaba's Qwen Team Makes AI Models Think Deeper with New Algorithm
Alibaba’s Qwen team introduced Future‑KL Influenced Policy Optimization (FIPO), a reinforcement‑learning algorithm that weights each token by its downstream impact on reasoning. By assigning credit more precisely, FIPO extends chain‑of‑thought lengths from roughly 4,000 to over 10,000 tokens. On the...

Know3D Lets Users Control the Hidden Back Side of 3D Objects with Text Prompts
Researchers from several Chinese universities introduced Know3D, a system that lets users shape the hidden backside of a 3D object using natural‑language prompts. The approach bridges a large language model, an image‑generation model, and Microsoft’s Trellis.2 3D generator, extracting intermediate...

Anthropic Says Claude Code's Usage Drain Comes Down to Peak-Hour Caps and Ballooning Contexts
Anthropic investigated why users of its Claude Code model were exhausting usage limits faster than anticipated. The company identified two primary drivers: stricter token caps during peak‑hour periods and the growth of 1‑million‑token context windows that dramatically increase consumption. Bugs...

OpenAI Shifts to Usage-Based Pricing for Codex in ChatGPT Business Plans
OpenAI announced a shift to usage‑based pricing for its Codex model within ChatGPT Business and Enterprise plans, eliminating upfront seat licenses. Administrators can now enable free Codex access across a workspace and pay only for the compute actually consumed, with...

Sakana AI Launches "Ultra Deep Research" To Automate Weeks of Strategy Work
Japanese AI startup Sakana AI introduced Sakana Marlin, its first enterprise‑focused product that autonomously researches a topic for up to eight hours and delivers a full written report plus presentation slides. The system combines the company’s AI Scientist, which resolves...

Microsoft's MAI-Transcribe-1 Runs 2.5x Faster than Its Predecessor at $0.36 per Audio Hour
Microsoft unveiled MAI‑Transcribe‑1, a speech‑to‑text model that covers 25 languages and sets a new low on the FLEURS benchmark. The system delivers a 2.5‑times speed boost over the previous Azure Fast offering while charging just $0.36 per audio hour. It...
AI Models Fail at Robot Control without Human-Designed Building Blocks but Agentic Scaffolding Closes the Gap
Researchers from Nvidia, UC Berkeley, Stanford and CMU introduced CaP‑X, an open‑access framework that evaluates how large language models control robots via self‑written code. Testing twelve frontier models—including Gemini‑3‑Pro, GPT‑5.2 and Claude Opus 4.5—across seven manipulation tasks revealed that without high‑level...

Google Deepmind Study Exposes Six "Traps" That Can Easily Hijack Autonomous AI Agents in the Wild
Google DeepMind’s new paper defines six “AI agent traps” that exploit the perception, reasoning, memory, action, multi‑agent dynamics, and human‑in‑the‑loop stages of autonomous agents. The study shows real‑world proof‑of‑concept attacks, from hidden HTML instructions to coordinated multi‑agent flash‑crash scenarios. Researchers...

EU Bars AI-Generated Content From Official Communications, According to Politico
The European Commission, Parliament and Council have banned staff from using fully AI‑generated videos or images in official communications, allowing AI only for tasks like image‑quality enhancement. Officials say the rule protects authenticity and citizen trust. Experts argue the blanket...

Oracle Reportedly Lays Off Thousands of Employees to Bankroll Its Massive AI Infrastructure Bet
Oracle announced a massive workforce reduction, targeting 20,000 to 30,000 positions, to free up roughly $10 billion in cash flow for its AI infrastructure push. The cuts follow a $50 billion capital raise plan that has left the company in debt and...

Anthropic Accidentally Publishes Claude Code Source Code for Anyone to Find
Anthropic unintentionally released over 500,000 lines of Claude Code source on the public NPM registry, exposing more than 1,000 internal files. The leak, attributed to human error rather than a security flaw, included details on unreleased models and features. No...

Qwen3.5-Omni Learned to Write Code From Spoken Instructions and Video without Anyone Training It To
Alibaba unveiled Qwen3.5-Omni, an omnimodal AI model that handles text, images, audio, and video, boasting a 256,000‑token context window and the ability to process over ten hours of audio or 400 seconds of 720p video. The Plus variant set new...

AI Models Confidently Describe Images They Never Saw, and Benchmarks Fail to Catch It
A new study reveals that leading multimodal AI models—including GPT‑5 series, Gemini 3 Pro, and Claude Opus 4.5—confidently generate visual descriptions and medical diagnoses despite never receiving an image, achieving 60‑90% correctness in a text‑only benchmark called Phantom‑0. When tested on established visual‑understanding benchmarks,...

MetaClaw Framework Trains AI Agents While You're in Meetings by Checking Your Google Calendar
Researchers from UNC‑Chapel Hill, Carnegie Mellon, UC Santa Cruz and UC Berkeley introduced MetaClaw, a framework that continuously improves AI agents by learning from mistakes and fine‑tuning during idle times. The system uses an Opportunistic Meta‑Learning Scheduler that watches sleep...

Google's New Gemini API Agent Skill Patches the Knowledge Gap AI Models Have with Their Own SDKs
Google introduced an Agent Skill for the Gemini API that injects live SDK documentation and sample code into the model, eliminating the knowledge gap that plagues AI coding assistants. In a benchmark of 117 tasks, Gemini 3.1 Pro Preview’s success rate surged from...

Meta's Hyperagents Improve at Tasks and Improve at Improving
Meta, the University of British Columbia and collaborators introduced "hyperagents," AI systems that can rewrite both their task‑solving code and the underlying improvement mechanism. Built on the Darwin Gödel Machine framework, the new DGM‑H architecture lets the meta‑agent self‑modify, breaking...

Cohere Releases Open Source Model that Tops Speech Recognition Benchmarks
Cohere has launched Transcribe, an open‑source automatic speech recognition model that now leads the Hugging Face Open ASR Leaderboard with a 5.42% word error rate. The 2 billion‑parameter system also records the highest throughput, processing audio 525 times faster than real...

Suno 5.5 Lets Users Sing Their Own AI-Generated Songs with a Personalized Voice Feature
Suno has rolled out version 5.5 of its AI music generator, branding it the most expressive model yet. The upgrade adds a Voices feature that lets Pro and Premier users record or upload their own singing voice, with a verification step...

OpenAI CEO Sam Altman Reportedly Teases a "Very Strong" Model Internally that Can "Really Accelerate the Economy"
OpenAI has completed pre‑training its next‑gen AI model, codenamed “Spud,” and CEO Sam Altman told staff it will be a “very strong” system ready in a few weeks, aimed at accelerating the economy. The company is reallocating compute by shutting...

OpenAI Expands Its Record Funding Round to over $120 Billion as It Eyes a Potential IPO Later This Year
OpenAI announced an additional $10 billion injection, pushing its record financing round beyond $120 billion. The expanded round brings in new backers such as Andreessen Horowitz, D.E. Shaw Ventures, MGX, TPG and T. Rowe Price, while Microsoft remains a key investor. CFO Sarah Friar hinted...

Popular AI Proxy LiteLLM Got Hacked with Malware that Spreads Through Kubernetes Clusters
Open‑source AI proxy library LiteLLM was compromised on PyPI, with versions 1.82.7 and 1.82.8 containing malware. The malicious code steals SSH keys, cloud credentials, database passwords, and Kubernetes configurations, encrypts them, and exfiltrates data to an external server while propagating...

Google Deepmind's Gemini 3.1 Flash-Lite Generates Websites Almost in Real Time
Google DeepMind unveiled Gemini 3.1 Flash‑Lite, a generative AI that builds webpages live from text prompts, effectively acting as a pseudo‑browser. The model delivers its first token 2.5 times faster than Gemini 2.5 Flash and processes over 360 tokens per...

Google Brings AI-Powered Dark Web Analysis to Enterprise Security Teams
Google Cloud announced at RSA 2026 an AI‑driven agent called “Triage and Investigation” within its Security Operations platform, automating alert review and reducing false positives for SOC analysts. The same rollout includes an AI‑powered dark‑web analysis tool that sifts through...

OpenAI Wants UK Regulators to Treat ChatGPT as a Google Search Alternative
OpenAI is urging the UK Competition and Markets Authority to list ChatGPT alongside Google on the CMA’s proposed "choice screens" for Android and Chrome users. The regulator previously designated Google as a strategic market player in search and is considering...

Xiaomi Launches Three MiMo AI Models to Power Agents, Robots, and Voice
Xiaomi’s MiMo team unveiled three new AI models—MiMo‑V2‑Pro, MiMo‑V2‑Omni, and MiMo‑V2‑TTS—aimed at powering agents, multimodal robotics, and expressive speech synthesis. The flagship MiMo‑V2‑Pro features a trillion‑parameter mixture‑of‑experts architecture with 42 billion active weights per request and a one‑million‑token context window, ranking...

Andrej Karpathy Says Humans Are Now the Bottleneck in AI Research with Easy-to-Measure Results
Andrej Karpathy spent months hand‑tuning a GPT‑2 training pipeline before handing it to an autonomous search agent for a single night. The agent uncovered fine‑grained adjustments that humans missed, demonstrating that systematic searches can outperform intuition when objective metrics exist....

OpenAI Publishes a Prompting Playbook that Helps Designers Get Better Frontend Results From GPT-5.4
OpenAI released a prompting playbook to help frontend designers generate higher‑quality UX/UI with its GPT‑5.4 model. The guide stresses defining a design system—colors, typography, layout—and supplying real content and visual references to avoid generic outputs. It also outlines hard rules...