VentureBeat

VentureBeat

Publication
1 followers

AI/data/automation with enterprise finance implications

Microsoft's New AI Training Method Eliminates Bloated System Prompts without Sacrificing Model Performance
NewsFeb 27, 2026

Microsoft's New AI Training Method Eliminates Bloated System Prompts without Sacrificing Model Performance

Microsoft researchers introduced On‑Policy Context Distillation (OPCD), a training framework that embeds lengthy system prompts directly into a model’s parameters. By having the student model learn from its own generation trajectories under a teacher’s real‑time guidance, OPCD eliminates the need...

By VentureBeat
Google's Nano Banana 2 Takes Aim at the Production Cost Problem That's Kept AI Image Gen Out of Enterprise Workflows
NewsFeb 26, 2026

Google's Nano Banana 2 Takes Aim at the Production Cost Problem That's Kept AI Image Gen Out of Enterprise Workflows

Google DeepMind unveiled Nano Banana 2, a Gemini 3.1 Flash Image model that delivers Pro‑level text rendering, subject consistency, and image search at roughly half the cost of the Nano Banana Pro tier. The new offering reduces per‑image pricing to...

By VentureBeat
ServiceNow Resolves 90% of Its Own IT Requests Autonomously. Now It Wants to Do the Same for Any Enterprise
NewsFeb 26, 2026

ServiceNow Resolves 90% of Its Own IT Requests Autonomously. Now It Wants to Do the Same for Any Enterprise

ServiceNow reports that it resolves 90% of its own employee IT requests autonomously, delivering solutions up to 99% faster than human agents. The company unveiled an Autonomous Workforce framework, the EmployeeWorks product, and a "role automation" architecture to extend this...

By VentureBeat
Perplexity Launches 'Computer' AI Agent that Coordinates 19 Models, Priced at $200 a Month
NewsFeb 26, 2026

Perplexity Launches 'Computer' AI Agent that Coordinates 19 Models, Priced at $200 a Month

Perplexity, valued at $20 billion, launched Computer, a cloud‑based AI agent that coordinates 19 specialized models to execute complex workflows. The service is currently available only to Perplexity Max subscribers at $200 per month and promises autonomous task decomposition and model...

By VentureBeat
Visual Imitation Learning: Guidde Trains AI Agents on Human 'Expert Video' Instead of Documentation
NewsFeb 25, 2026

Visual Imitation Learning: Guidde Trains AI Agents on Human 'Expert Video' Instead of Documentation

Guidde, an Israeli AI Digital Adoption Platform, announced a $50 million Series B round led by PSG Equity to expand its video‑ground‑truth approach for training both human users and autonomous agents. The platform captures every click, scroll and DOM change during screen...

By VentureBeat
Kilo Launches KiloClaw, Allowing Anyone to Deploy Hosted OpenClaw Agents Into Production in 60 Seconds
NewsFeb 24, 2026

Kilo Launches KiloClaw, Allowing Anyone to Deploy Hosted OpenClaw Agents Into Production in 60 Seconds

Kilo has launched KiloClaw, a fully managed service that provisions a production‑ready OpenClaw agent in under 60 seconds, removing the need for SSH, Docker, or YAML setup. The platform runs on multi‑tenant VMs hosted by Fly.io, providing enterprise‑grade isolation, security...

By VentureBeat
How Smarsh Built an AI Front Door for Regulated Industries — and Drove 59% Self-Service Adoption
NewsFeb 24, 2026

How Smarsh Built an AI Front Door for Regulated Industries — and Drove 59% Self-Service Adoption

Smarsh deployed an AI‑powered support agent, Archie, on Salesforce Agentforce 360 to create a unified front‑door for regulated‑industry customers. The system lets users describe needs in plain language, routing them to the right solution and reducing navigation friction. Early results...

By VentureBeat
Anthropic Says DeepSeek, Moonshot, and MiniMax Used 24,000 Fake Accounts to Rip Off Claude
NewsFeb 23, 2026

Anthropic Says DeepSeek, Moonshot, and MiniMax Used 24,000 Fake Accounts to Rip Off Claude

Anthropic disclosed that three Chinese AI labs—DeepSeek, Moonshot AI and MiniMax—used roughly 24,000 fraudulent accounts to conduct over 16 million interactions with its Claude models, targeting reasoning, coding and tool‑use capabilities. The coordinated distillation attacks extracted large‑scale training data, effectively stealing...

By VentureBeat
Researchers Baked 3x Inference Speedups Directly Into LLM Weights — without Speculative Decoding
NewsFeb 23, 2026

Researchers Baked 3x Inference Speedups Directly Into LLM Weights — without Speculative Decoding

Researchers from Maryland, Livermore Lab, Columbia and TogetherAI introduced a multi‑token prediction (MTP) technique that embeds a special token into existing LLM weights, eliminating the need for separate drafting models. The method uses a self‑distillation student‑teacher training loop to...

By VentureBeat
Rapidata Emerges to Shorten AI Model Development Cycles From Months to Days with Near Real-Time RLHF
NewsFeb 19, 2026

Rapidata Emerges to Shorten AI Model Development Cycles From Months to Days with Near Real-Time RLHF

Rapidata, a startup, has built a platform that crowdsources RLHF feedback through mobile app users, turning ad slots into short annotation tasks. By tapping 15‑20 million global users, it can deliver up to 1.5 million annotations per hour, shrinking feedback loops from...

By VentureBeat
The 'Last-Mile' Data Problem Is Stalling Enterprise Agentic AI — 'Golden Pipelines' Aim to Fix It
NewsFeb 19, 2026

The 'Last-Mile' Data Problem Is Stalling Enterprise Agentic AI — 'Golden Pipelines' Aim to Fix It

Enterprise AI is hitting a ‘last‑mile’ data bottleneck as messy operational data hampers model inference. Empromptu’s ‘golden pipelines’ embed automated ingestion, cleaning, labeling and governance directly into the AI application workflow, shrinking data‑preparation cycles from weeks to under an hour....

By VentureBeat
New Agent Framework Matches Human-Engineered AI Systems — and Adds Zero Inference Cost to Deploy
NewsFeb 18, 2026

New Agent Framework Matches Human-Engineered AI Systems — and Adds Zero Inference Cost to Deploy

Researchers at UC Santa Barbara introduced Group‑Evolving Agents (GEA), a framework that evolves entire groups of AI agents instead of single individuals. By sharing a collective experience archive and using a reflection module, GEA combines innovations across agents, leading to...

By VentureBeat
SurrealDB 3.0 Wants to Replace Your Five-Database RAG Stack with One
NewsFeb 17, 2026

SurrealDB 3.0 Wants to Replace Your Five-Database RAG Stack with One

SurrealDB launched version 3.0 alongside a $23 million Series A extension, bringing total funding to $44 million. The new release consolidates relational, vector and graph capabilities into a single Rust‑native engine, letting AI agents store memory, business logic and multimodal data transactionally. By...

By VentureBeat
Nvidia, Groq and the Limestone Race to Real-Time AI: Why Enterprises Win or Lose Here
NewsFeb 15, 2026

Nvidia, Groq and the Limestone Race to Real-Time AI: Why Enterprises Win or Lose Here

The article argues that AI compute growth is shifting from GPU‑centric training to inference speed, with Groq’s Language Processing Unit (LPU) offering dramatically lower latency for reasoning‑heavy models. Nvidia, which has historically moved from gaming GPUs to generative AI, could...

By VentureBeat
'Observational Memory' Cuts AI Agent Costs 10x and Outscores RAG on Long-Context Benchmarks
NewsFeb 10, 2026

'Observational Memory' Cuts AI Agent Costs 10x and Outscores RAG on Long-Context Benchmarks

Mastra’s open‑source observational memory replaces dynamic retrieval with two background agents that compress conversation history into a dated observation log. The approach achieves 3‑6× compression for text and up to 40× for tool‑heavy outputs, keeping the context window stable and...

By VentureBeat
What AI Builders Can Learn From Fraud Models that Run in 300 Milliseconds
NewsFeb 9, 2026

What AI Builders Can Learn From Fraud Models that Run in 300 Milliseconds

Mastercard’s Decision Intelligence Pro (DI Pro) uses a sub‑300 ms recurrent neural network to assign risk scores to each payment transaction in real time. The platform treats fraud detection as an "inverse recommender" problem, comparing current merchant behavior to historical patterns. By...

By VentureBeat
Nvidia Releases DreamDojo, a Robot ‘World Model’ Trained on 44,000 Hours of Human Video
NewsFeb 9, 2026

Nvidia Releases DreamDojo, a Robot ‘World Model’ Trained on 44,000 Hours of Human Video

Nvidia unveiled DreamDojo, a robot world model trained on a 44,000‑hour human egocentric video dataset, enabling robots to acquire physical intuition by observation before hardware‑specific fine‑tuning. The DreamDojo‑HV dataset is 15× longer, contains 96× more skills and spans 2,000× more...

By VentureBeat
AI's GPU Problem Is Actually a Data Delivery Problem
NewsFeb 9, 2026

AI's GPU Problem Is Actually a Data Delivery Problem

Enterprises are spending billions on GPU clusters for AI, yet many GPUs sit idle because the data delivery layer between object storage and compute cannot keep pace. F5 argues that the real bottleneck is not the GPUs but the lack...

By VentureBeat
The Missing Layer Between Agent Connectivity and True Collaboration
NewsFeb 9, 2026

The Missing Layer Between Agent Connectivity and True Collaboration

Vijoy Pandey of Cisco Outshift and Stanford professor Noah Goodman argue that today’s AI agents can connect but cannot truly think together. They propose an "Internet of Cognition"—a three‑layer architecture of protocol, fabric, and cognition engines—to enable shared intent, knowledge,...

By VentureBeat
TrueFoundry Launches TrueFailover to Automatically Reroute Enterprise AI Traffic During Model Outages
NewsJan 21, 2026

TrueFoundry Launches TrueFailover to Automatically Reroute Enterprise AI Traffic During Model Outages

TrueFoundry unveiled TrueFailover, an autonomous resilience layer that detects AI provider outages, slowdowns, or quality drops and instantly reroutes enterprise traffic to backup models and regions. The system integrates multi‑model, multi‑region routing, degradation‑aware monitoring, and dynamic prompt adjustment to preserve...

By VentureBeat
Stop Calling It 'The AI Bubble': It's Actually Multiple Bubbles, Each with a Different Expiration Date
NewsJan 18, 2026

Stop Calling It 'The AI Bubble': It's Actually Multiple Bubbles, Each with a Different Expiration Date

The AI boom consists of three distinct layers—wrapper companies, foundation‑model providers, and infrastructure—each with its own risk profile and timeline. Wrapper startups that merely repackage APIs are expected to implode first, as large platforms absorb their functionality and margins evaporate....

By VentureBeat
Claude Code Just Got Updated with One of the Most-Requested User Features
NewsJan 15, 2026

Claude Code Just Got Updated with One of the Most-Requested User Features

Anthropic has rolled out a major update to Claude Code called MCP Tool Search, which introduces lazy loading of tool definitions. The change stops the model from pre‑loading every available tool, cutting token consumption by up to 85 percent. Early...

By VentureBeat
This New, Dead Simple Prompt Technique Boosts Accuracy on LLMs by up to 76% on Non-Reasoning Tasks
NewsJan 13, 2026

This New, Dead Simple Prompt Technique Boosts Accuracy on LLMs by up to 76% on Non-Reasoning Tasks

Google Research’s new paper reveals that simply repeating a user query—placing the prompt twice in the input—significantly lifts accuracy on non‑reasoning tasks, with gains as high as 76% across models such as Gemini, GPT‑4o, Claude and DeepSeek. The technique exploits...

By VentureBeat
Why Egnyte Keeps Hiring Junior Engineers Despite the Rise of AI Coding Tools
NewsJan 13, 2026

Why Egnyte Keeps Hiring Junior Engineers Despite the Rise of AI Coding Tools

Egnyte, a $1.5 billion cloud content governance firm, has deployed AI coding assistants such as Claude Code, Cursor, Augment and Gemini CLI across its 350‑plus developer workforce. Despite the automation hype, the company continues hiring junior engineers, using AI to accelerate onboarding, code...

By VentureBeat
DeepSeek’s Conditional Memory Fixes Silent LLM Waste: GPU Cycles Lost to Static Lookups
NewsJan 13, 2026

DeepSeek’s Conditional Memory Fixes Silent LLM Waste: GPU Cycles Lost to Static Lookups

DeepSeek introduced Engram, a conditional memory module that separates static pattern retrieval from dynamic reasoning in large language models. By allocating roughly 25% of sparse capacity to memory and 75% to computation, the system achieves O(1) lookups via hash tables...

By VentureBeat
Salesforce Rolls Out New Slackbot AI Agent as It Battles Microsoft and Google in Workplace AI
NewsJan 13, 2026

Salesforce Rolls Out New Slackbot AI Agent as It Battles Microsoft and Google in Workplace AI

Salesforce launched a rebuilt Slackbot AI agent for Business+ and Enterprise+ customers, powered by Anthropic’s Claude large language model and integrated with Salesforce records, Google Drive, calendars, and Slack history. Internally, 80,000 employees adopted it rapidly, achieving 96% satisfaction and...

By VentureBeat
Why Sakana AI’s Big Win Is a Big Deal for the Future of Enterprise Agents
NewsJan 13, 2026

Why Sakana AI’s Big Win Is a Big Deal for the Future of Enterprise Agents

Japanese startup Sakana AI’s coding agent ALE‑Agent captured first place in the AtCoder Heuristic Contest, outpacing more than 800 human competitors. The four‑hour run leveraged inference‑time scaling, generating, testing, and iterating hundreds of solutions. By introducing a "Virtual Power" concept,...

By VentureBeat
Nvidia Rubin's Rack-Scale Encryption Signals a Turning Point for Enterprise AI Security
NewsJan 12, 2026

Nvidia Rubin's Rack-Scale Encryption Signals a Turning Point for Enterprise AI Security

Nvidia unveiled the Vera Rubin NVL72 at CES 2026, a rack‑scale platform that encrypts every bus across 72 GPUs, 36 CPUs and the entire NVLink fabric, delivering the first fully confidential computing stack for AI workloads. The move addresses a...

By VentureBeat
How DoorDash Scaled without a Costly ERP Overhaul
NewsJan 12, 2026

How DoorDash Scaled without a Costly ERP Overhaul

DoorDash grew from a 2013 startup to a global local‑commerce leader while retaining its original Oracle NetSuite system. The company avoided a multi‑million‑dollar ERP migration, instead leveraging NetSuite’s cloud‑based scalability to support IPO, acquisitions, and expansion into grocery, convenience and...

By VentureBeat
Why Your LLM Bill Is Exploding — and How Semantic Caching Can Cut It by 73%
NewsJan 10, 2026

Why Your LLM Bill Is Exploding — and How Semantic Caching Can Cut It by 73%

A company saw its LLM API bill rise 30% month‑over‑month despite modest traffic growth. Analysis revealed that users asked the same questions in varied phrasing, causing duplicate LLM calls that exact‑match caching missed. By replacing text hashes with embedding‑based semantic...

By VentureBeat
Anthropic Cracks Down on Unauthorized Claude Usage by Third-Party Harnesses and Rivals
NewsJan 9, 2026

Anthropic Cracks Down on Unauthorized Claude Usage by Third-Party Harnesses and Rivals

Anthropic has deployed new technical safeguards that block third‑party harnesses spoofing its Claude Code client, disrupting open‑source tools like OpenCode and causing automatic account bans. The same enforcement also cut off rival labs such as xAI from using Claude models...

By VentureBeat
Orchestral Replaces LangChain’s Complexity with Reproducible, Provider-Agnostic LLM Orchestration
NewsJan 9, 2026

Orchestral Replaces LangChain’s Complexity with Reproducible, Provider-Agnostic LLM Orchestration

Orchestral AI launches a new Python framework that replaces the asynchronous complexity of tools like LangChain with a synchronous, type‑safe architecture aimed at reproducible research. The framework is provider‑agnostic, supporting OpenAI, Anthropic, Google Gemini, Mistral and local models via Ollama,...

By VentureBeat
How KPMG Is Redefining the Future of SAP Consulting on a Global Scale
NewsJan 9, 2026

How KPMG Is Redefining the Future of SAP Consulting on a Global Scale

KPMG has integrated SAP's conversational AI, Joule for Consultants, across 29 member firms, giving thousands of consultants real‑time access to SAP best practices. The tool streamlines documentation‑heavy SAP projects, accelerating design workshops and reducing reliance on manual knowledge retrieval. By...

By VentureBeat
Databricks' Instructed Retriever Beats Traditional RAG Data Retrieval by 70% — Enterprise Metadata Was the Missing Link
NewsJan 8, 2026

Databricks' Instructed Retriever Beats Traditional RAG Data Retrieval by 70% — Enterprise Metadata Was the Missing Link

Databricks unveiled the Instructed Retriever, a new architecture that claims up to a 70% boost over traditional Retrieval‑Augmented Generation (RAG) on complex, instruction‑heavy enterprise question‑answering tasks. The improvement stems from propagating full system specifications—user instructions, metadata schemas, and examples—through every...

By VentureBeat
MiroMind’s MiroThinker 1.5 Delivers Trillion-Parameter Performance From a 30B Model — at 1/20th the Cost
NewsJan 8, 2026

MiroMind’s MiroThinker 1.5 Delivers Trillion-Parameter Performance From a 30B Model — at 1/20th the Cost

MiroMind unveiled MiroThinker 1.5, a 30‑billion‑parameter model that delivers performance on par with trillion‑parameter rivals while costing roughly one‑twentieth as much per inference. The model introduces a "scientist mode" that forces verifiable research loops, dramatically cutting hallucinations and providing audit...

By VentureBeat
How Ralph Wiggum Went From 'The Simpsons' To the Biggest Name in AI Right Now
NewsJan 6, 2026

How Ralph Wiggum Went From 'The Simpsons' To the Biggest Name in AI Right Now

Anthropic’s Claude Code has introduced the Ralph Wiggum plugin, turning the model into an autonomous coding agent that loops until predefined success criteria are met. The tool originated from Geoffrey Huntley’s Bash script that fed model output back as input,...

By VentureBeat
Nvidia’s Cosmos Reason 2 Aims to Bring Reasoning VLMs Into the Physical World
NewsJan 5, 2026

Nvidia’s Cosmos Reason 2 Aims to Bring Reasoning VLMs Into the Physical World

Nvidia unveiled Cosmos Reason 2 at CES 2026, the latest vision‑language model built for embodied reasoning in robots and autonomous systems. The model expands on its predecessor's two‑dimensional ontology, letting enterprises customize agents that can plan next actions in real‑world settings. Nvidia...

By VentureBeat
Brex Bets on ‘Less Orchestration’ as It Builds an Agent Mesh for Autonomous Finance
NewsJan 5, 2026

Brex Bets on ‘Less Orchestration’ as It Builds an Agent Mesh for Autonomous Finance

Brex is shifting from traditional AI agent orchestration to an “Agent Mesh,” a network of narrow, role‑specific agents that converse in plain language and operate independently while maintaining full visibility. The mesh replaces a central coordinator with event‑driven message streams,...

By VentureBeat
Why “Which API Do I Call?” Is the Wrong Question in the LLM Era
NewsJan 3, 2026

Why “Which API Do I Call?” Is the Wrong Question in the LLM Era

The article argues that the traditional question "which API do I call?" is being replaced by "what outcome am I trying to achieve?" Modern large language models enable this shift through the Model Context Protocol (MCP), which translates natural‑language intent...

By VentureBeat
Why Notion’s Biggest AI Breakthrough Came From Simplifying Everything
NewsJan 2, 2026

Why Notion’s Biggest AI Breakthrough Came From Simplifying Everything

Notion AI’s breakthrough came from stripping away complex data models in favor of simple, human‑readable prompts and markdown representations. By rewiring its middleware and limiting context to a 100‑150k token window, the team delivered V3 with customizable AI agents that...

By VentureBeat
Seven Steps to AI Supply Chain Visibility — Before a Breach Forces the Issue
NewsJan 2, 2026

Seven Steps to AI Supply Chain Visibility — Before a Breach Forces the Issue

Enterprises are facing a critical AI visibility gap, with 62% unable to locate LLM deployments and a surge in prompt‑injection, vulnerable code, and jailbreaking attacks. Research shows only 6% of firms have advanced AI security strategies, while 13% reported AI...

By VentureBeat
Four AI Research Trends Enterprise Teams Should Watch in 2026
NewsJan 1, 2026

Four AI Research Trends Enterprise Teams Should Watch in 2026

Enterprises are shifting focus from raw model performance to research that makes AI production‑ready. Four trends—continual learning, world models, orchestration, and refinement—promise to keep models up‑to‑date, simulate physical environments, manage multi‑step workflows, and iteratively improve outputs without costly retraining. Companies...

By VentureBeat
Open Source Qwen-Image-2512 Launches to Compete with Google's Nano Banana Pro in High Quality AI Image Generation
NewsDec 31, 2025

Open Source Qwen-Image-2512 Launches to Compete with Google's Nano Banana Pro in High Quality AI Image Generation

Alibaba’s Qwen team released Qwen-Image-2512, an open‑source AI image model that rivals Google’s Gemini 3 Pro Image (Nano Banana Pro) in quality. The model delivers higher human realism, finer texture detail, and accurate embedded text for both Chinese and English...

By VentureBeat
Why Meta Bought Manus — and What It Means for Your Enterprise AI Agent Strategy
NewsDec 30, 2025

Why Meta Bought Manus — and What It Means for Your Enterprise AI Agent Strategy

Meta announced a more‑than‑$2 billion acquisition of Singapore‑based AI startup Manus, a general‑purpose agent that autonomously executes multi‑step tasks such as research, coding, and content creation. Manus boasts impressive usage metrics—over 147 trillion tokens processed, 80 million virtual computers created, and a $100 million...

By VentureBeat
Why AI Adoption Fails without IT-Led Workflow Integration
NewsDec 30, 2025

Why AI Adoption Fails without IT-Led Workflow Integration

At Gold Bond Inc., CIO Matt Price embedded generative AI directly into high‑friction workflows such as ERP intake, document processing, and call follow‑ups instead of launching a standalone chatbot. He formed a small “super‑user” cohort, ran sandbox tests, and layered...

By VentureBeat
New Year's AI Surprise: Fal Releases Its Own Version of Flux 2 Image Generator That's 10x Cheaper and 6x More...
NewsDec 29, 2025

New Year's AI Surprise: Fal Releases Its Own Version of Flux 2 Image Generator That's 10x Cheaper and 6x More...

Fal.ai unveiled FLUX.2 [dev] Turbo, a distilled LoRA adapter that speeds image generation to eight inference steps while cutting costs to $0.008 per 1024×1024 output. The model outperforms open‑weight rivals on benchmark ELO scores and delivers 6.6‑second latency for high‑resolution...

By VentureBeat
Inside Microsoft Ignite: How Microsoft and NVIDIA Are Redefining the AI Stack
NewsDec 29, 2025

Inside Microsoft Ignite: How Microsoft and NVIDIA Are Redefining the AI Stack

At Microsoft Ignite 2025, NVIDIA and Microsoft unveiled a unified AI stack that couples NVIDIA’s Blackwell GPUs with Azure’s new NCv6 virtual machines, expanding cloud‑native compute for complex AI and visual workloads. The partnership also introduced Omniverse libraries on Azure,...

By VentureBeat
Google Releases FunctionGemma: A Tiny Edge Model that Can Control Mobile Devices with Natural Language
NewsDec 19, 2025

Google Releases FunctionGemma: A Tiny Edge Model that Can Control Mobile Devices with Natural Language

Google AI unveiled FunctionGemma, a 270‑million‑parameter model that converts natural‑language commands into executable code on edge devices. Trained on a dedicated Mobile Actions dataset, its function‑calling accuracy climbs to 85%, far surpassing generic small models. The model runs locally on...

By VentureBeat
Palona Goes Vertical, Launching Vision, Workflow Features: 4 Key Lessons for AI Builders
NewsDec 18, 2025

Palona Goes Vertical, Launching Vision, Workflow Features: 4 Key Lessons for AI Builders

Palona AI, founded by former Google and Meta engineers, announced a vertical shift into the restaurant and hospitality sector with two new products—Palona Vision and Palona Workflow. Vision leverages in‑store security cameras to monitor queue lengths, table turnover, and kitchen...

By VentureBeat