Israeli AI Digital Adoption Platform Guidde announced an oversubscribed $50 million Series B round led by PSG Equity. The funding will accelerate the company's video‑based AI training platform for enterprise workflows, expanding its customer base of 4,500 enterprises. The round underscores growing investor interest in AI‑driven knowledge infrastructure.
The article argues that AI compute growth is shifting from GPU‑centric training to inference speed, with Groq’s Language Processing Unit (LPU) offering dramatically lower latency for reasoning‑heavy models. Nvidia, which has historically moved from gaming GPUs to generative AI, could...
Mastra’s open‑source observational memory replaces dynamic retrieval with two background agents that compress conversation history into a dated observation log. The approach achieves 3‑6× compression for text and up to 40× for tool‑heavy outputs, keeping the context window stable and...
Mastercard’s Decision Intelligence Pro (DI Pro) uses a sub‑300 ms recurrent neural network to assign risk scores to each payment transaction in real time. The platform treats fraud detection as an "inverse recommender" problem, comparing current merchant behavior to historical patterns. By...
Nvidia unveiled DreamDojo, a robot world model trained on a 44,000‑hour human egocentric video dataset, enabling robots to acquire physical intuition by observation before hardware‑specific fine‑tuning. The DreamDojo‑HV dataset is 15× longer, contains 96× more skills and spans 2,000× more...
Enterprises are spending billions on GPU clusters for AI, yet many GPUs sit idle because the data delivery layer between object storage and compute cannot keep pace. F5 argues that the real bottleneck is not the GPUs but the lack...
Vijoy Pandey of Cisco Outshift and Stanford professor Noah Goodman argue that today’s AI agents can connect but cannot truly think together. They propose an "Internet of Cognition"—a three‑layer architecture of protocol, fabric, and cognition engines—to enable shared intent, knowledge,...
TrueFoundry unveiled TrueFailover, an autonomous resilience layer that detects AI provider outages, slowdowns, or quality drops and instantly reroutes enterprise traffic to backup models and regions. The system integrates multi‑model, multi‑region routing, degradation‑aware monitoring, and dynamic prompt adjustment to preserve...
The AI boom consists of three distinct layers—wrapper companies, foundation‑model providers, and infrastructure—each with its own risk profile and timeline. Wrapper startups that merely repackage APIs are expected to implode first, as large platforms absorb their functionality and margins evaporate....
Anthropic has rolled out a major update to Claude Code called MCP Tool Search, which introduces lazy loading of tool definitions. The change stops the model from pre‑loading every available tool, cutting token consumption by up to 85 percent. Early...
Google Research’s new paper reveals that simply repeating a user query—placing the prompt twice in the input—significantly lifts accuracy on non‑reasoning tasks, with gains as high as 76% across models such as Gemini, GPT‑4o, Claude and DeepSeek. The technique exploits...
Egnyte, a $1.5 billion cloud content governance firm, has deployed AI coding assistants such as Claude Code, Cursor, Augment and Gemini CLI across its 350‑plus developer workforce. Despite the automation hype, the company continues hiring junior engineers, using AI to accelerate onboarding, code...
DeepSeek introduced Engram, a conditional memory module that separates static pattern retrieval from dynamic reasoning in large language models. By allocating roughly 25% of sparse capacity to memory and 75% to computation, the system achieves O(1) lookups via hash tables...
Salesforce launched a rebuilt Slackbot AI agent for Business+ and Enterprise+ customers, powered by Anthropic’s Claude large language model and integrated with Salesforce records, Google Drive, calendars, and Slack history. Internally, 80,000 employees adopted it rapidly, achieving 96% satisfaction and...
Japanese startup Sakana AI’s coding agent ALE‑Agent captured first place in the AtCoder Heuristic Contest, outpacing more than 800 human competitors. The four‑hour run leveraged inference‑time scaling, generating, testing, and iterating hundreds of solutions. By introducing a "Virtual Power" concept,...
Nvidia unveiled the Vera Rubin NVL72 at CES 2026, a rack‑scale platform that encrypts every bus across 72 GPUs, 36 CPUs and the entire NVLink fabric, delivering the first fully confidential computing stack for AI workloads. The move addresses a...
DoorDash grew from a 2013 startup to a global local‑commerce leader while retaining its original Oracle NetSuite system. The company avoided a multi‑million‑dollar ERP migration, instead leveraging NetSuite’s cloud‑based scalability to support IPO, acquisitions, and expansion into grocery, convenience and...
A company saw its LLM API bill rise 30% month‑over‑month despite modest traffic growth. Analysis revealed that users asked the same questions in varied phrasing, causing duplicate LLM calls that exact‑match caching missed. By replacing text hashes with embedding‑based semantic...
Anthropic has deployed new technical safeguards that block third‑party harnesses spoofing its Claude Code client, disrupting open‑source tools like OpenCode and causing automatic account bans. The same enforcement also cut off rival labs such as xAI from using Claude models...
Orchestral AI launches a new Python framework that replaces the asynchronous complexity of tools like LangChain with a synchronous, type‑safe architecture aimed at reproducible research. The framework is provider‑agnostic, supporting OpenAI, Anthropic, Google Gemini, Mistral and local models via Ollama,...
KPMG has integrated SAP's conversational AI, Joule for Consultants, across 29 member firms, giving thousands of consultants real‑time access to SAP best practices. The tool streamlines documentation‑heavy SAP projects, accelerating design workshops and reducing reliance on manual knowledge retrieval. By...
Databricks unveiled the Instructed Retriever, a new architecture that claims up to a 70% boost over traditional Retrieval‑Augmented Generation (RAG) on complex, instruction‑heavy enterprise question‑answering tasks. The improvement stems from propagating full system specifications—user instructions, metadata schemas, and examples—through every...
MiroMind unveiled MiroThinker 1.5, a 30‑billion‑parameter model that delivers performance on par with trillion‑parameter rivals while costing roughly one‑twentieth as much per inference. The model introduces a "scientist mode" that forces verifiable research loops, dramatically cutting hallucinations and providing audit...
Anthropic’s Claude Code has introduced the Ralph Wiggum plugin, turning the model into an autonomous coding agent that loops until predefined success criteria are met. The tool originated from Geoffrey Huntley’s Bash script that fed model output back as input,...
Nvidia unveiled Cosmos Reason 2 at CES 2026, the latest vision‑language model built for embodied reasoning in robots and autonomous systems. The model expands on its predecessor's two‑dimensional ontology, letting enterprises customize agents that can plan next actions in real‑world settings. Nvidia...
Brex is shifting from traditional AI agent orchestration to an “Agent Mesh,” a network of narrow, role‑specific agents that converse in plain language and operate independently while maintaining full visibility. The mesh replaces a central coordinator with event‑driven message streams,...
The article argues that the traditional question "which API do I call?" is being replaced by "what outcome am I trying to achieve?" Modern large language models enable this shift through the Model Context Protocol (MCP), which translates natural‑language intent...
Notion AI’s breakthrough came from stripping away complex data models in favor of simple, human‑readable prompts and markdown representations. By rewiring its middleware and limiting context to a 100‑150k token window, the team delivered V3 with customizable AI agents that...
Enterprises are facing a critical AI visibility gap, with 62% unable to locate LLM deployments and a surge in prompt‑injection, vulnerable code, and jailbreaking attacks. Research shows only 6% of firms have advanced AI security strategies, while 13% reported AI...
Enterprises are shifting focus from raw model performance to research that makes AI production‑ready. Four trends—continual learning, world models, orchestration, and refinement—promise to keep models up‑to‑date, simulate physical environments, manage multi‑step workflows, and iteratively improve outputs without costly retraining. Companies...
Alibaba’s Qwen team released Qwen-Image-2512, an open‑source AI image model that rivals Google’s Gemini 3 Pro Image (Nano Banana Pro) in quality. The model delivers higher human realism, finer texture detail, and accurate embedded text for both Chinese and English...
Meta announced a more‑than‑$2 billion acquisition of Singapore‑based AI startup Manus, a general‑purpose agent that autonomously executes multi‑step tasks such as research, coding, and content creation. Manus boasts impressive usage metrics—over 147 trillion tokens processed, 80 million virtual computers created, and a $100 million...
At Gold Bond Inc., CIO Matt Price embedded generative AI directly into high‑friction workflows such as ERP intake, document processing, and call follow‑ups instead of launching a standalone chatbot. He formed a small “super‑user” cohort, ran sandbox tests, and layered...
Fal.ai unveiled FLUX.2 [dev] Turbo, a distilled LoRA adapter that speeds image generation to eight inference steps while cutting costs to $0.008 per 1024×1024 output. The model outperforms open‑weight rivals on benchmark ELO scores and delivers 6.6‑second latency for high‑resolution...
At Microsoft Ignite 2025, NVIDIA and Microsoft unveiled a unified AI stack that couples NVIDIA’s Blackwell GPUs with Azure’s new NCv6 virtual machines, expanding cloud‑native compute for complex AI and visual workloads. The partnership also introduced Omniverse libraries on Azure,...
Google AI unveiled FunctionGemma, a 270‑million‑parameter model that converts natural‑language commands into executable code on edge devices. Trained on a dedicated Mobile Actions dataset, its function‑calling accuracy climbs to 85%, far surpassing generic small models. The model runs locally on...
Palona AI, founded by former Google and Meta engineers, announced a vertical shift into the restaurant and hospitality sector with two new products—Palona Vision and Palona Workflow. Vision leverages in‑store security cameras to monitor queue lengths, table turnover, and kitchen...
OpenAI announced that third‑party developers can now submit apps for inclusion in a new ChatGPT App Directory, accessible from the sidebar and chat interface. The submission process went live on December 17, with approved apps slated to roll out to users...
OpenAI released GPT‑5.2‑Codex, an agentic coding model built on GPT‑5.2 with enhanced cybersecurity capabilities. The model achieved top scores on Capture‑the‑Flag, CVE‑Bench (87%) and a 72.7% pass rate on Cyber Range tests, demonstrating improved long‑horizon code understanding. Enterprise users can...
JPMorgan Chase rolled out an internal LLM‑powered assistant suite two‑and‑a‑half years ago, and adoption surged to over 60% of its 250,000‑plus workforce without mandates. The rapid, organic uptake stemmed from a connectivity‑first architecture that embeds AI into existing data, CRM,...
Patronus AI, backed by $20 million, unveiled Generative Simulators—a dynamic training architecture that creates adaptive, continuously evolving environments for AI agents. The platform aims to replace static benchmarks, which have struggled to predict real‑world performance, by generating on‑the‑fly challenges and feedback....
Small and mid‑size businesses are rapidly deploying AI at the edge, moving workloads from centralized data centers to retail stores, clinics, and remote sites. This shift delivers real‑time insights, resilience, and faster deployment but strains network bandwidth and security controls....
Zencoder unveiled Zenflow, a free desktop AI orchestration tool that coordinates multiple AI agents—such as Claude and OpenAI models—to plan, implement, test, and review code in structured workflows. The platform replaces ad‑hoc prompting with repeatable sequences, spec‑driven development, multi‑agent verification,...
Israeli startup Echo raised $35 million Series A to overhaul container base images, the hidden OS layer of cloud workloads, with a secure‑by‑design approach. The company rebuilds images from source, hardens them to SLSA Level 3, and uses autonomous AI agents to monitor and...
Vectorize.io’s open‑source Hindsight memory architecture outperforms traditional retrieval‑augmented generation (RAG) by organizing agent knowledge into four specialized networks. The system achieved a record 91.4% accuracy on the LongMemEval benchmark, dramatically boosting multi‑session recall, temporal reasoning, and knowledge‑update scores. Hindsight’s TEMPR...
Zoom announced that its federated AI system achieved a 48.1% score on the Humanity's Last Exam, surpassing Google’s Gemini 3 Pro benchmark. The approach routes queries to multiple external models and selects the best output via a proprietary Z‑scorer. Critics...
Israeli startup Echo announced a $35 million Series A round to commercialize its AI‑powered platform that rebuilds and hardens container base images. The round, led by N47 with participation from Notable Capital, Hyperwise Ventures and SentinelOne, brings Echo’s total funding to...
Marble, an AI startup for tax professionals, announced a $9 million seed funding round led by Susa Ventures with participation from MXV Capital and Konrad Capital. The capital will fuel the rollout of its free AI-driven tax research platform and...
London‑based AI startup Ascentra Labs announced a $2 million seed round on Monday, led by Berlin venture firm NAP with participation from several founder‑angels. The funding will fuel the company’s U.S. expansion and go‑to‑market efforts targeting consulting firms.
HumanSignal announced the acquisition of Erud AI earlier this month, expanding its Frontier Data Labs for novel data collection. The deal, disclosed without a financial figure, aims to strengthen HumanSignal's position in enterprise AI data labeling and evaluation services.
AI startup CraftStory, founded by OpenCV creators, announced a $2 million seed round led by investor Andrew Filev. The funding will support its Model 2.0 system that generates five‑minute human‑centric videos for enterprise training and marketing.
Berlin‑based dltHub announced an $8 million seed funding round today, led by Bessemer Venture Partners. The capital will fuel the development of its cloud‑hosted platform that extends the open‑source dlt library for AI‑native data pipelines. The round underscores growing investor interest...