
Immuta launches Agentic Data Access module for AI agents
Immuta unveiled an Agentic Data Access module that enables autonomous AI agents to retrieve enterprise data in real time while enforcing governance policies. The system treats agents as first‑class data users, applying least‑access and zero standing privileges with full audit trails. Built on Immuta’s policy engine, it adds registration, guardrail policies and multi‑approval controls.

SAP announced that Joule, its generative‑AI platform, is now generally available with SAP Signavio integration. The solution adds a natural‑language conversational layer to process models, allowing users to query owners, flow descriptions, and regional variations instantly. Joule orchestrates workflows across SAP S/4HANA, Business Technology Platform, SuccessFactors and non‑SAP systems, delivering informational, navigational and transactional capabilities. Early adopter feedback shows faster process discovery, reduced manual analysis, and smoother user onboarding.

Everyone’s watching which LLM wins benchmarks. I’m watching Informatica suddenly show up everywhere. GCP coverage up 200%. The enterprise data stack isn’t being replaced by AI. It’s becoming the delivery mechanism for AI. The boring infrastructure is now the moat. https://t.co/9dxQv330NZ
In a recent Data Engineering Central podcast, Bart Konieczny discussed the evolving synergy between Apache Spark, lakehouse architectures, and artificial intelligence. He highlighted Spark's latest performance enhancements, including Catalyst optimizer refinements and native GPU acceleration. Konieczny explained how lakehouses bridge...

KoreTech unveiled a suite of updates to its Kore Integrate platform, adding new connectors for Canals.ai, Blue Yonder WMS, LeadSmart CRM, FieldPulse and Housecall Pro. The release also upgrades security protocols and promises support for Microsoft SQL Server 2025. Kore Integrate already...

Entrinsik announced a spring 2026 roadshow, speaking and exhibiting at Accelerate, Ellucian Live, and MultiValue World. Scott Allen will lead a session on data‑driven agency growth at Accelerate, while the company will demo Informer AI Assistants for campus‑wide personalization at...

Scality and WEKA announced that Scality RING will serve as the back‑end object store for WEKA’s NeuralMesh high‑performance AI file system. The partnership leverages NeuralMesh’s SSD‑based front‑end with RING’s cost‑efficient, disk‑based object tier, delivering up to ten times faster performance than...

70% of executives say they have difficulty acting on data. Meanwhile, Power BI just won the 2025 Gartner Magic Quadrant. Again. The tools keep getting better. The problem isn’t the tools. It never was. Source: https://t.co/KNtNLIRTOQ https://t.co/ZOAPhWZKSf
Newsday, a Long Island‑based multiplatform news outlet, has leveraged the American Press Institute’s Metrics for News (MFN) analytics tool since 2018 to turn data into subscriber growth. By tracking audience enthusiasm for niche beats, the paper launched new content initiatives,...

Zifo and Maze Therapeutics have teamed up to launch an AI‑powered platform that manages, stores, and scales massive biobank datasets. The solution tackles the fragmentation of genetic, proteomic, and phenotypic data by providing a unified workflow that delivers summary statistics...
The article introduces pg_semantic_cache, a PostgreSQL extension that stores query results alongside vector embeddings to enable semantic caching. By matching on meaning rather than exact text, the extension can identify duplicate intents across varied phrasing, dramatically increasing cache hit rates....
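
To make the idea of matching on meaning concrete, here is a minimal in-memory sketch of a semantic cache. It is not the pg_semantic_cache API: the `SemanticCache` class, the `toy_embed` function, the vocabulary, and the 0.9 threshold are all hypothetical names and values chosen for illustration. A real deployment would use a proper embedding model; the toy embedding below only counts known words, so it is a stand-in that ignores word order, not a true semantic model.

```python
import math

# Hypothetical fixed vocabulary for the toy embedding (illustration only).
VOCAB = ["sales", "revenue", "month", "last", "total", "users", "active"]


def toy_embed(text):
    """Stand-in for a real embedding model: counts of known words."""
    words = text.lower().split()
    return [words.count(w) for w in VOCAB]


def cosine(a, b):
    """Cosine similarity between two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


class SemanticCache:
    """Stores (embedding, result) pairs and answers lookups by similarity."""

    def __init__(self, embed, threshold=0.9):
        self.embed = embed          # function: text -> vector
        self.threshold = threshold  # minimum similarity to count as a hit
        self.entries = []           # list of (embedding, result)

    def put(self, query, result):
        self.entries.append((self.embed(query), result))

    def get(self, query):
        q = self.embed(query)
        best = max(self.entries, key=lambda e: cosine(q, e[0]), default=None)
        if best is not None and cosine(q, best[0]) >= self.threshold:
            return best[1]  # cache hit: close enough to a stored query
        return None         # cache miss: caller should run the real query


cache = SemanticCache(toy_embed, threshold=0.9)
cache.put("total sales last month", 1200)
hit = cache.get("last month total sales")   # differently phrased, same words
miss = cache.get("active users")            # unrelated query
```

The threshold is the main design choice: set it too low and unrelated queries return stale or wrong results, set it too high and the cache degrades to exact matching.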

Snowflake announced the general availability of Snowflake Postgres, a fully managed PostgreSQL service built directly into the Snowflake platform. The offering delivers 100% community‑Postgres compatibility while leveraging Snowflake’s security, high‑availability architecture, and native AI capabilities. By unifying transactional and analytical...

The memo from Reis Megacorp outlines a 2028 scenario where AI agents can design, test, and deploy end‑to‑end data pipelines, rendering many data‑tooling jobs obsolete. By mid‑2027 the data labor market split: elite engineers commanding $400K+ salaries, a middle tier...

Collate, a semantic intelligence firm, unveiled a new Semantic Intelligence Graph that converts enterprise metadata into a machine‑readable RDF‑based graph. The launch includes AI Studio, offering four pre‑built agents—Data Quality, Tier Management, Documentation, and SQL Query—to automate data tasks. An...
Google Cloud Platform enables event‑driven pipelines that replace idle batch jobs with immediate reactions to data changes. The reference architecture uses Firestore as the event source, Cloud Functions or Eventarc to capture changes, Pub/Sub as the messaging backbone, and Dataflow...
Git for data is still underexplored, and it is an area that is changing fast. That's why we look at actual tools and features that showcase how to apply a Git-like workflow to data. I compared Git-like tools for data I could...

Microsoft has launched Azure Local, a fully disconnected private cloud that unifies Azure, Microsoft 365, and Foundry services for regulated enterprises. The offering supports offline governance, policy enforcement, and AI inferencing on on‑prem hardware, ensuring data never leaves customer‑controlled boundaries....

Data validation is gaining prominence as pipelines become more complex, and Python now offers a diverse set of libraries to address this need. The article reviews five tools—Pydantic, Cerberus, Marshmallow, Pandera, and Great Expectations—each targeting a different validation paradigm, from...

KODE Labs has launched EnerG, an AI‑enabled platform that consolidates utility, sustainability, and performance data for enterprise real‑estate portfolios. The solution replaces fragmented spreadsheets, PDFs and portal pulls with automated ingestion, validation and anomaly detection. Built as an extension of...
Manufacturers and OEMs are turning to integrated SCADA and analytics software to boost overall equipment effectiveness (OEE). Real‑time visibility into availability, performance, and quality replaces manual PLC checks and paper logs, enabling instant downtime tracking and quality monitoring. The combined...
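
As a worked illustration of the metric mentioned above, OEE is conventionally computed as availability × performance × quality. The sketch below uses that standard definition; the function name, parameter names, and the shift figures are hypothetical and do not come from the article.

```python
def oee(planned_minutes, downtime_minutes, ideal_cycle_seconds,
        total_units, good_units):
    """Return (availability, performance, quality, oee) as fractions of 1.

    Uses the conventional definition:
      availability = run time / planned production time
      performance  = (ideal cycle time * total units) / run time
      quality      = good units / total units
    """
    run_minutes = planned_minutes - downtime_minutes
    if planned_minutes <= 0 or run_minutes <= 0 or total_units <= 0:
        raise ValueError("planned time, run time and total units must be positive")

    availability = run_minutes / planned_minutes
    performance = (ideal_cycle_seconds * total_units) / (run_minutes * 60)
    quality = good_units / total_units
    return availability, performance, quality, availability * performance * quality


# Hypothetical shift: 480 planned minutes, 60 minutes of downtime,
# 1-second ideal cycle, 20,160 units produced, 19,152 of them good.
availability, performance, quality, score = oee(480, 60, 1.0, 20160, 19152)
print(f"availability={availability:.3f} performance={performance:.3f} "
      f"quality={quality:.3f} OEE={score:.3f}")
```

With these made-up inputs the three factors are 0.875, 0.8 and 0.95, which multiply to an OEE of about 0.665. Because the factors multiply, a modest loss in each one compounds into a much lower overall score.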

Reddit has been hit with a £14.47 million fine from the UK Information Commissioner’s Office after the regulator found the platform’s age‑verification process inadequate and that it processed personal data of users under 13 without a lawful basis. The ICO criticised...

Speedata Ltd. announced a partnership with Nebul to integrate its purpose‑built Analytics Processing Unit (APU) into Nebul’s European sovereign cloud. The APU claims up to 100× performance gains over CPUs and GPUs for Apache Spark workloads, cutting server counts and...

AI chip startup MatX, founded by two former Google semiconductor engineers, announced a funding round exceeding $500 million to accelerate development of GPUs that challenge Nvidia’s market dominance. The round was led by Jane Street and Situational Awareness, with participation from...

Dominion Energy announced that its contracted data‑center capacity now exceeds 48 GW, a three‑percent increase since September. The utility lifted its five‑year capital‑investment outlook by 30% to $65 billion, with over 90% earmarked for Virginia to meet accelerating data‑center load. A new...

ArisGlobal unveiled XDI, a Data Intelligence Cortex that federates fragmented life‑science data without centralizing it. The platform delivers continuous, explainable, decision‑grade intelligence across domains such as pharmacovigilance, benefit‑risk, and regulatory operations. XDI promises up to 80% reduction in compliance effort...

Scalo announced an expanded partnership with Databricks, joining its Consulting and Service Integration Partner Program to bolster its Data & AI practice. The collaboration enables enterprise clients to centralize data on a lakehouse foundation, streamline data flows, and deploy AI...

Nordic data‑center operator atNorth announced a 300 MW campus in Sollefteå, Sweden, to be built on a 50‑hectare plot at Hamre Industrial Park and targeted for H1 2028. The facility will feature direct liquid cooling and support rack densities up to 1 MW,...

A TUI for managing Airflow jobs? Something like k9s? Flowrs seems to be just that - haven't tried yet, but looks really cool. Will try next time I have to use Airflow :) https://github.com/jvanbuel/flowrs
Simple is good. One-line code change to switch from Apache Cassandra to a @googlecloud Spanner database. https://t.co/2n6AJutoNM Generate embeddings automatically for @googlecloud BigQuery table. https://t.co/SqIQzawOvt https://t.co/zWknasRT6r

Omnisend embedded large language models into its DataOps pipeline, using the Cursor AI editor to auto‑generate SQL, YAML and documentation, shrinking model‑building cycles from hours to minutes. A second LLM, Gemini Code Assist, acts as an automated reviewer, cutting review...
In 2026 hedge funds are pouring tens of millions of dollars into alternative data, turning information velocity into a core competitive lever. AI-driven analytics have lowered the barrier to processing vast datasets—from satellite imagery to web traffic—shifting the edge toward...
Yes, AI can write the SQL. But do you understand:
* Why that join works?
* Why that model makes sense?
* Why that metric matters?
AI lowers the barrier. Foundations raise your ceiling.

Human verification tools are emerging as essential safeguards for data‑driven enterprises, confirming that online interactions stem from real individuals rather than bots or synthetic identities. Modern solutions combine biometrics, AI, and privacy‑focused designs to validate personhood at scale, reducing fraudulent...

TiVo has pivoted from its legacy DVR brand to a data‑infrastructure player, leveraging its deep content metadata and household viewership signals. The company emphasizes independent, comprehensive data that uniquely combines the "what" (metadata) and the "who" (audience behavior) across linear...

Qdrant has launched version 1.17.0, introducing a Relevance Feedback Query that refines vector‑search results using lightweight model feedback. The release also adds latency‑reduction features such as configurable fan‑out thresholds, an update queue for up to one million pending writes, and an indexed‑only...

BMC announced a five‑year strategic collaboration with Amazon Web Services, designating AWS as the preferred cloud for its Control‑M SaaS platform. The partnership integrates BMC’s intelligent automation and generative AI advisor Jett with AWS’s scale, performance, and security. Joint customers...
MarTech outlines a framework for B2B firms to future‑proof AI deployments through robust data governance and consent management. It stresses tagging consent metadata at capture, using centralized policy tools with decentralized enforcement, and establishing a cross‑functional governance council. The guide...
Controversial opinion: don't start with a semantic layer. A semantic layer makes sense when:
- You have multiple consumers (BI, notebooks, apps)
- KPIs are defined inconsistently across teams
- You need a universal API for metrics
If you're early stage with one BI tool,...
In this episode Tim Berglund talks with Colt McNealy, founder and CEO of Little Horse, about building a Kafka‑based platform for orchestrating microservice workflows and AI agents. Colt describes how his early experience debugging monolithic code with GDB contrasted with...
Vodafone’s Network‑as‑a‑Sensor (NWaaS) program is now operating pan‑European, using thousands of microwave backhaul links to turn the carrier’s infrastructure into a distributed weather‑monitoring platform. The service can infer rain, fog, humidity and, with added mast‑mounted sensors, air‑quality data, delivering near‑real‑time...

In this episode, Ivan Poupyrev, CEO of Archetype AI, explains that "physical AI" goes far beyond robotics, embedding foundation‑model intelligence into everyday devices—from washing machines to HVAC systems—and enabling them to communicate and optimize as a unified system. He outlines...

Pulselight has become an authorised partner on the £10 bn Fortrus Digital Enablement Framework, giving NHS trusts a fast, compliant route to acquire its advanced data‑analytics platform. The framework, created by the Countess of Chester Hospital NHS Foundation Trust, streamlines procurement...
Snowflake is hosting "Data for Breakfast Canberra" on 17 March 2026, aimed at Australian Public Service (APS) data and AI professionals. The event will feature a Snowflake keynote on secure, AI‑ready data collaboration, public‑sector case studies, and deep‑dive sessions on agents and...

❌Most data science projects take 4 weeks because of meetings, reruns, and handoffs between teams ✅A good AI/DS workflow compresses it to ~15 minutes. I’m demo-ing how to do it live (free): https://learn.business-science.io/registration-ai-workshop-2

Sri Lanka has launched CROPIX DPI, a national digital platform that consolidates fragmented agricultural data into a single, mobile‑accessible system. The platform integrates the crop registry, yield forecasts and climate analytics, enabling automated data exchange among farmers, officials and policymakers....
Will Rust kill Python in data engineering? No. But it has already consumed much of the JavaScript tooling ecosystem. And it's quietly doing the same in data. The pattern: Python remains the interface, Rust becomes the engine. Polars, DataFusion, DuckDB's internals - all Rust...

ROW_NUMBER(), RANK(), DENSE_RANK(). Three functions, three different behaviors. Pick the wrong one and your rankings mislead. Here are 4 patterns to get it right:
- ranking with gaps vs without
- top-N per category
- deduplication
- running totals
1. ROW_NUMBER() vs RANK() vs DENSE_RANK() Three functions, three behaviors...
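
The difference between the three functions is easiest to see on tied values. The sketch below runs them side by side in an in-memory SQLite database through Python's standard `sqlite3` module, assuming the bundled SQLite is version 3.25 or later (the first release with window functions). The `scores` table, its rows, and the `rank_demo` function are made up for illustration.

```python
import sqlite3


def rank_demo():
    """Return (player, score, row_num, rnk, dense_rnk) rows for a small tied dataset."""
    conn = sqlite3.connect(":memory:")
    conn.execute("CREATE TABLE scores (player TEXT, score INTEGER)")
    conn.executemany(
        "INSERT INTO scores VALUES (?, ?)",
        [("ana", 90), ("ben", 90), ("cy", 80), ("dee", 70)],  # ana and ben tie
    )
    rows = conn.execute(
        """
        SELECT player,
               score,
               -- always unique; player is added as a tie-breaker so the order is deterministic
               ROW_NUMBER() OVER (ORDER BY score DESC, player) AS row_num,
               -- ties share a rank, and the next rank skips ahead (gaps)
               RANK()       OVER (ORDER BY score DESC)         AS rnk,
               -- ties share a rank, and the next rank is consecutive (no gaps)
               DENSE_RANK() OVER (ORDER BY score DESC)         AS dense_rnk
        FROM scores
        ORDER BY score DESC, player
        """
    ).fetchall()
    conn.close()
    return rows


for row in rank_demo():
    print(row)
```

For the two players tied at 90, ROW_NUMBER() assigns 1 and 2, while RANK() and DENSE_RANK() both assign 1. The next player gets 3 from ROW_NUMBER() and RANK() but 2 from DENSE_RANK(). That is why ROW_NUMBER() is the usual choice for deduplication (exactly one row gets number 1) and the ranking variants are used when ties should be visible.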

Enterprises are still paying for legacy telecom services that are unused, creating hidden cost leaks. A data‑first approach—digitizing invoices, consolidating contracts, and applying AI/ML analytics—provides clear visibility into service usage and pricing. Companies that adopt this model can shift telecom...
Salesforce is now bridging four domains at once:
- Salesforce Implementation (CRM)
- Databricks (data lake)
- Agentforce (AI agents)
- Data 360 (data platform)
The platform wars are not about features. They are about who owns the most connected node in your stack.
The Data Integrity Gap: From “Big Data” to “Reliable Physics”. Click to learn everything you need to know about issues you likely don't know you have, or will soon have, in your organisation. https://t.co/LrOOv5lGcm
This is a common problem and one of our biggest motivations in building Nile - to isolate tenants automatically and by default.