
TGS Taps Tape Ark to Migrate Around 40 Petabytes of Data to the Cloud
Energy intelligence firm TGS has engaged Tape Ark to move roughly 40 petabytes of seismic and subsurface data into a hyperscale cloud environment. The migration leverages Tape Ark’s parallel ingest platform to accelerate high‑throughput transfer across multiple facilities. Once in the cloud, TGS will offer customers on‑demand access, scalable high‑performance computing, and advanced analytics capabilities. The specific cloud provider remains undisclosed, though Tape Ark lists major partners such as AWS, Azure and Google Cloud.
Data Pipeline Failures Cost Enterprises $3 Million per Month, Fivetran Benchmark Finds
Fivetran’s 2026 enterprise data infrastructure benchmark, based on a survey of 500 senior data leaders at firms with over 5,000 employees, reveals that fragile data pipelines are costing large organizations roughly $3 million in lost revenue each month. Nearly 97% of...

Cortex Code Updates: Faster AI Data Engineering on Snowflake
Snowflake announced a major upgrade to its Cortex Code AI coding agent, making it generally available inside Snowsight and adding native Windows support for the CLI. The update introduces Agent Teams, a coordination layer that lets multiple sub‑agents work in...

Gaskins: How Data and Data Analytics Improve Asset Utilization and Loaded Miles
Patrick Gaskins explains how real‑time fleet data and predictive analytics are reshaping trucking operations. By giving dispatchers minute‑by‑minute visibility, carriers can match loads to trucks, cut empty miles, and lift loaded‑mile percentages. Integrated network‑wide platforms further align operations, sales, and...

How Big Data Collection Works: Process, Methods, Challenges
Enterprises are racing to harness big data, with 99% of Fortune 1000 executives reporting active programs and 96% seeing success. The data landscape spans structured, semi‑structured and unstructured sources, generating roughly 2.5 quintillion bytes daily. Effective collection relies on ETL pipelines...
Snowflake Introduces Project SnowWork to Enable AI-Driven Enterprise Task Execution
Snowflake announced a research preview of Project SnowWork, an autonomous AI platform embedded in its data cloud that lets business users trigger complex, multi‑step workflows with natural‑language prompts. The system deploys secure, data‑grounded AI agents that can query governed data,...

How Lumi AI Helps CPGs Find ‘Multi-Million-Dollar Opportunities’ Hidden in Their Supply Chain Data
Lumi AI, founded in 2023, offers a natural‑language interface that plugs into ERP systems like SAP and Oracle, letting CPG and food‑retail teams query supply‑chain data instantly. The startup has secured $3.7 million in seed funding and counts Kroger, Growmark and...

Domo Launches AI Agent Builder with Broad Enterprise Data Connectivity
Domo Inc. announced an AI agent builder that includes a library of enterprise data connectors powered by the Model Context Protocol. The platform lets users design conversational or goal‑oriented agents that can pull internal and external data, automate tasks, and...

US Clouds Cast Long Shadow over EU Data Sovereignty, Says Osmium
Osmium Data Group warns that using US‑owned cloud providers for backups undermines European data‑sovereignty, even when the physical datacenter sits in the EU. The firm evaluated four source‑and‑destination scenarios, ranking a Europe‑owned source and datacenter as highest compliance, while a...

CData Sync Adds Pipeline Orchestration with Real-Time CDC and Open Table Formats
CData Software unveiled major upgrades to its CData Sync platform, adding native pipeline orchestration, an enhanced API 2.0, and enterprise‑grade change data capture (CDC) for IBM DB2 and SAP HANA. The solution now writes directly to open table formats such...

Why IBM Paid $11B For Real-Time AI, Not Kafka
IBM completed an $11 billion acquisition of Confluent on March 17, 2026, adding the leading data‑streaming platform used by over 6,500 enterprises, including 40 % of the Fortune 500. IBM frames the deal as buying an AI‑focused data platform that delivers real‑time data to power...

Entrinsik Informer Improves Reporting for Insurance Agencies
Entrinsik Informer now offers insurance agencies an automated data‑quality layer that plugs into AMS360, surfacing missing fields, duplicate records, and inconsistent structures before reports are generated. The solution replaces manual data‑hunt routines with a continuous Data Report Card that highlights...

BMLL, Tradefeedr Partner on Analytics for Equities and Futures Data
BMLL and Tradefeedr announced a partnership to create an AI‑ready analytics layer for equities and futures trading data, leveraging BMLL’s harmonised historical order‑book datasets. The collaboration will extend Tradefeedr’s existing FX analytics APIs to cover multi‑asset execution data, delivered through...

Data in Action: Why Airports Can’t Afford to Get This Wrong
Airports are betting on data to drive efficiency, resilience and passenger experience, yet many still stumble on turning raw information into actionable insight. At the International Airport Summit in Berlin, senior leaders highlighted that reliable data, strong governance and clear...

All AI and Security Teams Need Transparent Data Pipelines
Organizations that rely on opaque AI data sources expose themselves to integrity risks, compliance gaps, and trust deficits. Without auditable pipelines, security teams cannot verify data quality, leading to hallucinations and regulatory violations such as under the EU AI Act....

Op-Ed: Singapore Cruise Centre Reimagines Passenger Operations with Real-Time Data
Singapore Cruise Centre (SCCPL) is entering the final stage of a five‑year digital transformation that centers on a real‑time data integration platform built on Solace’s event‑driven architecture. The platform unifies passenger, vessel, baggage, staff and resource data, enabling instant updates...

Immuta Introduces the First Data Provisioning Platform for Managing Agentic Data Access
Immuta unveiled the first data provisioning platform designed to manage AI agent access, treating agents as distinct identities with attributes, intent, and audit trails. The Agentic Data Access feature grants just‑in‑time, temporary roles on cloud data warehouses such as Snowflake,...

Anynines Advances Klutch to Power A9s Hub for Kubernetes Data Service Orchestration Across On-Premises and AWS Environments
anynines unveiled its open‑source Klutch control plane at KubeCon EU, positioning it as the core of the a9s Hub framework for data‑service orchestration across on‑premises and AWS environments. The solution lets platform teams expose databases, object storage and caches through...

Maryland’s Data Lead Reflects on Ongoing ‘Culture Shift’
Maryland has intensified data‑driven decision making under Governors Larry Hogan and Wes Moore, with Chief Data Officer Natalie Evans Harris describing a statewide "culture shift" toward breaking data silos. The state is building a centralized governance structure and an enterprise...

Integrating AI-Ready Data with Informatica and Snowflake
Informatica and Snowflake partnered in a DBTA webinar to showcase how metadata‑driven governance, data quality and observability can make Snowflake’s AI Data Cloud AI‑ready. The discussion highlighted Informatica’s end‑to‑end data management capabilities, including tag‑based PII masking, automated semantic classification and...

Adactin Launches AI-Powered Knowledge Platform AFIVE
Adactin unveiled AFIVE, an AI‑powered knowledge platform built on Microsoft Azure OpenAI and AI Foundry. It uses retrieval‑augmented generation with LangChain to pull data from SharePoint, Google Drive, Azure Blob Storage and Dropbox. The solution offers natural‑language queries, integrates with...

Child Protection Workers Are Under Pressure in NZ. Can Predictive Modelling Help?
Frontline child protection workers in New Zealand face growing caseloads, time pressure and fragmented information, making high‑stakes decisions about child safety and family intervention. Predictive modelling, which analyses large administrative datasets to generate risk scores, has been explored for over a...

Drowning in Data Sets? Here’s How to Cut Them Down to Size
The Square Kilometre Array Observatory (SKAO) will soon produce up to 60 exabytes of raw data annually, dwarfing the 700‑petabyte baseline currently planned for storage. Scientists are forced to discard raw observations once processed images meet quality thresholds, a practice...

Accelerating Redshift Modernization with Confidence: How Snowflake Automates and De-Risks Migration
Snowflake’s SnowConvert AI offers an end‑to‑end, AI‑driven solution for migrating Amazon Redshift workloads to Snowflake. It begins with an automated assessment that maps objects, gauges conversion complexity, and creates structured migration waves. The platform then converts SQL and procedural code...
Toward Intelligent Data Quality in Modern Data Pipelines
Modern data pipelines face growing data quality challenges that go beyond simple schema checks, as subtle semantic drift and incomplete datasets can silently degrade analytics. Current deterministic quality frameworks rely on static rules and thresholds, which become noisy and costly...

Understanding the Layers of the AI‑ready Modern Data Stack
Enterprises are rapidly replacing legacy data architectures with an AI‑ready modern data stack as AI initiatives surge. Deloitte’s 2026 survey shows strategic AI readiness rose to 42%, but confidence in data‑management capabilities slipped to 40%, while an IDC study found...
From DLT to Lakeflow Declarative Pipelines: A Practical Migration Playbook
Databricks is rebranding Delta Live Tables as Lakeflow Spark Declarative Pipelines, adding open‑source Spark alignment and new features. Existing DLT pipelines run unchanged, but Databricks recommends updating imports, decorators, expectations, and CDC logic to the new `dp` API. The migration...

How to Build an Effective Big Data Strategy
Smart organizations leverage big data to boost performance, but without a clear strategy they risk duplicated projects, compliance breaches, and wasted spend. The article outlines a four‑step framework—defining business goals, assessing data readiness, prioritizing use cases, and creating a flexible...
LightningChart Introduces No-Code Visualization Platform Dashtera
LightningChart unveiled Dashtera, a no‑code, web‑based analytics platform that leverages GPU‑accelerated rendering to display up to 100 million data points in real time. The solution removes the need for extensive implementation projects, data reduction, or custom integration, delivering instant zoom and...

Informatica Adds Microsoft Fabric Support and Opens Swiss Data Center
Informatica announced general availability of Microsoft Fabric Open Mirroring within its Intelligent Data Management Cloud (IDMC) and launched a new Azure‑based IDMC delivery point in Switzerland. The Open Mirroring feature lets customers synchronize data between OneLake and Fabric Data Warehouse...

Interview: Huy Dao, Director of Data and Machine Learning Platform, Booking.com
Booking.com’s data and machine‑learning platform, led by Huy Dao, has completed a seamless migration from on‑prem Hadoop to a Snowflake‑based cloud ecosystem. The new Booking Data Exchange serves over 1,500 practitioners, handling petabytes of data and billions of daily predictions...
SAPinsider Las Vegas: Why Data Strategy Must Start With Trust:
At SAPinsider Las Vegas 2026, Ingo Hilgefort warned that data‑driven AI projects fail when organizations lack trust in their data. He argued that inconsistent definitions and poor governance cause users to rebuild dashboards to verify numbers, stalling analytics adoption. Hilgefort...

How a Nonprofit Transforms Data with Cloudera and AI
Rare Hope, a nonprofit focused on rare‑disease hypotheses, adopted Cloudera’s hybrid data‑and‑AI platform to turn unstructured research papers and medical images into structured insights. Using PySpark pipelines, the organization extracts disease‑drug correlations and feeds them to large language models for...

Federal AI Needs a New Data Foundation. Dell’s Platform Is Built for It.
The federal government is accelerating its adoption of generative AI, retrieval‑augmented generation, and early agentic systems, but agencies are constrained by legacy data architectures. Dell’s AI data platform offers a secure, federated foundation that lets classified and regulated data remain...

Taming the IoT Firehose: How Utilities Are Scaling Cloud DataOps for Smart Metering
Utilities are grappling with an "IoT firehose" as smart meters generate massive, continuous telemetry streams. To tame the volume, they are adopting cloud‑based DataOps frameworks that automate ingestion, normalize data, and deliver analytics‑ready datasets at scale. Automated, event‑driven pipelines enable...

Microsoft Promises All-in-One Database Wrangling Hub on Fabric
Microsoft unveiled Database Hub, an early‑access tool built on the Fabric data platform that consolidates management of Azure SQL Server, Cosmos DB, PostgreSQL, MySQL, Azure Arc‑enabled SQL, and other services. The hub offers a single pane of glass for on‑premises,...

Lloyd's Register, OneOcean Report Warns Shipping Must Master Data to Remain Competitive
Lloyd’s Register and OneOcean released a report warning that the maritime sector’s surge in operational data is hampered by fragmentation and low standardisation, jeopardising compliance and commercial advantage. Their Digital Maturity Index shows data standardisation at 2.45 / 4 while overall digital...

Oracle Announced the General Availability of Oracle Analytics Server 2026
Oracle announced the general availability of Oracle Analytics Server 2026, delivering a suite of enhancements aimed at boosting adoption, performance, and governed self‑service. New defaults for the "Limit Values By" filter and a redesigned State menu streamline workbook interactions. The...

GHD Appoints David McLaren to Lead Data and AI Capabilities Globally
GHD has appointed David McLaren as its Enterprise Data & AI Leader, based in Toronto. McLaren brings experience from Coca‑Cola Canada Bottling, where he built enterprise‑scale data platforms, automation and governance. At GHD he will steer the development of an...

Data Lineage Documentation Matters for Enterprise Reliability
Enterprises are increasingly recognizing that knowing where data resides is insufficient without visibility into its lifecycle. Data lineage—tracking origin, transformations, and access—provides the transparency needed for accountability, data quality, compliance, and reduced technical debt. The article highlights how poor lineage...
Ibrar Ahmed: RAG With Transactional Memory and Consistency Guarantees Inside SQL Engines
Current retrieval‑augmented generation (RAG) systems were built for static document search, which creates consistency problems when multiple agents write concurrently. Without transactional control, memory updates can become partially committed, leading to answer drift and silent corruption. The article proposes using...

Intelligence and Interoperability: Data Catalog Must-Haves for AI Data Governance
Enterprises must move beyond static data catalogs toward a universal AI catalog that combines a business‑friendly semantic layer with cross‑platform interoperability. The semantic layer supplies machine‑readable context, preventing misinterpretations by AI agents, while universal interoperability ensures governance, security, and metadata...

Databricks, Accenture Launch Joint Business Venture Focused On Spurring AI Development
Databricks and Accenture have launched the Accenture Databricks Business Group, a joint venture designed to accelerate enterprise adoption of the Databricks Data Intelligence Platform for AI and data workloads. Backed by more than 25,000 Databricks‑trained professionals, the group will help...

Agentic AI Is Forcing Analytics and Operations to Converge
Investments in data platforms have shifted from siloed warehouses to unified, sovereign foundations as agentic AI collapses analytics, operations, and AI into single workflows. Enterprises now need platforms that govern operational execution, high‑concurrency analytics, and AI reasoning together, rather than...
Better Cotton Funds On-Farm Data-Collecting Project
The Better Cotton Initiative (BCI) is launching a $200,000 on‑farm data‑collection effort in partnership with the Soil Health Institute and ag‑tech provider Growers Guide. The program will analyze soil, plant tissue and sap samples across the Southeast and other Cotton Belt...

Big Changes in Latest GigaOm Unstructured Data Management Radar Report
GigaOm released version 6 of its Unstructured Data Management Radar, expanding the vendor set to 23 and appointing James Brown as the new analyst. The report reclassifies 11 suppliers as leaders and 12 as challengers, with notable moves such as Panzura shifting...
Noémi Ványi: We Skipped the OLAP Stack and Built Our Data Warehouse in Vanilla Postgres
Xata built a product analytics warehouse using vanilla Postgres, consolidating identity, usage, billing, and event data from four separate systems. They employed materialized views, pg_cron schedules, and database branches to flatten JSONB events, refresh data daily, and iterate safely on...
Visualizing the World with Planetary Computer
Microsoft’s Planetary Computer offers a free, standards‑based geospatial data platform that aggregates curated datasets from government, academic and commercial sources. It provides STAC‑compatible APIs, Python and R SDKs, and an Explorer UI for rapid prototyping of environmental applications such as...

Coles Sets up Standard Data Streaming Platform Groupwide
Coles Group has deployed an enterprise‑wide data streaming platform built on Confluent Cloud, unifying its real‑time data pipelines under a single Apache Kafka foundation. Previously, isolated event‑streaming stacks created silos, inconsistent models, and governance challenges. The new "enterprise event platform"...
IBM, Nvidia Tackle AI Data Woes
IBM expanded its partnership with Nvidia at GTC 2026 to address enterprise AI data management challenges. The collaboration integrates Nvidia’s cuDF toolkit with IBM’s Presto query engine and adds Nemotron models to IBM’s Docling PDF reader. Nvidia GPUs will also power...