Big Data Social Media and Updates

Data Skills Trump AI Hype in Modern Strategies
SocialFeb 26, 2026

Data Skills Trump AI Hype in Modern Strategies

The debate: which AI model to bet on. What I'm watching instead: Informatica surging. Java everywhere. GCP back on the shortlist. 73% of companies investing more in data skills. Not AI skills. Your AI strategy is only as strong as the data layer...

By Yves Mulkers
Treat Data as Assets, Not Just Jobs.
SocialFeb 26, 2026

Treat Data as Assets, Not Just Jobs.

Data Product: why do we need this data? Data Asset: what is this data and how do we maintain it? I've found the most practical definition of data products comes from Dagster's software-defined assets. Each asset has a clear definition, dependencies, and...

By SSP Data
Real‑time Data Powers HCF’s Analytics Transformation
SocialFeb 26, 2026

Real‑time Data Powers HCF’s Analytics Transformation

Recently I caught up with Amiet Dhagat, Head of Data Services Analytics and AI for HCF, to get the inside story around their stellar success in transition from static data to dynamic data, and why real-time data has been so...

By Dez Blanchfield
AI Boom Fuels Data Demand for Small Innovators
SocialFeb 26, 2026

AI Boom Fuels Data Demand for Small Innovators

SONAR is having the best quarter in years. One of my board members asked if I were concerned about the rise of AI and if it would be a drain our data business. I told her that we are seeing...

By Craig Fuller
Open‑Data Slack Clone Will Trigger Mass Migration
SocialFeb 26, 2026

Open‑Data Slack Clone Will Trigger Mass Migration

Slack will be the Waterloo of open vs closed data. Someone is going to make a slack clone where you get unfettered access to your own data, and people really will switch en masse.

By George Fraser
Privacy Must Be Built Into AI Data Workflows
SocialFeb 25, 2026

Privacy Must Be Built Into AI Data Workflows

RT High-level policies aren't enough. It's time for audits, training, DSPM, and privacy-by-design in AI workflows. If privacy isn't built into how data moves, you're hoping - not leading. #DataGovernance #AI #CIO @Star_CIO https://t.co/Naq82FuMWZ

By Isaac Sacolick
Managed Iceberg Lets Providers Own the Metadata Control Plane
SocialFeb 25, 2026

Managed Iceberg Lets Providers Own the Metadata Control Plane

Why are hyperscalers racing to offer managed Iceberg? Because whoever controls the catalog controls the ecosystem. If your tables are in a managed Iceberg service, you can query them from any engine - Spark, Trino, DuckDB, whatever. But your metadata stays with...

By SSP Data
Separate Storage and Compute Redefines Modern Databases
SocialFeb 25, 2026

Separate Storage and Compute Redefines Modern Databases

Why we're excited about Lakebase GA: most database services are based on outdated assumptions leading to poor operability, scalability and devex.

By Matei Zaharia
Data Governance: The Crucial Anchor Amid Conflicting Sources
SocialFeb 25, 2026

Data Governance: The Crucial Anchor Amid Conflicting Sources

Every data conversation I tracked this week led back to the same concept. Data Governance. Not as compliance. As the center of gravity. 47 data sources. 6 different owners. 3 definitions of "customer." Your AI agent has no idea who to believe. https://t.co/ZrW6iptSqs

By Yves Mulkers
Choosing Optional Catalogs for Open Table Formats
SocialFeb 25, 2026

Choosing Optional Catalogs for Open Table Formats

Are we still using table catalogs for open table formats? Haven't heard too much lately. I like OTFs, but making it non-optional to have a catalog isn't great. That's why I like prefer the option to use one without. But if you...

By SSP Data
Enterprise Data Stack Becomes AI Delivery Engine
SocialFeb 25, 2026

Enterprise Data Stack Becomes AI Delivery Engine

Everyone’s watching which LLM wins benchmarks. I’m watching Informatica suddenly show up everywhere. GCP coverage up 200%. The enterprise data stack isn’t being replaced by AI. It’s becoming the delivery mechanism for AI. The boring infrastructure is now the moat. https://t.co/9dxQv330NZ

By Yves Mulkers
Data Execution Gap Persists Despite Better BI Tools
SocialFeb 25, 2026

Data Execution Gap Persists Despite Better BI Tools

70% of executives say they have difficulty acting on data. Meanwhile, Power BI just won the 2025 Gartner Magic Quadrant. Again. The tools keep getting better. The problem isn’t the tools. It never was. Source: https://t.co/KNtNLIRTOQ https://t.co/ZOAPhWZKSf

By Yves Mulkers
One-Line Code Swap Moves Cassandra to Spanner, Auto-Embeds in BigQuery
SocialFeb 24, 2026

One-Line Code Swap Moves Cassandra to Spanner, Auto-Embeds in BigQuery

Simple is good. One-line code change to switch from Apache Cassandra to a @googlecloud Spanner database. https://t.co/2n6AJutoNM Generate embeddings automatically for @googlecloud BigQuery table. https://t.co/SqIQzawOvt https://t.co/zWknasRT6r

By Richard Seroter
Exploring Emerging Git‑Style Tools for Data Management
SocialFeb 24, 2026

Exploring Emerging Git‑Style Tools for Data Management

Git for data is still underexplored, and it is an area that is changing so fast. That's why we look at actual tools/features that showcase how to apply a Git-like workflow for data. I compared Git-like tools for data I could...

By SSP Data
Flowrs: New TUI for Managing Airflow Jobs
SocialFeb 24, 2026

Flowrs: New TUI for Managing Airflow Jobs

A TUI for managing Airflow jobs? Something like k9s? Flowrs seems to be just that - haven't tried yet, but looks really cool. Will try next time I have to use Airflow :) https://github.com/jvanbuel/flowrs

By SSP Data
AI Writes SQL, but Fundamentals Lift Your Ceiling
SocialFeb 23, 2026

AI Writes SQL, but Fundamentals Lift Your Ceiling

Yes, AI can write the SQL. But do you understand: * Why that join works? * Why that model makes sense? * Why that metric matters? AI lowers the barrier. Foundations raise your ceiling.

By Ebere Oyek (Nelo) — Data | AI | ML
Skip Semantic Layer Early; Use Native Metrics First
SocialFeb 23, 2026

Skip Semantic Layer Early; Use Native Metrics First

Controversial opinion: don't start with a semantic layer. A semantic layer makes sense when: - You have multiple consumers (BI, notebooks, apps) - KPIs are defined inconsistently across teams - You need a universal API for metrics If you're early stage with one BI tool,...

By SSP Data
From Weeks to Minutes: Streamlined AI/DS Workflow
SocialFeb 22, 2026

From Weeks to Minutes: Streamlined AI/DS Workflow

❌Most data science projects take 4 weeks because of meetings, reruns, and handoffs between teams ✅A good AI/DS workflow compresses it to ~15 minutes. I’m demo-ing how to do it live (free): https://learn.business-science.io/registration-ai-workshop-2

By Matt Dancho
Platform Wars Hinge on Owning the Stack’s Central Node
SocialFeb 22, 2026

Platform Wars Hinge on Owning the Stack’s Central Node

Salesforce is now bridging four domains at once: Salesforce Implementation (CRM) Databricks (data lake) Agentforce (AI agents) Data 360 (data platform) The platform wars are not about features. They are about who owns the most connected node in your stack.

By Yves Mulkers
Rust Powers Python's Data Engineering, Not Replaces It
SocialFeb 22, 2026

Rust Powers Python's Data Engineering, Not Replaces It

Will Rust kill Python in data engineering? No. But it has already consumed much of the JavaScript tooling ecosystem. And it's quietly doing the same in data. The pattern: Python remains the interface, Rust becomes the engine. Polars, DataFusion, DuckDB's internals - all Rust...

By SSP Data
Choose the Right SQL Ranking Function to Avoid Misleading Gaps
SocialFeb 22, 2026

Choose the Right SQL Ranking Function to Avoid Misleading Gaps

ROW_NUMBER(), RANK(), DENSE_RANK(). Three functions, three different behaviors. Pick the wrong one and your rankings mislead. Here are 4 patterns to get it right: - ranking with gaps vs without - top-N per category - deduplication - running totals 1. ROW_NUMBER() vs RANK() vs DENSE_RANK() Three functions, three behaviors...

By Karina | Python | Excel | Stats | DataScience | DataAnalytics
Bridging the Data Integrity Gap for Reliable Insights
SocialFeb 22, 2026

Bridging the Data Integrity Gap for Reliable Insights

The Data Integrity Gap: From “Big Data” to “Reliable Physics”.. click to learn everything you need to know about issues you likely don't know you have or will soon have in your organisation.. https://t.co/LrOOv5lGcm

By Dez Blanchfield
Automatic Tenant Isolation Built Into Nile by Default
SocialFeb 22, 2026

Automatic Tenant Isolation Built Into Nile by Default

This is a common problem and one of our biggest motivations in building Nile - to isolate tenants automatically and by default.

By Gwen (Chen) Shapira
Decentralized Back‑Office, Unified Analytics Layer Wins
SocialFeb 22, 2026

Decentralized Back‑Office, Unified Analytics Layer Wins

Centralizing analytics on a single platform? Not happening. The focus is on decentralized back-office systems and a common analytics layer for daily visualization. #Analytics #Strategy #BusinessTech https://t.co/7ObAL6iVQ5

By Eric Kimberling
Lakehouse Surge Shows Data Infrastructure Beats AI Hype
SocialFeb 21, 2026

Lakehouse Surge Shows Data Infrastructure Beats AI Hype

AGI is in the noise bucket this week. Lakehouse architecture? Up 400%. While the industry debates the AI endgame, data infrastructure quietly becomes non-negotiable. The boring skills win again.

By Yves Mulkers
Analyst Ads Overpromise Python; Excel, SQL Dominate Daily
SocialFeb 21, 2026

Analyst Ads Overpromise Python; Excel, SQL Dominate Daily

Unpopular opinion: Data analyst job postings ask for Python. Data analyst jobs don't actually use Python. What you'll use daily: Excel — every single day SQL — every single day Power BI or Tableau — multiple times per week Python — maybe once a month This pattern holds...

By Karina | Python | Excel | Stats | DataScience | DataAnalytics
Building a Proprietary Data Fusion Layer This Weekend
SocialFeb 20, 2026

Building a Proprietary Data Fusion Layer This Weekend

big fan of ontology btw, but noted. building the proprietary data fusion layer this weekend 🫡

By Bilawal Sidhu
Browse S3 Files Locally in One Fast Command
SocialFeb 20, 2026

Browse S3 Files Locally in One Fast Command

I quickly recorded how easily and conveniently it is to browse S3 files locally with a single command, blazingly fast. Even preview works with DuckDB integration. https://youtu.be/cimUvBd_9Ns

By SSP Data
Non‑engineers Building Tools Strain Professional Developers
SocialFeb 20, 2026

Non‑engineers Building Tools Strain Professional Developers

I struggle with the phrase “everyone’s a coder now.” And I hesitate to post because I don’t want you to read this as gatekeeping. If anything, I want more people to build, but in a stronger, more functional way. Building any...

By Allie Miller
Replace If‑elif Chains with Clean Python Dispatch Patterns
SocialFeb 20, 2026

Replace If‑elif Chains with Clean Python Dispatch Patterns

The more if-elif chains you write, the harder your code gets to change. Python has cleaner patterns for this. Here are 4 worth knowing: - dictionary dispatch - guard clauses - match/case - conditional expressions 1. Dictionary dispatch. Replace long equality checks with a dict. Constant-time lookup. No branching....

By Karina | Python | Excel | Stats | DataScience | DataAnalytics
BI Dashboards Are Dying; Prepare for the Next Wave
SocialFeb 20, 2026

BI Dashboards Are Dying; Prepare for the Next Wave

RIP BI Dashboards. Tools like Tableau and PowerBI are about to become extinct. This is what's coming (and how to prepare):

By Matt Dancho
Kickstart Your Data Career with Our Free Guide
SocialFeb 19, 2026

Kickstart Your Data Career with Our Free Guide

Aspiring Data Processionals Excel, SQL and PBI are great tools to build projects with. If you're completely confused, start with my Guide 👇🏾 https://tekdlin.com/data-analytics-guide/

By Ebere Oyek (Nelo) — Data | AI | ML
Ask the Problem First, Then Match Tools
SocialFeb 19, 2026

Ask the Problem First, Then Match Tools

This is an interesting thread. Everyone is suggesting tools to solve the problem. I’d start by asking more about the data and the questions the customer is trying to answer or problems they are trying to solve first before recommending...

By Teri Radichel
Data Compliance for B2B SaaS: Navigating Hidden Complexity
SocialFeb 19, 2026

Data Compliance for B2B SaaS: Navigating Hidden Complexity

Yesterday I was talking with another founder about ensuring their B2B SaaS product is data compliant. There’s so much complexity behind meeting required standards.

By Ebere Oyek (Nelo) — Data | AI | ML
Free Resources Replace Costly Bootcamps—Discipline Is Key
SocialFeb 19, 2026

Free Resources Replace Costly Bootcamps—Discipline Is Key

You don't need a bootcamp to become a data analyst. Everything you need is free: Excel/SQL/Python/Power BI tutorials — YouTube SQL practice — SQLZoo, LeetCode, HackerRank Datasets for portfolio projects — Kaggle, data.gov, Google Dataset Search Resume feedback — Reddit (r/datascience, r/resumes), LinkedIn communities Interview prep...

By Karina | Python | Excel | Stats | DataScience | DataAnalytics
Use Exponential Backoff with Jitter for Effective Retries
SocialFeb 19, 2026

Use Exponential Backoff with Jitter for Effective Retries

Not all retries are created equal. Immediate retry: usually fails again Exponential backoff: gives systems time to recover Exponential backoff with jitter: prevents thundering herd Most orchestrators have this built in. But you need to understand what's happening or you'll wonder why your retries...

By SSP Data
Perfect Salesforce Data: My Superintelligence Benchmark
SocialFeb 19, 2026

Perfect Salesforce Data: My Superintelligence Benchmark

Test for superintelligence: when the data in Fivetran’s salesforce is 100% accurate and up to date at all times, I’ll know we’re there.

By George Fraser
Semantic Layer: Serve Data Like a Menu, Hide Complexity
SocialFeb 18, 2026

Semantic Layer: Serve Data Like a Menu, Hide Complexity

The semantic layer is like a restaurant menu: you know what you're ordering, but not how it's made. This analogy comes from Maxime Beauchemin and I think it's perfect. Users shouldn't need to understand your star schema to calculate revenue. They should...

By SSP Data
Analyze Global Data with One BigQuery Query
SocialFeb 18, 2026

Analyze Global Data with One BigQuery Query

You've got data spread across geographies. What happens when you want to bring that data together? Usually ETL jobs or other mechanisms. We just launched @googlecloud BigQuery global queries. Do multi-location analysis with a single query: https://t.co/F3p2mn5SjZ

By Richard Seroter
Data’s Objectivity Is an Illusion; Human Choices Shape It
SocialFeb 18, 2026

Data’s Objectivity Is an Illusion; Human Choices Shape It

Data is objective only in appearance. Behind every dataset lies a human decision about what to measure

By Iain Brown
Data Quality & Governance: Underrated Foundations Beyond Analytics
SocialFeb 18, 2026

Data Quality & Governance: Underrated Foundations Beyond Analytics

Data Quality and Data Governance are two of the most underrated but important areas in the data space There are other areas to explore in data outside of Analytics.

By Ebere Oyek (Nelo) — Data | AI | ML
Pivot Tables: Business Data’s Everlasting REPL
SocialFeb 17, 2026

Pivot Tables: Business Data’s Everlasting REPL

Hot take: Pivot tables are the REPL for business data. Just like programmers use REPLs to quickly test code, business users use pivot tables to quickly test hypotheses about their data. Drag a field. See a result. Adjust. Repeat. This feedback loop is...

By SSP Data
Own Your Data, Not Just Model Tuning
SocialFeb 17, 2026

Own Your Data, Not Just Model Tuning

AI teams love tuning models. But they ignore the bike chain: data. Outsourcing labeling to people that care much less on the app’s success. Messy internal docs. No structured knowledge base. No call transcripts. No clean SOPs. Then they ask: “Why isn’t the model improving?” The highest ROI in...

By Louis Bouchard
Master Six Core Concepts to Decode Regression Results
SocialFeb 17, 2026

Master Six Core Concepts to Decode Regression Results

Most analysts can run a regression. Very few can explain what the output actually means. That gap is a statistics fundamentals problem. Not a tools problem. Not a Python problem. Not a years-of-experience problem. If you can't explain what your numbers mean, you...

By Karina | Python | Excel | Stats | DataScience | DataAnalytics
Four Data‑Backed Tech Fields to Pursue in 2026
SocialFeb 16, 2026

Four Data‑Backed Tech Fields to Pursue in 2026

I did some digging with the help of ChatGPT and Claude Here are 4 tech areas you can still explore in 2026 backed by data: • AI/ML – Data Analytics falls here • Cloud & Infrastructure • Security & Governance • Data Engineering Let me...

By Ebere Oyek (Nelo) — Data | AI | ML
Decode Common SQL Errors and Their Real Fixes
SocialFeb 16, 2026

Decode Common SQL Errors and Their Real Fixes

Common SQL errors and what they REALLY mean "Column ambiguously defined" You joined tables with the same column name. Fix: Add table aliases (customers.id not just id) "Not a single-group group function" You mixed aggregated + non-aggregated columns. Fix: Add all non-aggregated columns to GROUP BY "Division...

By Karina | Python | Excel | Stats | DataScience | DataAnalytics
Start with Excel, SQL, Power BI for Analytics
SocialFeb 16, 2026

Start with Excel, SQL, Power BI for Analytics

Aspiring Data Analyst? Don’t overcomplicate it. Start building projects with tools like Excel, SQL, and Power BI.

By Ebere Oyek (Nelo) — Data | AI | ML
Data Guides Positioning, Yet Quality Creativity Wins
SocialFeb 16, 2026

Data Guides Positioning, Yet Quality Creativity Wins

Working in entertainment analytics I am often asked how best to position a title for success. But data can help you aim more accurately and efficiently. What it can’t do is provide the single most important element to success: a...

By Brandon Katz
Integrate Data Quality Assertions Directly Into Orchestration
SocialFeb 16, 2026

Integrate Data Quality Assertions Directly Into Orchestration

I see data contracts and data quality as overlapping but different: Data contracts: what is the data and how do we enforce it Data products: why do we need this data In practice, I'd argue for asset-based data quality assertions. Every time a...

By SSP Data