
Snowflake Posts 30% Revenue Surge, Record Bookings
Snowflake prints 30% revenue growth and record bookings momentum $SNOW delivered Q4 revenue of $1.28B (+30% YoY) with product revenue at $1.23B (+30%) and EPS of $0.32, beating estimates by 23%. Growth is re-accelerating at scale. RPO climbed to $9.77B (+42%), billings reached $2.21B (+39%), and net new ARR jumped 59% YoY. Enterprise demand looks firm, highlighted by a $400M+ deal and seven nine-figure contracts in the quarter. Investing

Data Skills Trump AI Hype in Modern Strategies
The debate: which AI model to bet on. What I'm watching instead: Informatica surging. Java everywhere. GCP back on the shortlist. 73% of companies investing more in data skills. Not AI skills. Your AI strategy is only as strong as the data layer...
Treat Data as Assets, Not Just Jobs.
Data Product: why do we need this data? Data Asset: what is this data and how do we maintain it? I've found the most practical definition of data products comes from Dagster's software-defined assets. Each asset has a clear definition, dependencies, and...

Real‑time Data Powers HCF’s Analytics Transformation
Recently I caught up with Amiet Dhagat, Head of Data Services Analytics and AI for HCF, to get the inside story around their stellar success in transition from static data to dynamic data, and why real-time data has been so...
AI Boom Fuels Data Demand for Small Innovators
SONAR is having the best quarter in years. One of my board members asked if I were concerned about the rise of AI and if it would be a drain our data business. I told her that we are seeing...
Open‑Data Slack Clone Will Trigger Mass Migration
Slack will be the Waterloo of open vs closed data. Someone is going to make a slack clone where you get unfettered access to your own data, and people really will switch en masse.
Privacy Must Be Built Into AI Data Workflows
RT High-level policies aren't enough. It's time for audits, training, DSPM, and privacy-by-design in AI workflows. If privacy isn't built into how data moves, you're hoping - not leading. #DataGovernance #AI #CIO @Star_CIO https://t.co/Naq82FuMWZ
Managed Iceberg Lets Providers Own the Metadata Control Plane
Why are hyperscalers racing to offer managed Iceberg? Because whoever controls the catalog controls the ecosystem. If your tables are in a managed Iceberg service, you can query them from any engine - Spark, Trino, DuckDB, whatever. But your metadata stays with...
Separate Storage and Compute Redefines Modern Databases
Why we're excited about Lakebase GA: most database services are based on outdated assumptions leading to poor operability, scalability and devex.
Data Governance: The Crucial Anchor Amid Conflicting Sources
Every data conversation I tracked this week led back to the same concept. Data Governance. Not as compliance. As the center of gravity. 47 data sources. 6 different owners. 3 definitions of "customer." Your AI agent has no idea who to believe. https://t.co/ZrW6iptSqs

Choosing Optional Catalogs for Open Table Formats
Are we still using table catalogs for open table formats? Haven't heard too much lately. I like OTFs, but making it non-optional to have a catalog isn't great. That's why I like prefer the option to use one without. But if you...

Enterprise Data Stack Becomes AI Delivery Engine
Everyone’s watching which LLM wins benchmarks. I’m watching Informatica suddenly show up everywhere. GCP coverage up 200%. The enterprise data stack isn’t being replaced by AI. It’s becoming the delivery mechanism for AI. The boring infrastructure is now the moat. https://t.co/9dxQv330NZ

Data Execution Gap Persists Despite Better BI Tools
70% of executives say they have difficulty acting on data. Meanwhile, Power BI just won the 2025 Gartner Magic Quadrant. Again. The tools keep getting better. The problem isn’t the tools. It never was. Source: https://t.co/KNtNLIRTOQ https://t.co/ZOAPhWZKSf
One-Line Code Swap Moves Cassandra to Spanner, Auto-Embeds in BigQuery
Simple is good. One-line code change to switch from Apache Cassandra to a @googlecloud Spanner database. https://t.co/2n6AJutoNM Generate embeddings automatically for @googlecloud BigQuery table. https://t.co/SqIQzawOvt https://t.co/zWknasRT6r
Exploring Emerging Git‑Style Tools for Data Management
Git for data is still underexplored, and it is an area that is changing so fast. That's why we look at actual tools/features that showcase how to apply a Git-like workflow for data. I compared Git-like tools for data I could...

Flowrs: New TUI for Managing Airflow Jobs
A TUI for managing Airflow jobs? Something like k9s? Flowrs seems to be just that - haven't tried yet, but looks really cool. Will try next time I have to use Airflow :) https://github.com/jvanbuel/flowrs
AI Writes SQL, but Fundamentals Lift Your Ceiling
Yes, AI can write the SQL. But do you understand: * Why that join works? * Why that model makes sense? * Why that metric matters? AI lowers the barrier. Foundations raise your ceiling.
Skip Semantic Layer Early; Use Native Metrics First
Controversial opinion: don't start with a semantic layer. A semantic layer makes sense when: - You have multiple consumers (BI, notebooks, apps) - KPIs are defined inconsistently across teams - You need a universal API for metrics If you're early stage with one BI tool,...

From Weeks to Minutes: Streamlined AI/DS Workflow
❌Most data science projects take 4 weeks because of meetings, reruns, and handoffs between teams ✅A good AI/DS workflow compresses it to ~15 minutes. I’m demo-ing how to do it live (free): https://learn.business-science.io/registration-ai-workshop-2
Platform Wars Hinge on Owning the Stack’s Central Node
Salesforce is now bridging four domains at once: Salesforce Implementation (CRM) Databricks (data lake) Agentforce (AI agents) Data 360 (data platform) The platform wars are not about features. They are about who owns the most connected node in your stack.
Rust Powers Python's Data Engineering, Not Replaces It
Will Rust kill Python in data engineering? No. But it has already consumed much of the JavaScript tooling ecosystem. And it's quietly doing the same in data. The pattern: Python remains the interface, Rust becomes the engine. Polars, DataFusion, DuckDB's internals - all Rust...

Choose the Right SQL Ranking Function to Avoid Misleading Gaps
ROW_NUMBER(), RANK(), DENSE_RANK(). Three functions, three different behaviors. Pick the wrong one and your rankings mislead. Here are 4 patterns to get it right: - ranking with gaps vs without - top-N per category - deduplication - running totals 1. ROW_NUMBER() vs RANK() vs DENSE_RANK() Three functions, three behaviors...
Bridging the Data Integrity Gap for Reliable Insights
The Data Integrity Gap: From “Big Data” to “Reliable Physics”.. click to learn everything you need to know about issues you likely don't know you have or will soon have in your organisation.. https://t.co/LrOOv5lGcm
Automatic Tenant Isolation Built Into Nile by Default
This is a common problem and one of our biggest motivations in building Nile - to isolate tenants automatically and by default.
Decentralized Back‑Office, Unified Analytics Layer Wins
Centralizing analytics on a single platform? Not happening. The focus is on decentralized back-office systems and a common analytics layer for daily visualization. #Analytics #Strategy #BusinessTech https://t.co/7ObAL6iVQ5
Lakehouse Surge Shows Data Infrastructure Beats AI Hype
AGI is in the noise bucket this week. Lakehouse architecture? Up 400%. While the industry debates the AI endgame, data infrastructure quietly becomes non-negotiable. The boring skills win again.
Analyst Ads Overpromise Python; Excel, SQL Dominate Daily
Unpopular opinion: Data analyst job postings ask for Python. Data analyst jobs don't actually use Python. What you'll use daily: Excel — every single day SQL — every single day Power BI or Tableau — multiple times per week Python — maybe once a month This pattern holds...
Building a Proprietary Data Fusion Layer This Weekend
big fan of ontology btw, but noted. building the proprietary data fusion layer this weekend 🫡
Browse S3 Files Locally in One Fast Command
I quickly recorded how easily and conveniently it is to browse S3 files locally with a single command, blazingly fast. Even preview works with DuckDB integration. https://youtu.be/cimUvBd_9Ns
Non‑engineers Building Tools Strain Professional Developers
I struggle with the phrase “everyone’s a coder now.” And I hesitate to post because I don’t want you to read this as gatekeeping. If anything, I want more people to build, but in a stronger, more functional way. Building any...

Replace If‑elif Chains with Clean Python Dispatch Patterns
The more if-elif chains you write, the harder your code gets to change. Python has cleaner patterns for this. Here are 4 worth knowing: - dictionary dispatch - guard clauses - match/case - conditional expressions 1. Dictionary dispatch. Replace long equality checks with a dict. Constant-time lookup. No branching....

BI Dashboards Are Dying; Prepare for the Next Wave
RIP BI Dashboards. Tools like Tableau and PowerBI are about to become extinct. This is what's coming (and how to prepare):
Kickstart Your Data Career with Our Free Guide
Aspiring Data Processionals Excel, SQL and PBI are great tools to build projects with. If you're completely confused, start with my Guide 👇🏾 https://tekdlin.com/data-analytics-guide/
Ask the Problem First, Then Match Tools
This is an interesting thread. Everyone is suggesting tools to solve the problem. I’d start by asking more about the data and the questions the customer is trying to answer or problems they are trying to solve first before recommending...
Data Compliance for B2B SaaS: Navigating Hidden Complexity
Yesterday I was talking with another founder about ensuring their B2B SaaS product is data compliant. There’s so much complexity behind meeting required standards.
Free Resources Replace Costly Bootcamps—Discipline Is Key
You don't need a bootcamp to become a data analyst. Everything you need is free: Excel/SQL/Python/Power BI tutorials — YouTube SQL practice — SQLZoo, LeetCode, HackerRank Datasets for portfolio projects — Kaggle, data.gov, Google Dataset Search Resume feedback — Reddit (r/datascience, r/resumes), LinkedIn communities Interview prep...
Use Exponential Backoff with Jitter for Effective Retries
Not all retries are created equal. Immediate retry: usually fails again Exponential backoff: gives systems time to recover Exponential backoff with jitter: prevents thundering herd Most orchestrators have this built in. But you need to understand what's happening or you'll wonder why your retries...
Perfect Salesforce Data: My Superintelligence Benchmark
Test for superintelligence: when the data in Fivetran’s salesforce is 100% accurate and up to date at all times, I’ll know we’re there.
Semantic Layer: Serve Data Like a Menu, Hide Complexity
The semantic layer is like a restaurant menu: you know what you're ordering, but not how it's made. This analogy comes from Maxime Beauchemin and I think it's perfect. Users shouldn't need to understand your star schema to calculate revenue. They should...
Analyze Global Data with One BigQuery Query
You've got data spread across geographies. What happens when you want to bring that data together? Usually ETL jobs or other mechanisms. We just launched @googlecloud BigQuery global queries. Do multi-location analysis with a single query: https://t.co/F3p2mn5SjZ
Data’s Objectivity Is an Illusion; Human Choices Shape It
Data is objective only in appearance. Behind every dataset lies a human decision about what to measure
Data Quality & Governance: Underrated Foundations Beyond Analytics
Data Quality and Data Governance are two of the most underrated but important areas in the data space There are other areas to explore in data outside of Analytics.
Pivot Tables: Business Data’s Everlasting REPL
Hot take: Pivot tables are the REPL for business data. Just like programmers use REPLs to quickly test code, business users use pivot tables to quickly test hypotheses about their data. Drag a field. See a result. Adjust. Repeat. This feedback loop is...

Own Your Data, Not Just Model Tuning
AI teams love tuning models. But they ignore the bike chain: data. Outsourcing labeling to people that care much less on the app’s success. Messy internal docs. No structured knowledge base. No call transcripts. No clean SOPs. Then they ask: “Why isn’t the model improving?” The highest ROI in...
Master Six Core Concepts to Decode Regression Results
Most analysts can run a regression. Very few can explain what the output actually means. That gap is a statistics fundamentals problem. Not a tools problem. Not a Python problem. Not a years-of-experience problem. If you can't explain what your numbers mean, you...

Four Data‑Backed Tech Fields to Pursue in 2026
I did some digging with the help of ChatGPT and Claude Here are 4 tech areas you can still explore in 2026 backed by data: • AI/ML – Data Analytics falls here • Cloud & Infrastructure • Security & Governance • Data Engineering Let me...

Decode Common SQL Errors and Their Real Fixes
Common SQL errors and what they REALLY mean "Column ambiguously defined" You joined tables with the same column name. Fix: Add table aliases (customers.id not just id) "Not a single-group group function" You mixed aggregated + non-aggregated columns. Fix: Add all non-aggregated columns to GROUP BY "Division...
Start with Excel, SQL, Power BI for Analytics
Aspiring Data Analyst? Don’t overcomplicate it. Start building projects with tools like Excel, SQL, and Power BI.
Data Guides Positioning, Yet Quality Creativity Wins
Working in entertainment analytics I am often asked how best to position a title for success. But data can help you aim more accurately and efficiently. What it can’t do is provide the single most important element to success: a...
Integrate Data Quality Assertions Directly Into Orchestration
I see data contracts and data quality as overlapping but different: Data contracts: what is the data and how do we enforce it Data products: why do we need this data In practice, I'd argue for asset-based data quality assertions. Every time a...