Ghost in the data

Ghost in the data

Publication
0 followers

Independent blog on data platforms, cost, and pragmatic approaches

Stop Building Salesforce Integrations From Scratch
NewsApr 3, 2026

Stop Building Salesforce Integrations From Scratch

Engineers often build custom Salesforce‑to‑warehouse pipelines, but frequent schema changes, API limits, and hidden failures turn maintenance into a monthly time sink. Snowflake’s OpenFlow connector automates schema detection and runs as a native, managed service within Snowflake, eliminating the need...

By Ghost in the data
You Don't Need Permission to Fix Your Data
NewsMar 20, 2026

You Don't Need Permission to Fix Your Data

A junior engineer named Sam quietly added data quality tests to a warehouse model, illustrating that fixing data doesn’t require formal permission. The article argues that data quality problems cost enterprises billions and consume a large share of engineers' time....

By Ghost in the data
You Don't Need Permission to Fix Your Data
NewsMar 6, 2026

You Don't Need Permission to Fix Your Data

The article argues that data quality improvements don’t require top‑down mandates; engineers can start fixing messy source data by writing tests, documenting issues, and building simple dashboards. By turning test failures into evidence, teams persuade source‑system owners to add validation,...

By Ghost in the data
Healing Tables: When Day-by-Day Backfills Become a Slow-Motion Disaster
NewsFeb 6, 2026

Healing Tables: When Day-by-Day Backfills Become a Slow-Motion Disaster

A data engineering team discovered that a three‑year SCD Type 2 backfill executed day‑by‑day produced 47,000 overlapping records, timeline gaps, and unrecoverable errors. The author introduced "Healing Tables," a framework that separates change detection from period construction and rebuilds the dimension in...

By Ghost in the data
Ghost in the data | Pulse