Ghost in the data - Latest News and Information
Ghost in the data

Independent blog on data platforms, cost, and pragmatic approaches

Recent Posts

Healing Tables: When Day-by-Day Backfills Become a Slow-Motion Disaster
News • Feb 6, 2026

A data engineering team discovered that a three-year SCD Type 2 backfill, executed day by day, had produced 47,000 overlapping records, timeline gaps, and unrecoverable errors. In response, the author introduces "Healing Tables," a framework that separates change detection from period construction and rebuilds the dimension in a single deterministic pass. The six-step pipeline extracts effectivity points, generates time slices, joins multiple sources, computes hashes, compresses rows, and validates temporal integrity. This turns a fragile incremental process into a fast, reproducible rebuild that can also repair historical inconsistencies.

By Ghost in the data
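
To make the six steps in the summary above concrete, here is a minimal sketch in plain Python. It is not the author's implementation: the sample sources (`crm`, `billing`), the function names, the half-open `[valid_from, valid_to)` convention, and the `9999-12-31` high date are all assumptions chosen for illustration.

```python
import hashlib
from datetime import date

HIGH_DATE = date(9999, 12, 31)  # assumed open-ended marker for the current row

# Hypothetical change logs: (business_key, effective_date, attributes)
crm = [
    ("c1", date(2023, 1, 1), {"tier": "silver"}),
    ("c1", date(2023, 6, 1), {"tier": "gold"}),
    ("c1", date(2023, 9, 1), {"tier": "gold"}),  # re-sent record, no real change
]
billing = [("c1", date(2023, 3, 1), {"plan": "monthly"})]


def as_of(events, key, when):
    """Return the latest attributes for `key` effective on or before `when`."""
    best = None
    for k, eff, attrs in events:
        if k == key and eff <= when and (best is None or eff >= best[0]):
            best = (eff, attrs)
    return dict(best[1]) if best else {}


def row_hash(attrs):
    """Hash the attribute values in a stable key order."""
    payload = "|".join(f"{name}={attrs[name]}" for name in sorted(attrs))
    return hashlib.sha256(payload.encode()).hexdigest()


def rebuild(sources):
    """Rebuild the whole dimension in one pass instead of day by day."""
    rows = []
    for key in sorted({k for src in sources for k, _, _ in src}):
        # Step 1: effectivity points = every date on which any source changed.
        points = sorted({eff for src in sources for k, eff, _ in src if k == key})
        # Step 2: time slices between consecutive points, the last one open-ended.
        slices = zip(points, points[1:] + [HIGH_DATE])
        key_rows = []
        for start, end in slices:
            # Step 3: join the sources as of the start of the slice.
            attrs = {}
            for src in sources:
                attrs.update(as_of(src, key, start))
            # Step 4: hash the joined attributes to detect real changes.
            digest = row_hash(attrs)
            # Step 5: compress adjacent slices whose hash is identical.
            if key_rows and key_rows[-1]["hash"] == digest:
                key_rows[-1]["valid_to"] = end
            else:
                key_rows.append({"key": key, "valid_from": start,
                                 "valid_to": end, "hash": digest, **attrs})
        rows.extend(key_rows)
    return rows


def validate(rows):
    """Step 6: per key, periods must be non-empty, contiguous, and end open."""
    by_key = {}
    for row in rows:
        by_key.setdefault(row["key"], []).append(row)
    for key, key_rows in by_key.items():
        for row in key_rows:
            assert row["valid_from"] < row["valid_to"], f"empty period for {key}"
        for prev, cur in zip(key_rows, key_rows[1:]):
            assert prev["valid_to"] == cur["valid_from"], f"gap or overlap for {key}"
        assert key_rows[-1]["valid_to"] == HIGH_DATE, f"no current row for {key}"


dimension = rebuild([crm, billing])
validate(dimension)
for row in dimension:
    print(row["key"], row["valid_from"], row["valid_to"], row.get("tier"), row.get("plan"))
```

With this sample data the four effectivity points collapse into three periods, because the re-sent CRM record on 2023-09-01 hashes the same as the slice before it. Since every period is derived from the full set of change dates at once, running the rebuild again on the same inputs yields the same rows, which is the property a day-by-day backfill lacks.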