Confessions of a Data Guy

Confessions of a Data Guy

Publication
0 followers

Practitioner’s takes on data engineering practice and careers

Data Engineering Career Path: From Circuits to Pipelines
NewsJan 30, 2026

Data Engineering Career Path: From Circuits to Pipelines

The article maps a data‑engineering career trajectory that begins with hardware‑oriented roles and ends in building scalable data pipelines. It highlights how circuit‑design thinking translates into logical data modeling, while emphasizing the need to acquire SQL, Python, and cloud‑native tools....

By Confessions of a Data Guy
Apache Airflow vs Databricks Lakeflow | The Orchestration Battle
NewsJan 30, 2026

Apache Airflow vs Databricks Lakeflow | The Orchestration Battle

The article pits Apache Airflow, the open‑source workflow orchestrator, against Databricks Lakeflow, a newer Lakehouse‑native pipeline engine. It outlines core differences in architecture, integration depth with cloud data platforms, and pricing models. Airflow remains favored for heterogeneous environments, while Lakeflow...

By Confessions of a Data Guy
This One Polars Pattern Makes Code 10x Cleaner
NewsJan 30, 2026

This One Polars Pattern Makes Code 10x Cleaner

The article highlights a single Polars pattern—using the pipe operator—to streamline data‑frame code, cutting boilerplate and boosting readability up to tenfold. By chaining transformations in a lazy execution graph, developers avoid intermediate variables and gain clearer, more maintainable pipelines. The...

By Confessions of a Data Guy
Apache Arrow ADBC Database Drivers
NewsJan 16, 2026

Apache Arrow ADBC Database Drivers

Apache Arrow’s ADBC (Arrow Database Connectivity) introduces a modern, columnar‑native driver that can replace or complement traditional ODBC/JDBC stacks. By moving Arrow RecordBatches end‑to‑end, it eliminates row‑by‑row marshaling and dramatically reduces serialization overhead. Benchmarks show Python ADBC achieving roughly 275 k...

By Confessions of a Data Guy