VuTrinh (Substack)

VuTrinh (Substack)

Publication
0 followers

Weekly curated data engineering resources and deep dives

(Video) What Is Apache Spark?
PodcastMar 26, 20260 min

(Video) What Is Apache Spark?

The episode traces the evolution from Google’s MapReduce model to Apache Spark, explaining how Spark’s in‑memory processing and the Resilient Distributed Dataset (RDD) abstraction overcome MapReduce’s limitations for iterative and interactive workloads. It breaks down Spark’s core concepts—transformations vs. actions,...

By VuTrinh (Substack)
VuTrinh (Substack) | Pulse