VuTrinh (Substack) - Latest News and Information
VuTrinh (Substack)

Publication
0 followers

Weekly curated data engineering resources and deep dives

Parquet Fundamentals in 3 Mins
Podcast•Apr 3, 2026•0 min

The episode explains how Apache Parquet’s hybrid columnar‑row format optimizes storage and query performance for large datasets. It contrasts row‑wise and pure columnar layouts, highlighting the inefficiencies of each, and then describes Parquet’s structure of row groups, column chunks, and pages, along with its self‑describing metadata that enables column pruning and efficient reads. The host also notes Parquet’s origins at Twitter and Cloudera and points listeners to a newsletter for deeper data‑engineering content.
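
The hybrid layout described in this summary can be sketched in a few lines of plain Python. This is a toy illustration under stated assumptions, not Parquet's actual on-disk format: the function names `to_row_groups` and `read_columns` and the sample records are hypothetical, and pages, encodings, compression, and file metadata are left out entirely. It only shows the two ideas named above: rows are first split horizontally into row groups, and inside each group the values of one column are stored together as a column chunk, so a reader can skip the columns it does not need.

```python
from typing import Any


def to_row_groups(rows: list[dict[str, Any]], row_group_size: int) -> list[dict[str, list[Any]]]:
    """Split rows horizontally into row groups, then lay out each group column by column."""
    groups = []
    for start in range(0, len(rows), row_group_size):
        chunk = rows[start:start + row_group_size]
        # Inside a row group, each column's values sit together (a "column chunk").
        columns = {name: [row[name] for row in chunk] for name in chunk[0]}
        groups.append(columns)
    return groups


def read_columns(groups: list[dict[str, list[Any]]], wanted: list[str]) -> dict[str, list[Any]]:
    """Column pruning: touch only the requested column chunks in every row group."""
    result: dict[str, list[Any]] = {name: [] for name in wanted}
    for group in groups:
        for name in wanted:
            result[name].extend(group[name])
    return result


# Hypothetical sample records, used only for illustration.
rows = [
    {"id": 1, "city": "Hanoi", "amount": 10.0},
    {"id": 2, "city": "Hue", "amount": 12.5},
    {"id": 3, "city": "Hanoi", "amount": 7.0},
    {"id": 4, "city": "Da Nang", "amount": 3.5},
    {"id": 5, "city": "Hue", "amount": 9.0},
]

groups = to_row_groups(rows, row_group_size=2)
amounts = read_columns(groups, ["amount"])
```

With a row group size of 2, the five records end up in three row groups, and reading `amount` never touches the `id` or `city` chunks. A real Parquet reader achieves the same effect by consulting the file's metadata to locate the column chunks it needs.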

By VuTrinh (Substack)
(Video) What Is Apache Spark?
Podcast•Mar 26, 2026•0 min

The episode traces the evolution from Google’s MapReduce model to Apache Spark, explaining how Spark’s in‑memory processing and the Resilient Distributed Dataset (RDD) abstraction overcome MapReduce’s limitations for iterative and interactive workloads. It breaks down Spark’s core concepts—transformations vs. actions,...
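
The distinction between transformations and actions mentioned in this summary can be illustrated with a small pure-Python sketch. It is a toy model under stated assumptions and does not use or reproduce the Spark API: the `LazyDataset` class and its methods are hypothetical names chosen for this example, and there is no distribution, partitioning, or fault tolerance. The point is only that transformations record work to be done, while an action triggers the actual computation.

```python
from typing import Any, Callable, Iterable, Iterator


class LazyDataset:
    """Toy stand-in for an RDD (hypothetical class, not the Spark API)."""

    def __init__(self, source: Iterable[Any], ops: tuple = ()) -> None:
        self._source = source
        self._ops = ops  # recorded transformations, not yet executed

    # Transformations: return a new dataset with one more recorded step.
    def map(self, fn: Callable[[Any], Any]) -> "LazyDataset":
        return LazyDataset(self._source, self._ops + (("map", fn),))

    def filter(self, pred: Callable[[Any], bool]) -> "LazyDataset":
        return LazyDataset(self._source, self._ops + (("filter", pred),))

    def _iterate(self) -> Iterator[Any]:
        # Replay every recorded step for each source element.
        for item in self._source:
            keep = True
            for kind, fn in self._ops:
                if kind == "map":
                    item = fn(item)
                elif not fn(item):
                    keep = False
                    break
            if keep:
                yield item

    # Actions: run the recorded steps and produce a concrete result.
    def collect(self) -> list:
        return list(self._iterate())

    def count(self) -> int:
        return sum(1 for _ in self._iterate())


calls = []  # records every time the mapped function actually runs


def square(x: int) -> int:
    calls.append(x)
    return x * x


numbers = LazyDataset(range(1, 6))
evens_squared = numbers.map(square).filter(lambda v: v % 2 == 0)
calls_before_action = len(calls)  # nothing has run yet
result = evens_squared.collect()  # the action triggers the computation
```

In this sketch, `square` is not called while the pipeline is being built; it runs only when `collect()` is invoked. Calling a second action such as `count()` replays the recorded steps from the source, which loosely mirrors how Spark recomputes an uncached RDD for each action.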

By VuTrinh (Substack)