Digital Design & Comp. Arch: L19: SIMD Architectures (Spring 2026)

Onur Mutlu Lectures
Onur Mutlu LecturesMay 1, 2026

Why It Matters

SIMD underpins the performance gains of modern AI accelerators, so mastering its concepts is essential for hardware designers and businesses seeking competitive compute efficiency.

Key Takeaways

  • SIMD exploits data-level parallelism for massive array operations.
  • GPUs and TPUs rely heavily on SIMD and decoupled execution.
  • Flynn taxonomy classifies SIMD as SISD, SIMD, MISD, MIMD categories.
  • Array processors parallelize across space; vector processors pipeline across time.
  • Vector registers store multiple elements, enabling efficient SIMD instruction streams.

Summary

The lecture introduces single‑instruction‑multiple‑data (SIMD) architectures, emphasizing their central role in today’s high‑performance computing, especially for machine‑learning workloads such as GPUs.

It reviews data‑level parallelism, explains Flynn’s taxonomy, and distinguishes array processors (space‑parallel) from vector processors (time‑parallel). The discussion highlights vector registers that hold multiple elements and the pipeline behavior that drives throughput.

Examples include GPUs as classic SIMD engines and Google’s TPU, which combines SIMD with decoupled access‑execute. A sample code sequence (load, add, multiply, store) illustrates how an array processor executes all stages simultaneously, while a vector processor pipelines them across cycles.

Understanding SIMD fundamentals helps architects design accelerators that maximize parallel throughput while managing serial bottlenecks such as reductions, a critical factor for future AI and high‑performance systems.

Original Description

Digital Design and Computer Architecture, ETH Zürich, Spring 2026 (https://safari.ethz.ch/ddca/spring2026/)
Lecture 19: SIMD Architectures
Lecturer: Dr. Mohammad Sadrosadati and Prof. Onur Mutlu
Date: 30 April 2026
L19: SIMD Architectures
Recommended Reading:
====================
A Modern Primer on Processing in Memory
Memory-Centric Computing: Solving Computing's Memory Problem
Memory-Centric Computing: Recent Advances in Processing-in-DRAM
Intelligent Architectures for Intelligent Computing Systems
RowHammer: A Retrospective
Fundamentally Understanding and Solving RowHammer
Accelerating Genome Analysis via Algorithm-Architecture Co-Design
From Molecules to Genomic Variations: Accelerating Genome Analysis via Intelligent Algorithms and Architectures
RECOMMENDED LECTURE VIDEOS & PLAYLISTS:
========================================
Digital Design and Computer Architecture Spring 2025 Livestream Lectures Playlist:
Fundamentals of Computer Architecture Fall 2025 Livestream Lectures Playlist:
Seminar in Computer Architecture Spring 2025 Livestream Lectures Playlist:
Computer Architecture Fall 2024 Lectures Playlist:
Interview with Professor Onur Mutlu:
TCuARCH meets Prof. Onur Mutlu
Arch. Mentoring Workshop @ISCA'21 - Doing Impactful Research
The Story of RowHammer Lecture:
Accelerating Genome Analysis Lecture:
Memory-Centric Computing Systems Tutorial at IEDM 2021:
Intelligent Architectures for Intelligent Machines Lecture:
Featured Lectures:

Comments

Want to join the conversation?

Loading comments...