Accurate branch prediction and efficient systolic‑array designs are critical for maximizing CPU throughput and accelerating matrix‑intensive workloads such as AI inference.
The video walks through a textbook‑style problem on branch prediction and then shifts to designing a systolic array for matrix multiplication, illustrating two core concepts in computer architecture.
It defines locally correlated branches—branches whose own outcome history predicts their next outcome—and shows that only the outer for‑loop branch (B1) meets this criterion, because its iteration count is deterministic. Globally correlated branches are identified by mathematical relationships among their conditions: if the "multiple of six" branch (B4) is taken, the "multiple of two" (B2) and "multiple of three" (B3) branches must also be taken, and conversely B2 and B3 taken together imply B4, establishing a bidirectional correlation.
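The correlation among the three conditions is easy to check exhaustively. The sketch below is illustrative (the function name and the uniform 1–6 domain are assumptions based on the problem setup described above):

```python
# Hypothetical sketch of the three branch conditions described above,
# assuming the loop body tests a value x drawn uniformly from 1-6.
def branch_outcomes(x):
    b2 = x % 2 == 0  # B2: "multiple of two" branch taken?
    b3 = x % 3 == 0  # B3: "multiple of three" branch taken?
    b4 = x % 6 == 0  # B4: "multiple of six" branch taken?
    return b2, b3, b4

# Verify the bidirectional correlation over every possible value:
# B4 taken implies B2 and B3 taken, and B2 and B3 together imply B4.
for x in range(1, 7):
    b2, b3, b4 = branch_outcomes(x)
    assert b4 == (b2 and b3)
```

Because the implication runs both ways, a predictor that observes B2 and B3 can predict B4 perfectly, and vice versa.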
The instructor demonstrates a two‑bit global history register feeding a four‑entry pattern history table, updating each counter by +1 for a taken branch and −1 for a not‑taken one, and works through the expected counter value after 120 iterations under a uniform random draw from 1–6. He then derives the processing‑element equations for a systolic array—P = M (pass M through), Q = N (pass N through), and R ← R + M·N (accumulate)—showing how to populate 30 input slots to compute a 3×3 matrix product.
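The predictor structure can be sketched in a few lines. This is a minimal model, not the video's exact scheme: the class name, the unbounded counters, and the choice of B3 as the tracked branch are assumptions for illustration.

```python
import random

# Hedged sketch of the predictor described above: a two-bit global
# history register (the last two branch outcomes) indexes a four-entry
# pattern history table; each entry is a counter updated by +1 on a
# taken branch and -1 on a not-taken branch.
class GlobalHistoryPredictor:
    def __init__(self):
        self.history = 0          # two-bit global history, values 0-3
        self.pht = [0, 0, 0, 0]   # four-entry pattern history table

    def predict(self):
        # Predict taken when the indexed counter is non-negative.
        return self.pht[self.history] >= 0

    def update(self, taken):
        self.pht[self.history] += 1 if taken else -1
        # Shift the new outcome into the two-bit history register.
        self.history = ((self.history << 1) | int(taken)) & 0b11

# Example: 120 iterations of a "multiple of three" branch (B3) with
# x uniform on 1-6, so the branch is taken with probability 1/3.
random.seed(0)
pred = GlobalHistoryPredictor()
for _ in range(120):
    x = random.randint(1, 6)
    pred.update(x % 3 == 0)
```

For this B3 example, each update adds +1 with probability 1/3 and −1 with probability 2/3, so the counters sum to 120·(1/3 − 2/3) = −40 in expectation; the same reasoning applies to whichever branch the textbook problem tracks.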
Understanding these correlations helps designers choose appropriate branch predictors, directly affecting pipeline efficiency, while the systolic‑array mapping provides a concrete example of how dataflow architectures can be programmed for high‑throughput linear algebra, a cornerstone of modern AI workloads.
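The systolic‑array mapping can be simulated directly from the PE equations above (P = M passed right, Q = N passed down, R accumulating M·N). The sketch below assumes a standard output‑stationary 3×3 array with inputs skewed by one cycle per row and column—each operand then occupies 3 rows × 5 time slots, giving the 30 input slots mentioned above; the function name and cycle count are illustrative assumptions.

```python
# Minimal simulation of an output-stationary 3x3 systolic array.
# PE(i,j) passes its M input right (P = M), its N input down (Q = N),
# and accumulates R += M * N. Row i of A enters from the left skewed
# by i cycles; column j of B enters from the top skewed by j cycles.
def systolic_matmul_3x3(A, B):
    n = 3
    cycles = 3 * n - 2                     # cycles to fully drain the array
    R = [[0] * n for _ in range(n)]        # stationary accumulators, one per PE
    P = [[0] * n for _ in range(n)]        # M value latched in each PE
    Q = [[0] * n for _ in range(n)]        # N value latched in each PE
    for t in range(cycles):
        # Sweep bottom-right to top-left so every PE reads its
        # neighbours' latched values from the *previous* cycle.
        for i in reversed(range(n)):
            for j in reversed(range(n)):
                # M arrives from the left neighbour, or from A's skewed stream.
                m = P[i][j - 1] if j > 0 else (A[i][t - i] if 0 <= t - i < n else 0)
                # N arrives from the top neighbour, or from B's skewed stream.
                q = Q[i - 1][j] if i > 0 else (B[t - j][j] if 0 <= t - j < n else 0)
                R[i][j] += m * q           # R accumulates the product M * N
                P[i][j], Q[i][j] = m, q    # latch for the neighbours' next cycle
    return R
```

The one-cycle skew guarantees that A[i][k] and B[k][j] meet at PE(i,j) on the same cycle (both arrive at time i + k + j), so each stationary accumulator ends up holding one element of the product.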