Hardware Videos

All News Deals Social Blogs Videos Podcasts Digests

Hardware Semiconductors

Digital Design & Computer Architecture D9: Problem-Solving Session 9 (Spring 2026)

•May 4, 2026

Onur Mutlu Lectures

Onur Mutlu Lectures•May 4, 2026

Why It Matters

Effective branch prediction minimizes pipeline stalls, directly enhancing CPU throughput and energy efficiency in modern high‑performance systems.

Key Takeaways

•Branch prediction mitigates control dependencies in pipelined processors.
•Simple predictors (always taken, backward‑taken) trade accuracy for hardware simplicity.
•Two‑bit saturating counters improve prediction stability over one‑bit schemes.
•Per‑branch predictors reduce interference in nested loop scenarios.
•Microbenchmark analysis reveals pipeline depth and branch stall cycles.

Summary

The video walks through a problem‑solving session on branch prediction, a core technique for handling control dependencies in modern pipelined CPUs. After a brief recap of terminology such as control dependency and branch‑resolution penalty, the instructor introduces several classic prediction strategies and explains how they fit into the processor pipeline. Key insights include the trade‑offs between ultra‑simple predictors—like PC+4 (always taken) or backward‑taken/forward‑not‑taken heuristics—and more sophisticated schemes that use finite‑state machines. Two‑bit saturating counters are highlighted for their ability to avoid rapid state flips, while compile‑time hints (branch direction) contrast with runtime predictors that learn from actual outcomes. The discussion also covers per‑branch predictor tables to prevent interference between nested loops. Illustrative examples feature the always‑taken predictor’s “guess‑and‑squash” behavior, the last‑time predictor’s state diagram, and a microbenchmark designed to reverse‑engineer pipeline depth and branch stall cycles. The instructor walks through how varying the loop counter R1 and measuring dynamic instruction counts can reveal both the number of pipeline stages and the stall penalty for a conditional branch. The implications are clear: accurate branch prediction directly reduces pipeline stalls, boosting instruction‑throughput and overall CPU performance. Understanding these fundamentals equips designers and engineers to balance hardware complexity against prediction accuracy, a critical consideration as processors scale to deeper pipelines and higher clock rates.

Original Description

Digital Design and Computer Architecture, ETH Zürich, Spring 2026 (https://safari.ethz.ch/ddca/spring2026/)

D9: Problem-Solving Session 9

Lecturer: Prof. Onur Mutlu

Date: 4 May 2026

Recommended Reading:

====================

A Modern Primer on Processing in Memory

https://arxiv.org/pdf/2012.03112.pdf

Memory-Centric Computing: Solving Computing's Memory Problem

https://www.arxiv.org/pdf/2505.00458

Memory-Centric Computing: Recent Advances in Processing-in-DRAM

https://arxiv.org/pdf/2412.19275

Intelligent Architectures for Intelligent Computing Systems

https://people.inf.ethz.ch/omutlu/pub/intelligent-architectures-for-intelligent-computingsystems-invited_paper_DATE21.pdf

RowHammer: A Retrospective

https://people.inf.ethz.ch/omutlu/pub/RowHammer-Retrospective_ieee_tcad19.pdf

Fundamentally Understanding and Solving RowHammer

https://arxiv.org/pdf/2211.07613.pdf

Accelerating Genome Analysis via Algorithm-Architecture Co-Design

https://people.inf.ethz.ch/omutlu/pub/AcceleratingGenomeAnalysis_dac23.pdf

From Molecules to Genomic Variations: Accelerating Genome Analysis via Intelligent Algorithms and Architectures

https://people.inf.ethz.ch/omutlu/pub/IntelligentGenomeAnalysis_csbj22.pdf

RECOMMENDED LECTURE VIDEOS & PLAYLISTS:

========================================

Digital Design and Computer Architecture Spring 2025 Livestream Lectures Playlist:

https://www.youtube.com/watch?v=ubhxKNlOlRg&list=PL5Q2soXY2Zi9Eo29LMgKVcaydS7V1zZW3&index=3

Fundamentals of Computer Architecture Fall 2025 Livestream Lectures Playlist:

https://www.youtube.com/watch?v=uKgMFj1eQQc&list=PL5Q2soXY2Zi_ZMtqz1r-GHm-zzuE1QfIg&index=2

Seminar in Computer Architecture Spring 2025 Livestream Lectures Playlist:

https://www.youtube.com/watch?v=rqeKNZrLzng&list=PL5Q2soXY2Zi-oIW66TLOjtiqQxlDwNHng&index=2

Computer Architecture Fall 2024 Lectures Playlist:

https://www.youtube.com/watch?v=ziMRjDlLEwo&list=PL5Q2soXY2Zi-LfDdGgWyLcTSqzm6a26wD&index=2

Interview with Professor Onur Mutlu:

https://www.youtube.com/watch?v=8ffSEKZhmvo&list=PL5Q2soXY2Zi8VrmOTz44l2WupethSdh-M&index=9

TCuARCH meets Prof. Onur Mutlu

https://www.youtube.com/watch?v=6Hpn4SAX0dI

Arch. Mentoring Workshop @ISCA'21 - Doing Impactful Research

https://www.youtube.com/watch?v=83tlorht7Mc

The Story of RowHammer Lecture:

https://www.youtube.com/watch?v=sgd7PHQQ1AI&list=PL5Q2soXY2Zi8D_5MGV6EnXEJHnV2YFBJl&index=39

Accelerating Genome Analysis Lecture:

https://www.youtube.com/watch?v=r7sn41lH-4A&list=PL5Q2soXY2Zi8D_5MGV6EnXEJHnV2YFBJl&index=41

Memory-Centric Computing Systems Tutorial at IEDM 2021:

https://www.youtube.com/watch?v=H3sEaINPBOE&list=PL5Q2soXY2Zi8D_5MGV6EnXEJHnV2YFBJl&index=35

Intelligent Architectures for Intelligent Machines Lecture:

https://www.youtube.com/watch?v=GTieZPY4Wmc&list=PL5Q2soXY2Zi8D_5MGV6EnXEJHnV2YFBJl&index=38

Featured Lectures:

https://www.youtube.com/watch?v=jVYCchBGNVc&list=PL5Q2soXY2Zi8VrmOTz44l2WupethSdh-M&index=1

Comments

Want to join the conversation?

Loading comments...