Memory System Design for AI/ML & ML/AI for Memory System Design - SRC AIHW Annual Review - 23.07.24
Why It Matters
By slashing data‑movement energy, the new PIM designs enable faster, greener AI inference and training, directly impacting the cost and scalability of future AI hardware deployments.
Key Takeaways
- Data movement dominates energy use in large AI workloads
- Processing‑in‑memory (PIM) aims to cut off‑chip traffic
- MIDM introduces fine‑grain DRAM access and low‑cost interconnects
- LLVM passes automate SIMD extraction for DRAM‑based kernels
- Energy efficiency gains reach up to 6.8× versus GPUs
Summary
The SRC AIHW annual review highlighted a critical challenge in modern AI/ML systems: data movement consumes the majority of system energy, especially in large‑scale models running on edge TPUs, where over 90% of power is spent on off‑chip interconnects. The task force's mission is to design memory systems that are data‑centric, data‑aware, and capable of handling massive workloads in both AI and genomics, through a tight hardware‑software co‑design loop.
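The dominance of data movement follows from simple arithmetic: off‑chip accesses cost orders of magnitude more energy per bit than on‑chip arithmetic. The back‑of‑envelope sketch below illustrates the effect; the per‑bit and per‑MAC energy figures are illustrative assumptions, not measurements from this work.

```python
# Back-of-envelope estimate of data-movement vs. compute energy,
# illustrating why off-chip traffic dominates. The energy constants
# below are illustrative assumptions, not measured values.

DRAM_ACCESS_PJ_PER_BIT = 20.0   # assumed off-chip DRAM access energy
MAC_PJ = 0.2                    # assumed energy of one on-chip MAC op

def energy_breakdown(bytes_moved: int, mac_ops: int) -> dict:
    """Return compute vs. data-movement energy (joules) and movement share."""
    movement_j = bytes_moved * 8 * DRAM_ACCESS_PJ_PER_BIT * 1e-12
    compute_j = mac_ops * MAC_PJ * 1e-12
    total = movement_j + compute_j
    return {
        "movement_j": movement_j,
        "compute_j": compute_j,
        "movement_share": movement_j / total,
    }

# A hypothetical layer moving 100 MB of weights/activations for 50M MACs:
stats = energy_breakdown(bytes_moved=100_000_000, mac_ops=50_000_000)
print(f"data movement: {stats['movement_share']:.1%} of layer energy")
```

Under these assumed constants, data movement accounts for well over 90% of the layer's energy, which is the regime PIM targets.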
Key progress this year centers on processing‑in‑memory (PIM) strategies, notably the MIDM (Multiple‑Instruction Multiple‑Data in DRAM) framework presented at HPCA. MIDM refines DRAM granularity, adds lightweight inter‑bank communication, and supplies compiler and OS support to map high‑level kernels onto DRAM instructions. By segmenting word lines and enabling fine‑grain operations, the approach mitigates under‑utilization, improves SIMD utilization, and supports multi‑programming across DRAM mats.
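The execution model described above can be sketched in miniature: each DRAM mat acts as a short SIMD array operating on its local rows, and multi‑programming means different mats run independent instruction streams. The mat count, lane width, and toy instruction set here are illustrative assumptions, not the MIDM ISA.

```python
# Toy model of fine-grain, multi-program SIMD execution across DRAM mats.
# Lane width and the two-op "ISA" are illustrative assumptions; real
# DRAM-based PIM operates on full rows with very different timing.

from dataclasses import dataclass, field

@dataclass
class Mat:
    """One DRAM mat: a narrow SIMD array with a few local rows."""
    lanes: int = 8
    rows: dict = field(default_factory=dict)

    def execute(self, op: str, dst: str, a: str, b: str) -> None:
        # Element-wise ops stay inside the mat: no off-chip traffic.
        fn = {"add": lambda x, y: x + y, "mul": lambda x, y: x * y}[op]
        self.rows[dst] = [fn(x, y) for x, y in zip(self.rows[a], self.rows[b])]

def run(mats, programs):
    """Multi-programming: each mat executes its own instruction stream."""
    for mat, program in zip(mats, programs):
        for instr in program:
            mat.execute(*instr)

mats = [Mat(), Mat()]
mats[0].rows = {"a": [1] * 8, "b": [2] * 8}
mats[1].rows = {"a": [3] * 8, "b": [4] * 8}
# Two independent kernels mapped onto two mats at once:
run(mats, [[("add", "c", "a", "b")], [("mul", "c", "a", "b")]])
print(mats[0].rows["c"], mats[1].rows["c"])
```

The point of the sketch is the mapping, not the arithmetic: because each mat has its own stream, short kernels no longer leave most of the array idle, which is the under‑utilization problem the word‑line segmentation addresses.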
The team demonstrated substantial performance and energy benefits across benchmarks, reporting up to 6.8× energy improvement over GPUs and 14× over prior SIMD‑based PIM systems. Compiler integration via three new LLVM passes automates vectorization, scheduling, and code generation, reducing programmer effort. Open‑source releases of architectural models and simulation tools further accelerate community adoption.
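The division of labor among the three passes can be sketched as a toy pipeline: group independent scalar operations into SIMD ops, place those ops onto mats, then emit instructions. The pass names, IR, and scheduling policy below are hypothetical stand‑ins for the paper's LLVM passes, shown only to make the flow concrete.

```python
# Sketch of a three-stage pipeline (vectorize -> schedule -> codegen)
# over a toy IR. All names and the round-robin policy are hypothetical;
# the real work is implemented as LLVM passes over actual IR.

def vectorize(scalar_ops, width):
    """Group independent scalar ops of the same kind into SIMD ops."""
    simd = []
    for i in range(0, len(scalar_ops), width):
        group = scalar_ops[i:i + width]
        assert len({op[0] for op in group}) == 1, "mixed ops in one group"
        simd.append((group[0][0], [op[1] for op in group]))
    return simd

def schedule(simd_ops, num_mats):
    """Assign SIMD ops to DRAM mats round-robin."""
    return [(i % num_mats, op) for i, op in enumerate(simd_ops)]

def codegen(scheduled):
    """Emit one textual DRAM instruction per scheduled SIMD op."""
    return [f"mat{m}: {kind} {operands}" for m, (kind, operands) in scheduled]

ops = [("add", k) for k in range(8)]          # 8 independent scalar adds
isa = codegen(schedule(vectorize(ops, width=4), num_mats=2))
for line in isa:
    print(line)
```

Automating these three steps is what removes the programmer burden: the kernel author writes ordinary scalar code and the pipeline discovers the SIMD structure and the mat placement.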
These advances suggest a shift toward memory‑centric AI architectures, where smarter memory subsystems alleviate bandwidth bottlenecks and lower power budgets. For industry, the work promises more sustainable, high‑performance AI accelerators and opens pathways for collaborations with Intel, AMD, IBM, and Qualcomm.