Hardware Videos

All News Deals Social Blogs Videos Podcasts Digests

Understanding & Designing Modern Storage Systems - M5: Processing Inside NAND Flash Memory

•March 26, 2026

Onur Mutlu Lectures

Onur Mutlu Lectures•Mar 26, 2026

Why It Matters

FlashCosmos cuts data‑movement costs and improves energy efficiency, allowing data‑center operators to run large‑scale analytics directly in storage without sacrificing reliability.

Key Takeaways

•Multi-word line sensing enables single‑read bulk bitwise ops.
•FlashCosmos improves performance and energy efficiency over prior IFP.
•Enhanced SLC programming increases voltage margin for reliable computation.
•Evaluated on 160 real 3D NAND chips across three workloads.
•Reduces data movement bottlenecks from storage to compute units.

Summary

The video introduces FlashCosmos, a new in‑flash processing technique that performs bulk bitwise operations directly inside NAND flash memory. Presented as part of a recent MICRO 2022 paper, the work targets the growing data‑movement bottleneck that hampers databases, graph analytics, cryptography and other data‑intensive workloads.

Conventional systems move data from storage to CPUs or GPUs, limited by PCIe‑Gen4’s ~8 GB/s external bandwidth, while near‑data and in‑storage processing still suffer from internal channel limits (~9.6 Gb/s) and serial sensing of operands. FlashCosmos replaces serial reads with a multi‑word‑line sensing (MWS) scheme that activates several word lines simultaneously, delivering a single‑read AND/OR operation. An enhanced SLC programming mode widens the voltage margin between erased and programmed states, boosting computational reliability.

The authors demonstrate the concept on 160 real 3D‑NAND chips and run system‑level simulations using a state‑of‑the‑art SSD simulator on three real‑world workloads. Results show up to 2‑3× speedup and comparable energy reductions versus the best prior in‑flash processing designs, while maintaining low raw bit‑error rates thanks to the larger voltage margin.

By eliminating repeated reads and reducing data transfers, FlashCosmos promises to reshape storage‑centric architectures, enabling faster, greener processing for workloads that exceed DRAM capacity. Its reliability improvements also make in‑flash compute viable for production environments, potentially accelerating the adoption of computational storage solutions.

Original Description

Project and Seminars Course: Understanding and Designing Modern Storage Systems, ETH Zürich, Spring 2026

(https://safari.ethz.ch/projects_and_seminars/spring2026/doku.php?id=modern_ssds)

Lecture 5: Processing Inside NAND Flash Memory

Lecturer: Rakesh Nadig and Dr. Mohammad Sadrosadati

Date: March 27, 2026

Slides (pptx): https://safari.ethz.ch/projects_and_seminars/spring2026/lib/exe/fetch.php?media=pns_modern_storage_systems_spring2026_m5_processing_in_flash_memory.pptx

Slides (pdf): https://safari.ethz.ch/projects_and_seminars/spring2026/lib/exe/fetch.php?media=pns_modern_storage_systems_spring2026_m5_processing_in_flash_memory.pdf

Recommended Reading:

====================

A Modern Primer on Processing in Memory

https://arxiv.org/pdf/2012.03112.pdf

Memory-Centric Computing: Solving Computing's Memory Problem

https://www.arxiv.org/pdf/2505.00458

Memory-Centric Computing: Recent Advances in Processing-in-DRAM

https://arxiv.org/pdf/2412.19275

Intelligent Architectures for Intelligent Computing Systems

https://people.inf.ethz.ch/omutlu/pub/intelligent-architectures-for-intelligent-computingsystems-invited_paper_DATE21.pdf

RowHammer: A Retrospective

https://people.inf.ethz.ch/omutlu/pub/RowHammer-Retrospective_ieee_tcad19.pdf

Fundamentally Understanding and Solving RowHammer

https://arxiv.org/pdf/2211.07613.pdf

Accelerating Genome Analysis via Algorithm-Architecture Co-Design

https://people.inf.ethz.ch/omutlu/pub/AcceleratingGenomeAnalysis_dac23.pdf

From Molecules to Genomic Variations: Accelerating Genome Analysis via Intelligent Algorithms and Architectures

https://people.inf.ethz.ch/omutlu/pub/IntelligentGenomeAnalysis_csbj22.pdf

RECOMMENDED LECTURE VIDEOS & PLAYLISTS:

========================================

Digital Design and Computer Architecture Spring 2025 Livestream Lectures Playlist:

https://www.youtube.com/watch?v=ubhxKNlOlRg&list=PL5Q2soXY2Zi9Eo29LMgKVcaydS7V1zZW3&index=3

Fundamentals of Computer Architecture Fall 2025 Livestream Lectures Playlist:

https://www.youtube.com/watch?v=uKgMFj1eQQc&list=PL5Q2soXY2Zi_ZMtqz1r-GHm-zzuE1QfIg&index=2

Seminar in Computer Architecture Spring 2025 Livestream Lectures Playlist:

https://www.youtube.com/watch?v=rqeKNZrLzng&list=PL5Q2soXY2Zi-oIW66TLOjtiqQxlDwNHng&index=2

Computer Architecture Fall 2024 Lectures Playlist:

https://www.youtube.com/watch?v=ziMRjDlLEwo&list=PL5Q2soXY2Zi-LfDdGgWyLcTSqzm6a26wD&index=2

Interview with Professor Onur Mutlu:

https://www.youtube.com/watch?v=8ffSEKZhmvo&list=PL5Q2soXY2Zi8VrmOTz44l2WupethSdh-M&index=9

TCuARCH meets Prof. Onur Mutlu

https://www.youtube.com/watch?v=6Hpn4SAX0dI

Arch. Mentoring Workshop @ISCA'21 - Doing Impactful Research

https://www.youtube.com/watch?v=83tlorht7Mc

The Story of RowHammer Lecture:

https://www.youtube.com/watch?v=sgd7PHQQ1AI&list=PL5Q2soXY2Zi8D_5MGV6EnXEJHnV2YFBJl&index=39

Accelerating Genome Analysis Lecture:

https://www.youtube.com/watch?v=r7sn41lH-4A&list=PL5Q2soXY2Zi8D_5MGV6EnXEJHnV2YFBJl&index=41

Memory-Centric Computing Systems Tutorial at IEDM 2021:

https://www.youtube.com/watch?v=H3sEaINPBOE&list=PL5Q2soXY2Zi8D_5MGV6EnXEJHnV2YFBJl&index=35

Intelligent Architectures for Intelligent Machines Lecture:

https://www.youtube.com/watch?v=GTieZPY4Wmc&list=PL5Q2soXY2Zi8D_5MGV6EnXEJHnV2YFBJl&index=38

Featured Lectures:

https://www.youtube.com/watch?v=jVYCchBGNVc&list=PL5Q2soXY2Zi8VrmOTz44l2WupethSdh-M&index=1

Comments

Want to join the conversation?

Loading comments...