Science Videos

All News Social Blogs Videos Podcasts Digests

Science HealthTech BioTech

P&S Arch. & Algo. For Health & Life Sciences - L6: Overview of Genomic Workflows (II) (Spr 2026)

•April 28, 2026

Onur Mutlu Lectures

Onur Mutlu Lectures•Apr 28, 2026

Why It Matters

Efficient read‑mapping algorithms cut sequencing analysis time and cost, enabling faster, more affordable genomic insights critical for clinical and research breakthroughs.

Key Takeaways

•Read mapping transforms fragmented reads into a reconstructed genome.
•Short reads offer accuracy; long reads provide coverage but higher error rates.
•Indexing reference genomes with k‑mer seeds enables fast alignment.
•Minimizer and spaced‑seed techniques balance memory use and sensitivity.
•Advanced seed algorithms improve fuzzy matching without excessive storage.

Summary

The sixth lecture of the P&S Architecture & Algorithms for Health & Life Sciences series dives into genomic workflow analysis, concentrating on the read‑mapping stage that stitches sequenced fragments into a complete genome. It revisits earlier concepts—why genomics matters, base‑calling, and data digitization—before moving into the computational challenges of aligning millions of short and long reads to a reference.

The presenter explains the fundamental trade‑off between short reads, which are highly accurate but limited in length, and long reads, which span larger regions yet carry higher error rates. Efficient mapping relies on indexing the reference genome with k‑mer seeds, enabling rapid lookup rather than exhaustive sliding‑window searches. Various seed‑selection strategies—full k‑mer tables, minimizers, spaced‑seeds, linked k‑mers, and quasi‑seeds—are compared for their impact on memory footprint, sensitivity, and flexibility.

Illustrative analogies liken the process to solving a puzzle with or without a picture, highlighting how minimizer selection (choosing the smallest hash in a window) reduces storage while preserving most matches. The lecture cites the “blend” paper on fuzzy seed matching as an example of research that achieves high sensitivity without full‑k‑mer indexing, and demonstrates how space‑seed designs can capture similar sequences despite mismatches.

These algorithmic advances directly affect the scalability of genomic pipelines in health and life‑science applications. By optimizing the balance between speed, memory, and alignment accuracy, organizations can lower computational costs, accelerate diagnostic sequencing, and support larger population‑scale studies, ultimately advancing personalized medicine initiatives.

Original Description

Project & Seminar (P&S), ETH Zürich, Spring 2026

Architectures & Algorithms for Health & Life Sciences (https://safari.ethz.ch/projects_and_seminars/spring2026/doku.php?id=arch_and_alg_for_health)

Lecture 6: Overview of Genomic Workflows (II)

Lecturer: Nika Mansouri Ghiasi

Date: April 29, 2026

Slides (pptx):

Slides (pdf):

Recommended Reading:

====================

A Modern Primer on Processing in Memory

https://arxiv.org/pdf/2012.03112.pdf

Memory-Centric Computing: Solving Computing's Memory Problem

https://www.arxiv.org/pdf/2505.00458

Memory-Centric Computing: Recent Advances in Processing-in-DRAM

https://arxiv.org/pdf/2412.19275

Intelligent Architectures for Intelligent Computing Systems

https://people.inf.ethz.ch/omutlu/pub/intelligent-architectures-for-intelligent-computingsystems-invited_paper_DATE21.pdf

RowHammer: A Retrospective

https://people.inf.ethz.ch/omutlu/pub/RowHammer-Retrospective_ieee_tcad19.pdf

Fundamentally Understanding and Solving RowHammer

https://arxiv.org/pdf/2211.07613.pdf

Accelerating Genome Analysis via Algorithm-Architecture Co-Design

https://people.inf.ethz.ch/omutlu/pub/AcceleratingGenomeAnalysis_dac23.pdf

From Molecules to Genomic Variations: Accelerating Genome Analysis via Intelligent Algorithms and Architectures

https://people.inf.ethz.ch/omutlu/pub/IntelligentGenomeAnalysis_csbj22.pdf

RECOMMENDED LECTURE VIDEOS & PLAYLISTS:

========================================

Digital Design and Computer Architecture Spring 2025 Livestream Lectures Playlist:

https://www.youtube.com/watch?v=ubhxKNlOlRg&list=PL5Q2soXY2Zi9Eo29LMgKVcaydS7V1zZW3&index=3

Fundamentals of Computer Architecture Fall 2025 Livestream Lectures Playlist:

https://www.youtube.com/watch?v=uKgMFj1eQQc&list=PL5Q2soXY2Zi_ZMtqz1r-GHm-zzuE1QfIg&index=2

Seminar in Computer Architecture Spring 2025 Livestream Lectures Playlist:

https://www.youtube.com/watch?v=rqeKNZrLzng&list=PL5Q2soXY2Zi-oIW66TLOjtiqQxlDwNHng&index=2

Computer Architecture Fall 2024 Lectures Playlist:

https://www.youtube.com/watch?v=ziMRjDlLEwo&list=PL5Q2soXY2Zi-LfDdGgWyLcTSqzm6a26wD&index=2

Interview with Professor Onur Mutlu:

https://www.youtube.com/watch?v=8ffSEKZhmvo&list=PL5Q2soXY2Zi8VrmOTz44l2WupethSdh-M&index=9

TCuARCH meets Prof. Onur Mutlu

https://www.youtube.com/watch?v=6Hpn4SAX0dI

Arch. Mentoring Workshop @ISCA'21 - Doing Impactful Research

https://www.youtube.com/watch?v=83tlorht7Mc

The Story of RowHammer Lecture:

https://www.youtube.com/watch?v=sgd7PHQQ1AI&list=PL5Q2soXY2Zi8D_5MGV6EnXEJHnV2YFBJl&index=39

Accelerating Genome Analysis Lecture:

https://www.youtube.com/watch?v=r7sn41lH-4A&list=PL5Q2soXY2Zi8D_5MGV6EnXEJHnV2YFBJl&index=41

Memory-Centric Computing Systems Tutorial at IEDM 2021:

https://www.youtube.com/watch?v=H3sEaINPBOE&list=PL5Q2soXY2Zi8D_5MGV6EnXEJHnV2YFBJl&index=35

Intelligent Architectures for Intelligent Machines Lecture:

https://www.youtube.com/watch?v=GTieZPY4Wmc&list=PL5Q2soXY2Zi8D_5MGV6EnXEJHnV2YFBJl&index=38

Featured Lectures:

https://www.youtube.com/watch?v=jVYCchBGNVc&list=PL5Q2soXY2Zi8VrmOTz44l2WupethSdh-M&index=1

Comments

Want to join the conversation?

Loading comments...