10 Years of AlphaGo: The Turning Point for AI | Thore Graepel & Pushmeet Kohli

Google DeepMind
Google DeepMindMar 10, 2026

Why It Matters

AlphaGo showed that reinforcement learning can master ultra‑complex tasks, paving the way for modern AI systems that transform industries from language processing to drug discovery.

Key Takeaways

  • AlphaGo defeated world champion Lee Sedol in 2016.
  • Reinforcement learning combined with deep neural nets enabled mastery.
  • Fast thinking policy network and slow thinking tree search mimicked human intuition.
  • The match sparked global interest and validated AI’s potential beyond games.
  • Techniques from AlphaGo now power language models, protein folding, scientific AI.

Summary

The DeepMind podcast revisits AlphaGo's 2016 victory over Lee Sedol, a milestone that reshaped artificial intelligence research.

The episode explains why Go was the perfect testbed: simple rules but an astronomically large search space. AlphaGo fused a fast‑thinking policy network that predicts promising moves with a slow‑thinking Monte Carlo tree search that evaluates deep variations, mirroring the dual‑process thinking of human players.

Guests share personal anecdotes – Thore testing a baby version on his first day, a bet that the system would go ten‑nil against a European champion, and the shock of commentators over move 37, a play with only a one‑in‑10,000 chance of appearing in human games.

They connect those innovations to today’s breakthroughs: large language models, AlphaFold protein‑folding, and other scientific AI systems. The Go triumph proved reinforcement learning could handle ultra‑complex problems, opening the door to AI applications across industries.

Original Description

Seoul, March 2016. Two players sit hunched over a 19x19 grid covered in a sea of black and white stones. They are playing the ancient game of Go - a game of unimaginable complexity long thought impossible for a machine to master. On one side is Lee Sedol (Sae Dol), a legendary 18-time Go world champion. On the other, AlphaGo, a neural network based AI system built on a powerful technique called reinforcement learning.
In the blink of an eye, the world changed.
Exactly one decade later, we look back at the match that sparked the modern AI revolution. From algorithmic discovery to the solving of scientific grand challenges like protein folding, the foundation was laid right there on that wooden board.
Join Hannah Fry, Pushmeet Kohli (VP, Science) and Thore Graepel (AlphaGo team & Distinguished Research Scientist) as they unpick the legacy of AlphaGo.
Further watching:
🎥The Thinking Game: https://youtu.be/d95J8yzvjbQ
___

Comments

Want to join the conversation?

Loading comments...