DeepMind’s Latest: An AI for Handling Mathematical Proofs

DeepMind’s Latest: An AI for Handling Mathematical Proofs

Ars Technica AI
Ars Technica AINov 19, 2025

Companies Mentioned

Why It Matters

AlphaProof demonstrates that AI can approach human‑level reasoning in formal mathematics, opening pathways for automated theorem proving and potentially accelerating mathematical research, while also exposing the resource intensity needed for such breakthroughs.

Summary

DeepMind unveiled AlphaProof, an AI system that achieved silver‑medalist performance at the 2024 International Mathematical Olympiad, scoring just one point shy of a gold medal. The system combines a multi‑billion‑parameter neural net, tree‑search, and a novel test‑time reinforcement learning (TTRL) loop to generate and solve formalized statements in the Lean proof assistant, after translating natural‑language problems via a Gemini model that produced roughly 80 million formal statements. Human preprocessing and a specialized geometry module (AlphaGeometry 2) were required, and AlphaProof consumed hundreds of TPU‑days per problem, highlighting its massive computational cost. The researchers aim to extend the technology beyond competition problems toward research‑level mathematics, releasing a limited‑access tool for mathematicians.

DeepMind’s latest: An AI for handling mathematical proofs

Comments

Want to join the conversation?

Loading comments...