Amazon's Anti-Benchmark AI Bet

Sources, Dec 2, 2025

Summary

In this episode, the host interviews Amazon’s AI chief, Rohit Prasad, who argues that the AI community should stop obsessing over benchmark leaderboards and focus on real-world utility, noting that current evals are noisy and hard to compare. He explains Amazon’s contrarian approach, which treats consistent training data and held-out evaluations as the true measure of progress, and hints at upcoming announcements at AWS re:Invent that showcase practical advancements. The conversation also touches on industry reactions, including OpenAI’s “code red” and Anthropic’s new Claude model, underscoring a shift toward performance that matters beyond scores.
