Venture Capital Videos
  • All Technology
  • AI
  • Autonomy
  • B2B Growth
  • Big Data
  • BioTech
  • ClimateTech
  • Consumer Tech
  • Crypto
  • Cybersecurity
  • DevOps
  • Digital Marketing
  • Ecommerce
  • EdTech
  • Enterprise
  • FinTech
  • GovTech
  • Hardware
  • HealthTech
  • HRTech
  • LegalTech
  • Nanotech
  • PropTech
  • Quantum
  • Robotics
  • SaaS
  • SpaceTech
AllNewsDealsSocialBlogsVideosPodcastsDigests

Venture Capital Pulse

EMAIL DIGESTS

Daily

Every morning

Weekly

Sunday recap

NewsDealsSocialBlogsVideosPodcasts
Venture CapitalVideosTraining General Robots for Any Task: Physical Intelligence’s Karol Hausman and Tobi Springenberg
Venture Capital

Training General Robots for Any Task: Physical Intelligence’s Karol Hausman and Tobi Springenberg

•January 6, 2026
0
Sequoia Capital
Sequoia Capital•Jan 6, 2026

Why It Matters

By solving the intelligence bottleneck with foundation models, robots can become versatile, data‑driven services, dramatically expanding automation across industries and lowering deployment costs.

Key Takeaways

  • •Foundation models aim to give any robot any task.
  • •End-to-end reinforcement learning replaces traditional perception-planning-control pipeline entirely.
  • •Pi-Star 0.6 achieves multi-hour continuous operation in real world.
  • •Generalization relies on diverse robot data, not internet data.
  • •Open-sourcing models expands applications from coffee making to surgery.

Summary

Physical Intelligence is pioneering robotic foundation models that promise any robot can learn any task. By abandoning the classic perception‑planning‑control stack in favor of end‑to‑end reinforcement learning, the company’s Pi‑Star 0.6 model can ingest raw sensor data and instructions, then directly generate motor commands. The team demonstrated the system’s robustness with a robot that brewed coffee continuously for 13 hours and successfully navigated previously unseen home environments, showcasing emerging zero‑shot generalization. The breakthrough hinges on three pillars: capability, generalization, and performance. Capability was proven early with the Pi‑Zero release, showing that any collected task data can be turned into an executable policy. Generalization improves as the training set diversifies; the April Pi‑5 update let a robot operate in a brand‑new kitchen with only common‑sense behavior. Performance is now approaching deployment thresholds, as the models can run economically valuable tasks that generate their own data, creating a virtuous loop of improvement. The founders stress that the intelligence bottleneck, not hardware, limits today’s robots. They liken their open‑source approach to large language models, noting that external developers have already applied Pi‑Star to domains ranging from autonomous driving to surgical assistance. A quoted moment—"we crossed the threshold"—captures the shift from research prototypes to commercially viable agents, even as safety‑critical applications remain cautious. If the trajectory holds, scalable robot intelligence could flood multiple sectors—home assistance, manufacturing, agriculture, and healthcare—with adaptable agents that learn on the job. By turning robot operation into a data‑generating service, Physical Intelligence aims to outpace internet‑scale datasets, potentially reshaping the economics of automation and accelerating the broader adoption of autonomous systems.

Original Description

Physical Intelligence’s Karol Hausman and Tobi Springenberg believe that robotics has been held back not by hardware limitations, but by an intelligence bottleneck that foundation models can solve. Their end-to-end learning approach combines vision, language, and action into models like π0 and π*0.6, enabling robots to learn generalizable behaviors rather than task-specific programs. The team prioritizes real-world deployment and uses RL from experience to push beyond what imitation learning alone can achieve. Their philosophy—that a single general-purpose model can handle diverse physical tasks across different robot embodiments—represents a fundamental shift in how we think about building intelligent machines for the physical world.
Hosted by Alfred Lin and Sonya Huang, Sequoia Capital
0

Comments

Want to join the conversation?

Loading comments...