New Agents Accelerate Robot Scene Creation and Task Design
still very much in r&d but one of my fav projects being worked on at lucky is these agents that make creating the scene and what you actually want the robot(s) to do faster, just such a cool concept.
Building the Robotics Simulator I Dreamed Of
I’ve joined Lucky Robots as director of eng & AI to help build the robotics simulator I wish we had right now. If you’re someone who likes to make robots do cool stuff or want to shape a sim being built...
Hand‑Centric Policy Works Sim‑to‑Real, Needs Lidar Localization
a more in-home-acceptable hand-centric policy. sim2sim shown, it does work sim2real, but need to improve localisation w/ lidar. Changes: Good hand rotation, torso/head/camera actually faces where we intend to go. Semi decent walking, crouch, and general pose. Still all PPO https://t.co/hK5F0sJuJL
Fine‑tuning PPO Rewards Yields Smoother, Slower Gait
further reward crafting for hand-centric (tm) PPO model. Closing in on what I want. need to slow down max velocities and I want more stepping/stride + less tip toe on left foot ideally. almost cant believe this is actually just RL! https://t.co/ro56yI8yaV
Hand‑Centric Policy Gains Flexibility and Real‑World Viability
some improvements to the hand-centric policy crouches really well and even crouch walks, we can definitely pick stuff up off the floor with this I think I'd like to penalize torso velocity beyond a certain speed steerable dynamic hand rotation. I initially always...
Custom Reward Tweaks Yield Rapid, Satisfying Learning
we do a bit of reaching. modifying the mjlab velocity demo reward function to do custom behaviors is so satisfying because the base function just seems to learn so darn fast. It only takes a few minutes to start seeing...
First Successful Sim‑to‑Real RL Walk for Unitree G1
New video is out, teaching a Unitree G1 humanoid to walk using reinforcement learning (PPO). First time I've ever got sim2real to actually work with robotics, sharing what I've learned and testing out how good the policy actually is by...

Training a Unitree G1 to Walk W/ Reinforcement Learning
The video chronicles a creator’s effort to teach a Unitree G1 quadruped to walk using reinforcement‑learning techniques, emphasizing the transition from pure simulation (Sim2Sim) to real‑world deployment (Sim2Real). After years of attempting Sim2Real, the presenter finally succeeded thanks to advances...
End-to-End Neural Network Enables Real-Time Bipedal Gait Control
This is a vertically integrated end to end deep neural network performing forward pass inference real-time, controlling individual actuator's torque output for bidpedal gait generation in adverse, GPS denied, envs. ok its standard PPO rl trained in mjlab, strapped to a...
First‑mover Edge Fades in AI and Robotics
idk when first mover advantage was actually a thing, but it sure seems to not be a thing for AI and robotics.
Kitchen Robot Unloading Beats Self‑Driving in Complexity
im convinced getting a robot to be generally good at going into a random kitchen and unloading a dishwasher with e2e ml is orders of magnitude harder than self driving cars with e2e ml.
Gemini’s CLI Outshines GPT‑5.2, Leaving Codex Lagging
gpt-5.2 is a fine a model, but tbh, I'm going back to gemini because it's also a fine model and the CLI ui/ux is like 10x better. Really feels like codex cli is falling way behind competition.
Exploring GPT‑5.2’s Potential for Coding Agents
just when I think i'm out, openai pulls me back in with gpt-5.2. Let's see what it's all about. Any opinions on 5.2 in the context of coding agents?
Gemini 3 Pro CLI: Finally a Frustration‑Free Agent
Still my fav model and cli. Gemini 3 pro + cli also is the first agent/terminal combo that doesn't routinely make my blood boil while using it.
Training Gait Models to Detect and Recover From Falls
Today, we're hoping for the best, preparing for the worst. In gait training, the episode terminates if a fall is detected, so a fall in real is wildly out of distribution. So we need to handle for falls. Eventually be cool to...
Impressive Humanoid Jog Raises Curriculum Questions
This might be the best humanoid jog I've seen yet. Doubt this is pure RL, I'd love to know the curriculum here.
Bad Reward Function Uncovered Surprisingly Effective Robot Model
Geoff the G1 preparing to go offroading IRL. I did a terrible job at the reward function here and was actually just tuning in to see all of what was broken and instead found a pretty good model. The robots just want...

Gemini CLI + 3 Pro: Top Terminal AI, Occasional Dropouts
After a couple days of heavy use, tldr: it’s a good model and gemini cli is very good. The pair might be the best for terminal-based agents. But it’s not perfect. Pros: I never hit any limit for gemini 3 pro and...
First Successful Sim2Real Transfer Achieved on Geoff G1
Ladies and gentlemen, we have our first successful sim2real transfer on Geoff the G1! https://t.co/eywBjbiLxy
Discovered a Small Yet Effective Local Optimum
we found ourselves a nice lil local optimum. https://t.co/PXevVKYcpL
No Model yet Outperforms Codex+O3 in Real Work
This is my sentiment. I still have yet to experience a model in actual useful professional work that beats codex+o3. You can play with strengths to get some bench perf to move, at cost of another capability. All we're doing is...

Demand for AI Coding Agents Is Rising
The people want coding agents. https://t.co/dXL3ikiKwD
Top Robotics Influencers and Companies to Watch
a list of people and companies to watch for robotics
AMD's Deep Learning Gap Stems From Software Deficit
can anyone actually explain, with hard facts, how it's possible AMD still doesn't meaningfully compete in the deep learning space? I understand they lack certain pieces of software. I do not understand how they could reasonably still lack it though.
Sim-to-Real Transfer Achieves Real-World Walking and Running
sim2real got hands, but making progress towards our own gait and transferring walking/running successfully from sim to real. ignore toddler mess. its all strategically placed with purpose to test challenging scenarios for robots of course. https://t.co/Xbgayfrfik
Debating Sim2Real for New Sprinter Policy
just cooked up a new sprinter policy, do we attempt sim2real? https://t.co/DIDvOrcm23
Kids' Early Rolling Mimics Reinforcement Learning
Watching kids learn these first traits really does look a heck of a lot like reinforcement learning. Our twins' first mode of travel was just rolling to the desired location.
Neo's $20K Price Risky; $500/Mo Housekeeping Steal
After some time of thought on the 1x Neo $20K for Neo is a significant investment with a lot of risk since the bot is proprietary. $500/month is an outright steal for housekeeping tho, even if it's through a teleoped robot. No need...
Manual Curriculum Yields First Walking Gait in KScale KBot
Finally getting a semblance of a gait for the kscale kbot in mjlab sim! Ended up essentially manually managing a curriculum of first balance, then forward vector walking. https://t.co/UTuZUSF4dz
Reward Hacking with Mjlab's Seamless Wandb Integration
We do a bit of reward hacking. Enjoying mjlab's robust wandb integration for tracking runs and storing chkpts Always a fan of wandb but usually too lazy to do the integration for the stats/files like this We'll see if I ever actually make...
Browser‑Based Viz Makes Training Progress Easy Anywhere
exceptionally easy install and demo at least. Didn't realize viz ran via browser immediately makes it easy to visualize training progress from a personal computer while processing occurs on a server/workhorse machine true test will be trying this on custom problem and...

Codex CLI Exit Saves Sessions, Huge QOL Boost
whoever added this to the codex cli when you exit is a lifesaver cannot tell you how many times I've mindlessly ctrl+c trying to copy paste only to break a session then i gotta dig up the latest chat. such a simple...