🚀 ORV: 4D Occupancy-centric Robot Video Generation (CVPR 2026) https://t.co/9ILR3w3XND What if we could generate photorealistic robot manipulation videos with precise 4D control? With ORV, we condition video generation on 4D semantic occupancy, enabling: ✨ High-fidelity robot videos with fine-grained motion control 🎥 Multi-view generation for building consistent 4D scenes 🧠 Simulation-to-real transfer by plugging directly into physics simulators 🤖 Better downstream robot learning with scalable synthetic data We also release the largest tabletop manipulation occupancy dataset ever built. 🎬 Watch the video ↓