Build Your Next Story with Gemini Omni.

Google DeepMind
May 20, 2026

Why It Matters

Omni could accelerate content creation and post-production workflows by making high-fidelity video generation and natural-language editing widely accessible. That would raise both productivity and content quality while intensifying debates over synthetic media governance. The rollout positions Google to compete strongly in the growing market for multimodal AI tools across enterprise and creator ecosystems.

Summary

Google announced Gemini Omni, a new multimodal generative model that combines Gemini's reasoning with advanced media models to create and edit highly realistic videos, images and simulations from any input. Omni advances intuitive physics and world understanding, enabling more accurate simulations of kinetic energy, gravity and complex processes like protein folding. It supports iterative, conversational editing of user-supplied footage: users can transform selfies or scenes, and the model updates the whole context to match each edit. The first member of the family, Gemini Omni Flash, is now rolling out across Google's products, with a pro version promised later.
