What's New with ChatGPT Voice

OpenAI
OpenAIDec 5, 2025

Why It Matters

Embedding voice directly into ChatGPT creates a frictionless, multimodal interface that can boost user engagement, streamline local search, and open fresh channels for businesses to reach customers through conversational AI.

Summary

The video introduces the latest iteration of ChatGPT’s voice feature, now embedded directly into the chat interface, delivering a seamless spoken‑dialogue experience with a live transcript that mirrors the conversation in real time. This integration expands beyond simple text‑to‑speech, allowing the assistant to pull up dynamic content such as maps, weather updates, and other contextual data while the user talks.

During the demonstration, the user asks for a map of the best bakeries in San Francisco’s Mission District. ChatGPT instantly displays a map highlighting top spots, with Tartine prominently featured, and then enumerates its signature pastries—morning butter, classic croissants, rich pain au chocolat, and a “franapan” croissant filled with almond cream. The assistant also provides pronunciation guidance, clarifying that “franapan” is pronounced like “fran.”

Key moments include the assistant’s ability to switch between spoken answers and visual aids, and the explicit pronunciation cue: “It’s pronounced franapan.” This showcases the model’s capacity to handle nuanced language tasks, such as phonetic clarification, while simultaneously delivering location‑based recommendations.

The rollout signals a shift toward truly multimodal AI interactions, where voice, text, and visual data converge in a single workflow. For businesses, this means new avenues for real‑time customer engagement, localized marketing, and accessibility, as users can now obtain actionable information without typing, simply by speaking and seeing instant visual feedback.

Original Description

You can now use ChatGPT Voice right inside chat — no separate mode needed.
You can talk, watch answers appear, review earlier messages, and see visuals like images or maps in real time.
Rolling out to all users on mobile and web. Just update your app.
If you prefer the original experience, turn on “Separate mode” under Settings → Voice Mode.

Comments

Want to join the conversation?

Loading comments...