Enhanced Conversations with ChatGPT Advanced Voice Mode

Published On Wed Sep 25 2024
Enhanced Conversations with ChatGPT Advanced Voice Mode

ChatGPT Advanced Voice Mode Rolling Out with New UI, More ...

Months ago, during the GPT-4o launch, ChatGPT Advanced Voice was demoed. However, the release was postponed by OpenAI due to safety concerns. A controversy later arose regarding the 'Sky' voice, which bore a strong resemblance to Scarlett Johansson’s voice. After a delay of five months, OpenAI is now rolling out Advanced Voice to all ChatGPT Plus and Team users, with the rollout expected to be completed within the week.

Enhancements and Features

For those unfamiliar, Advanced Voice represents a significant upgrade over the standard voice chat available to free ChatGPT users. Leveraging the multimodal capabilities of the GPT-4o model, Advanced Voice offers a more natural conversational experience with support for interruptions.

ChatGPT Voice Mode Is Here: Will It Revolutionize AI Communication

With the rollout to Plus and Team users in the ChatGPT app underway this week, several enhancements have been introduced. These include Custom Instructions, Memory, the addition of five new voices, improved accents, and the ability to apologize in over 50 languages.

Advanced Voice in ChatGPT, though reminiscent of Google’s Gemini Live, features a notable distinction. While Gemini Live utilizes text-to-speech engines to extract responses from an LLM and respond, ChatGPT Advanced Voice supports direct audio input/output. Although Gemini Live also accommodates interruptions, it does not provide a fully multimodal experience.

Current State of ChatGPT Advanced Voice

Despite the promises made during the initial demo, it appears that ChatGPT Advanced Voice may have lost certain multimodal features. OpenAI's showcase included capabilities such as singing, mood/emotion detection through speech, sound identification, accents, and more. However, current limitations indicate that speech identification is unavailable, and camera input is not yet supported.

GPT-4o: Early Impressions and Insights - Gradient Flow

It seems that some features might have been removed by OpenAI to prevent potentially awkward interactions with ChatGPT users. Regardless, the question remains—are you eagerly anticipating the use of ChatGPT Advanced Voice? Share your thoughts below.