Former OpenAI Executive Greg Brockman announced the rollout of ChatGPT Voice, making it available to all users and ushering in a new era of AI interaction. Amidst leadership turbulence at OpenAI, this feature release signifies a substantial leap forward in enhancing the conversational experience within the app.
Users can now engage in dialogue with ChatGPT, experiencing a seamless and natural conversational flow, a departure from conventional text-based interactions. Brockman highlighted the transformative impact of ChatGPT Voice, emphasizing its ability to redefine user engagement.
Latest Update from OpenAI on Voice Mode
September 24, 2024
OpenAI Rolls Out Advanced Voice Mode (AVM) for ChatGPT Plus and Teams Users
OpenAI has announced the rollout of its new Advanced Voice Mode (AVM) to ChatGPT’s paying customers, starting with users in the Plus and Teams tiers. Enterprise and Edu users will gain access next week. The updated feature introduces a redesigned blue animated sphere, replacing the original black dots from its earlier showcase.
Additionally, ChatGPT now offers five new natural-sounding voices—Arbor, Maple, Sol, Spruce, and Vale—bringing the total number of voices to nine. This update enhances the conversational experience, allowing for more fluid interactions in over 50 languages.
July 31, 2024
We’re starting to roll out advanced Voice Mode to some ChatGPT Plus users. This mode offers more natural, real-time conversations, lets you interrupt anytime, and responds to your emotions. If you’re part of this alpha test, you’ll get an email and a message in your mobile app with instructions. We’ll gradually include more people, aiming for all Plus users to have access by fall. Video and screen sharing will come later.
Since our first demo of advanced Voice Mode, we’ve been working to ensure voice conversations are safe and high-quality. We tested GPT-4o’s voice with over 100 external testers in 45 languages. To protect privacy, the model only uses four preset voices and blocks outputs that differ from those voices. We’ve also added safeguards to block requests for violent or copyrighted content. Feedback from this alpha test will help us make the Voice Mode safer and more enjoyable. We plan to share a detailed report on GPT-4o’s abilities, limitations, and safety measures in early August.
How to use ChatGPT voice on mobile?
- Mobile App Installation: Download and install the ChatGPT mobile app for Android or iOS, since voice functionality is not available on the web version.
- Sign In: Open the app and sign in to your ChatGPT Plus account.
- Access Voice Feature: Once on the main prompt screen, tap the headphones icon located in the lower right corner to initiate a voice conversation with ChatGPT.
- Choose a Voice: You will see a splash screen explaining the feature. Here, tap “Choose a voice” and select from one of the five available voice options. Listen to a short preview of each voice and tap “Confirm” to select your preferred voice.
- Start Speaking: Begin talking to your phone. ChatGPT will process your spoken words and generate a response. The conversation continues as long as you speak, and ChatGPT may ask related questions to keep the dialogue going.
- Manual Input Option: If automatic voice recognition is not working well, you can manually input voice commands. Tap and hold the screen, speak your query, and then release your finger for the chat to be processed.
- Enjoy Interactive Responses: Engage with ChatGPT in a conversational manner. You can ask it to tell stories, recite poems, or discuss various topics.
- Return to Text Interface: To go back to the main ChatGPT text interface, tap the red and white cross icon. You’ll see your previous voice interactions displayed in text format.
The feature introduces five distinct voices—Juniper, Sky, Breeze, Ember, and Cove—each offering a more human-like tonality and rhythm compared to existing voice assistants like Siri or Alexa. This diversity of voices amplifies the immersive nature of conversations with the AI.
Despite internal upheavals at OpenAI, including staff discontent leading to calls for the board’s resignation and reinstatement of former leaders Sam Altman and Greg Brockman, the company continues to innovate and release cutting-edge features.
ChatGPT Voice utilizes OpenAI’s Whisper technology, leveraging brief samples of real human voices to generate endless hours of conversational AI interactions. This technology extends beyond the app, with Spotify leveraging it to translate podcasts into multiple languages, showcasing its broad applications.
Tech analyst Ben Thompson praised the conversational ease of ChatGPT Voice, likening the experience to engaging in profound philosophical discussions with artificial intelligence. He emphasized the effortless transition from traditional text-based communication to voice interaction, enhancing the user experience.
OpenAI’s decision to democratize access to this voice feature, previously limited to premium subscribers, signifies a shift towards inclusivity in AI technology. By enabling users to engage through voice commands, the app caters to a wider audience, including those preferring voice-based interactions.
This update holds promise for content and search marketers, offering avenues to explore voice-optimized strategies. Marketers can now delve into creating interactive and personalized campaigns, enhancing user engagement and the customer experience. Furthermore, the update opens doors for refining search engine strategies through voice search optimization.
As OpenAI continues to push the boundaries of AI innovation, the introduction of ChatGPT Voice stands as a testament to the company’s commitment to pioneering advancements in conversational AI. The feature’s impact on user interaction and its role in reshaping content creation and SEO strategies remain compelling areas for exploration and development.
Also Read: Sam Altman is the New CEO of OpenAI