OpenAI announced on Tuesday that it is postponing the release of its “Voice Mode” feature by a month to July due to technical issues. Initially, the company intended to launch the realistic voice conversation experience to a select group of ChatGPT Plus users in late June, but it now requires additional time to meet its launch standards.
“We’re improving the model’s ability to detect and refuse certain content. We’re also enhancing the user experience and preparing our infrastructure to scale to millions while maintaining real-time responses,” OpenAI explained in a post on the social media platform X.
The feature will first be released to a limited group of users to collect feedback and will be made available to all Plus users in the fall, pending safety and reliability checks. OpenAI is also developing new video and screen-sharing capabilities.
In May, the company announced plans to release a new AI model called GPT-4o, capable of realistic voice conversation and interaction across text and image, marking its latest effort to lead in emerging AI technology. The new audio features will allow users to speak to ChatGPT and receive real-time responses, as well as interrupt ChatGPT during its responses—both key aspects of realistic conversations that have been challenging for AI voice assistants.