2 Sources
[1]
ChatGPT leak reveals new Bidi 1 voice model that can listen and speak simultaneously
The unannounced model has already started rolling out to a select group of app users, hinting at an official release window this week. OpenAI is reportedly planning to turn ChatGPT into a superapp, with a major overhaul in the pipeline. The overhaul is said to focus on OpenAI's Codex coding tool and agentic AI tools that can perform tasks for users. But there seems to be more in store, as a new bidirectional audio model named "GPT Bidi 1" has also been spotted, which would be a massive upgrade to ChatGPT's conversational abilities. Bidi is said to be shorthand for bidirectional design, which allows the assistant to speak, hear, and listen simultaneously. TestingCatalog spotted references to Bidi 1 last week, with internal code presenting it as a "major leap in intelligence," and "the next generation of Voice." Bidi 1 is said to sit in the model selector under settings, besides the standard and advanced options. The voice bubble turns yellow once Bidi 1 is picked. According to a recent report from TestingCatalog, the new model has already begun rolling out to a subset of ChatGPT app users, suggesting a possible release this week. The model is said to offer small, natural acknowledgments, like an "okay," when you pause or slow down, without cutting you off. It is also said to switch tasks on the fly: for example, ask it to count to ten, interrupt to reverse the count, and it adjusts immediately. Perhaps one of the biggest changes would be that the model is said to hold the thread of the whole long conversation, rather than dropping earlier context, the weak point that has long dogged ChatGPT's current voice stack. It also no longer jumps in during longer pauses. The report further highlights that the release of Bidi 1 can be seen as a way for OpenAI to close the gap between its capable text models and its older voice layer. This is important because OpenAI is betting that speech will be the primary way most people access AI, rather than text. 
OpenAI has not yet announced the Bidi 1 model, nor has it detailed GPT 5.6 yet. Hopefully, the company releases official details soon.
[2]
OpenAI may introduce GPT Bidi 1 model for users soon: Here is what it may do
HIGHLIGHTS OpenAI may soon add a new voice feature to ChatGPT. The update could make conversations feel more natural. The feature is reportedly being tested with some users. OpenAI is working to make ChatGPT more human-like and user-friendly as the company tests its latest voice model called GPT Bidi 1. With this new feature, the company is said to improve the AI model, ensuring that it can understand people better and communicate in a more natural way. Reports also suggest that with the new changes ChatGPT may become more useful as an everyday assistant, and it will only add on to its current capabilities, like stronger coding. Furthermore, you can also use the AI tools to complete tasks without even typing what you want. OpenAI has been tight-lipped about the new feature. However, the leaks and rumours had shed enough light on the upcoming AI feature. Here's everything you need to know about the GPT Bidi 1. According to a report from TestingCatalog, GPT Bidi 1 has started appearing for a small number of ChatGPT app users, which may indicate that a wider rollout is coming soon. The development follows OpenAI's continued investment in voice-based AI experiences. What is GPT Bidi 1? OpenAI is said to expand its voice modules under the name GPT Bidi 1. The feature is currently being tested within ChatGPT, as some users have reported that they received the new feature. GPT Bidi 1 is anticipated to stand for 'bidirectional', which means that the AI will be able to listen and speak at the same time much like us humans. Reports indicate that the users can find the GPT Bidi 1 under the ChatGPT's voice settings alongside the existing Standard and Advanced voice options. What GPT Bidi 1 may offer One of the biggest reported improvements is natural conversation flow. 
The model can reportedly provide small acknowledgements, such as "okay", when a user pauses, without taking over the conversation. Additionally, the AI can supposedly perform better when it comes to handling interruptions. For instance, if the person requests the AI to count numbers from one to ten but decides to change the direction in between, the AI can reportedly change its actions to suit the new instruction immediately. Moreover, an important improvement that can potentially occur is the capability of the model to remember past conversations within the voice chats. The model can apparently manage to remember the past parts of the conversation instead of getting lost after a few exchanges. In addition, the AI can apparently avoid interrupting the conversation at long pauses. How to get GPT Bidi 1 At the time of writing, OpenAI has not officially announced anything related to GPT Bidi 1. However, reports suggest that the feature has automatically started appearing for some users inside the ChatGPT app. Hence, we can assume that more users will be provided access to the feature in a similar manner, and they won't have to subscribe or do any additional steps.
OpenAI is testing a new bidirectional audio model called GPT Bidi 1 that represents a major upgrade to ChatGPT's voice capabilities. The leaked feature allows the AI to listen and speak at the same time, provide natural acknowledgments without interrupting, and maintain conversation context throughout long exchanges. Early reports suggest it has already started rolling out to select ChatGPT app users.
OpenAI is quietly testing a new voice model that could transform how users interact with ChatGPT. The unannounced GPT Bidi 1, spotted by TestingCatalog last week, has already begun appearing for a small subset of ChatGPT app users, signaling a possible wider release this week [1]. Internal code references describe it as a "major leap in intelligence" and "the next generation of Voice," positioning it as a major upgrade to ChatGPT's voice capabilities [1].
The name Bidi is said to be shorthand for bidirectional design, which lets the AI listen and speak at the same time, much as people do in natural conversation [2]. The bidirectional audio model reportedly sits in the model selector under settings, alongside the Standard and Advanced voice options, and the voice bubble turns yellow once it is selected [1].
One of the most significant reported improvements is how GPT Bidi 1 handles interruptions. When users pause or slow down, the model is said to offer small, natural acknowledgments like "okay" without cutting them off [1]. It can also reportedly switch tasks on the fly: ask it to count to ten, interrupt to reverse the count, and it adjusts immediately [1].
The new voice model also targets a weakness that has long dogged ChatGPT's current voice stack: maintaining conversation context. GPT Bidi 1 reportedly holds the thread of an entire long conversation rather than dropping earlier context, and it no longer jumps in during longer pauses [1]. Remembering earlier parts of the conversation, instead of getting lost after a few exchanges, could make it more useful as an everyday assistant [2].
The release of GPT Bidi 1 can be seen as OpenAI's effort to close the gap between its capable text models and its older voice layer, a strategic move given that the company is betting speech, rather than text, will be the primary way most people access AI [1]. The leak aligns with broader reports that OpenAI plans to turn ChatGPT into a superapp, with a major overhaul focused on its Codex coding tool and agentic AI tools that can perform tasks for users [1].
At the time of writing, OpenAI has not officially announced GPT Bidi 1, nor has it detailed GPT 5.6 [1]. Reports suggest the feature has started appearing automatically for some users inside the ChatGPT app, which implies that more users will gain access the same way, without needing to subscribe or take additional steps [2]. The development follows OpenAI's continued investment in voice-based AI experiences aimed at making the speech interface more natural and accessible for daily use [2].
Summarized by Navi