Curated by THEOUTPOST
On Wed, 20 Nov, 12:11 AM UTC
5 Sources
[1]
OpenAI Brings ChatGPT's Advanced Voice Mode to Your Browser
To start, web access is limited to paid users with a Plus, Enterprise, Teams, or Edu account. OpenAI this week continued its rollout of ChatGPT's Advanced Voice Mode by adding support for web browsers. Previously, the feature was only available on the desktop and mobile apps. "You can now talk to ChatGPT right from your browser," Kevin Weil, OpenAI's CPO, tweeted alongside a short video showing Advanced Voice Mode responding to an inquiry about Greek mythology. To start, web access is limited to paid users with a Plus, Enterprise, Teams, or Edu account. "We'll look to roll to free users in the coming weeks," Weil says. Voice mode on the web uses OpenAI's ChatGPT-4o model, so it should work the same as it does on the mobile and desktop apps, providing a natural-sounding back-and-forth with the AI. After a delay, OpenAI started a small Advanced Voice Mode rollout in July before a larger release to Plus and Team users in September. This comes after OpenAI launched an official Windows ChatGPT app last week, which lets you open a "companion window" on your PC and easily use the AI chatbot alongside any other Windows programs. An update to the ChatGPT app for macOS also added the ability to read computer code from third-party apps. OpenAI is also rumored to be working on allowing ChatGPT to see what you're doing on your device and use that information to take action. Per TechRadar, this Operator Agent is the next great race between OpenAI, Google, and Anthropic and may change how people interact with AI on their computers.
[2]
ChatGPT's Advanced Voice Mode lands in your desktop browser - and it's a big step towards its rumored Operator agent
It's a vital first step towards browser-based AI agents for ChatGPT It's been a busy time for ChatGPT and OpenAI. Hot on the heels of rumors that ChatGPT Advanced Voice mode (the ability to have a free-flowing conversation with the AI) is about to get the ability to 'see', and rolling out the ChatGPT Windows app to all free users last week, it has just announced that Advanced Voice mode is now available in the browser-based version of ChatGPT, for paid subscribers only. So, if you're a ChatGPT Plus or Teams subscriber, a visit to ChatGPT.com (or the newly purchased Chat.com domain) will soon give you access to the Advanced Voice mode option that has previously only been available only in the app versions of ChatGPT. ChatGPT Advanced Voice Mode was released in September on mobile and was recently added to the desktop apps. The browser release is described as "rolling out", so you might not see the Advanced Voice mode when you log in with ChatGPT (we currently don't have access), but that should change in the coming days. Free users will eventually get access to Advanced Voice Mode too. In a post on X.com, which also contains a video that shows how ChatGPT Advanced Voice Mode works in a browser, Kevin Weil, CPO of OpenAI said, "We'll look to roll to free users in the coming weeks." ChatGPT Advanced Voice mode is a vital first step towards the rumored ChatGPT Operator Agent, a tool that might change the way we interact with our computers and technology in general. ChatGPT Operator Agent is an AI Agent that can interact directly with your computer on your behalf. Agents aren't unique to OpenAI - everybody from Anthropic to Google and Microsoft is also developing autonomous AI agents that can see what's on your screen and interact with it. You could, for example, get an AI Agent to pay your bills, or book a holiday for you, taking the virtual assistant model to the next level. Voice control in the browser will be a necessary first step for using an AI Agent since the majority of its work will be browser-based. Don't expect the announcements from OpenAI to slow down before the end of the year. We're still expecting ChatGPT search, which launched recently for paid users, to be made available to users on the free tier any day now. It launched with the note, "We'll roll out to Free users over the coming months."
[3]
OpenAI brings ChatGPT's Advanced Voice Mode to the web | TechCrunch
OpenAI is expanding ChatGPT's Advanced Voice Mode feature to the web, letting users talk to the AI chatbot right from their browser. The company's chief product officer, Kevin Weil, announced the launch on X. The feature, which makes ChatGPT more natural to speak with, is rolling out to ChatGPT's paying customers this week, which means you need to be a Plus, Enterprise, Teams or Edu subscriber to access it. The web launch follows the debut of OpenAI's Advanced Voice Mode in ChatGPT's iOS and Android apps in September. Advanced Voice Mode uses OpenAI's GPT-4o's native audio capabilities to allow for natural, real-time conversations between users and ChatGPT. The chatbot is able to understand and respond to non-verbal cues, including things like your talking speed. The chatbot can also respond with emotion. To start a voice conversation on the web, you need to select the Voice icon on the bottom-right of ChatGPT's prompt window. You will then have to give your browser permission to access your computer's microphone. Once you have started a voice conversation, you will be taken to a screen with a blue orb in the center. You can choose from nine output voices for ChatGPT, each of which has its own tone and character. For instance, you can select "Arbor," which is "easygoing and versatile," or, you can choose "Ember," which is "confident and optimistic." Weil says OpenAI will look to roll out the feature to free users in "the coming weeks." Users on a Plus and Team subscription are subject to a limit each day when using Advanced Voice Mode. According to a help page on the feature, daily limits may change. OpenAI will notify you when you have 15 minutes left of advanced voice for the day. Free users will get access to a monthly preview to try the feature.
[4]
OpenAI just launched ChatGPT Advanced Voice Mode for the web -- here's how to get it
OpenAI today announced the expansion of ChatGPT's Advanced Voice Mode to the web. Currently only available to paid subscribers, this latest update enhances the platform's interactivity by enabling voice-based conversations directly through a web browser. This development gives users another way to engage with the chatbot, this time with a natural, real-time audible exchange on the web. Moving beyond traditional text inputs, Advanced Voice Mode was previously only accessible to subscribers of ChatGPT's premium services on mobile. The recent update extends this functionality to the web, broadening availability to a wider audience. ChatGPT Plus subscribers can now initiate voice conversations by clicking the Advanced Voice icon adjacent to the input prompt bar, which activates a pulsing blue orb indicating readiness for voice communication. This feature leverages OpenAI's GPT-4o model, known for its native audio capabilities, facilitating more natural and responsive conversations. The model can interpret non-verbal cues, such as speech speed and emotional tone, allowing for a more nuanced understanding and interaction. Additionally, the AI can be interrupted or told to recall information in real time. The company's chief product officer, Kevin Weil announced that users can try the new Advance Voice feature by going to the site and either logging in as a Plus subscriber or starting a subscription and mentioned that they hope to roll out the feature for free to users within the next few weeks. This update aligns with OpenAI's commitment to enhancing user experience by integrating more intuitive interaction methods. By incorporating voice capabilities into the web platform, OpenAI aims to make AI interactions more personal and engaging, catering to users who prefer verbal communication over typing. The rollout of Advanced Voice Mode on the web is part of OpenAI's broader strategy to democratize access to advanced AI tools, ensuring that both free and premium users can benefit from these innovations at some point in the near future. This move is expected to foster greater user engagement and expand the practical applications of ChatGPT across various domains. This latest integration of voice functionalities represents a significant step toward more immersive and human-like interactions.
[5]
You can now talk with ChatGPT's Advanced Voice Mode on the web
Now you can ask ChatGPT for help - out loud - right from your web browser. Here's how it works. If you rely on ChatGPT for your everyday workflow, you are likely used to having a tab open with the chatbot on your desktop at all times. Now, right from your desktop, you'll have the chance to access OpenAI's Advanced Voice Mode -- and you'll want to. Also: Google's Gemini Advanced gets a very useful ChatGPT feature - but how does it compare? On Tuesday, OpenAI announced -- via an X post -- that Advanced Voice Mode is beginning to roll out on the web, extending the voice assistant's availability beyond the desktop and mobile apps. This rollout makes Advanced Voice Mode the most accessible it has ever been, as it removes the barrier of downloading an app to get started. Advanced Voice refers to OpenAI's AI-powered voice assistant, which can be interrupted, hold multi-turn conversations, and respond to user emotions for a much more intuitive and helpful conversation experience. It tackles the issue that most voice assistants have when struggling to understand what is said. Although it sounds too good to be true, in my testing, Advanced Voice Mode has been skilled at carrying out lengthy conversations and understanding what I mean even when my thoughts are not linear. Some lighthearted use cases include chatting with ChatGPT about your day, playing a trivia game, or talking about yourself. Still, it has the same practical use cases as a regular voice assistant. Also: Microsoft offers $4 million in AI and cloud bug bounties - how to qualify Unfortunately, despite the expansion of availability, users are still required to subscribe to ChatGPT Plus, which costs $20 per month. If you are a ChatGPT superuser, the upgrade may be worth it as it comes with other perks such as access to all of the latest OpenAI models, including o1-preview, five times more messages for GPT-4o, image generation, and more. Users will still be unable to access Voice Mode's multimodal capabilities, including assisting with content on users' screens and using the user's phone camera as context for a response, for which OpenAI has still not shared a release date.
Share
Share
Copy Link
OpenAI has rolled out ChatGPT's Advanced Voice Mode for web browsers, allowing users to have voice conversations with the AI chatbot directly from their desktop. Initially available for paid subscribers, this feature marks a significant step in AI interaction and accessibility.
OpenAI has taken a significant step in enhancing user interaction with AI by expanding ChatGPT's Advanced Voice Mode to web browsers. This move allows users to engage in voice conversations with the AI chatbot directly from their desktop, marking a new era in AI accessibility and functionality 12.
The Advanced Voice Mode, previously limited to mobile and desktop apps, is now being rolled out to web browsers. Initially, access is restricted to paid subscribers, including those with Plus, Enterprise, Teams, or Edu accounts 13. OpenAI's Chief Product Officer, Kevin Weil, announced that the company plans to extend this feature to free users in the coming weeks 24.
The web version of Advanced Voice Mode utilizes OpenAI's ChatGPT-4o model, known for its native audio capabilities 13. This integration enables:
To initiate a voice conversation, users need to select the Voice icon in the bottom-right corner of ChatGPT's prompt window and grant microphone access to their browser 34.
This development is seen as a crucial step towards more immersive and human-like interactions with AI. It aligns with OpenAI's strategy to make AI tools more accessible and intuitive 4. The integration of voice capabilities on the web platform caters to users who prefer verbal communication over typing, potentially expanding the practical applications of ChatGPT across various domains 45.
While this update significantly enhances ChatGPT's accessibility, some limitations remain:
Industry experts speculate that this development could be a precursor to more advanced features, such as the rumored ChatGPT Operator Agent. This potential future tool might allow AI to interact directly with users' computers, performing tasks like paying bills or booking holidays 2.
OpenAI's move to expand Advanced Voice Mode to web browsers positions ChatGPT competitively in the AI market. It comes at a time when other tech giants like Google, Microsoft, and Anthropic are also developing autonomous AI agents with similar capabilities 2. This development in voice interaction technology represents a significant advancement in making AI more accessible and user-friendly for a broader audience.
Reference
[1]
[2]
OpenAI launches a new voice-based interaction feature for ChatGPT Plus subscribers, allowing users to engage in conversations with the AI using voice commands and receive spoken responses.
29 Sources
29 Sources
OpenAI brings ChatGPT's Advanced Voice Mode to Windows and Mac desktop apps, offering users a more natural and intuitive way to interact with AI through voice conversations.
6 Sources
6 Sources
OpenAI has rolled out an advanced voice mode for ChatGPT, allowing users to engage in verbal conversations with the AI. This feature is being gradually introduced to paid subscribers, starting with Plus and Enterprise users in the United States.
12 Sources
12 Sources
OpenAI has finally released its advanced voice feature for ChatGPT Plus and Team users, allowing for more natural conversations with the AI. The feature was initially paused due to concerns over potential misuse.
14 Sources
14 Sources
OpenAI introduces an advanced voice mode for ChatGPT, allowing users to have spoken conversations with the AI. This feature is currently available for Plus and Enterprise users on iOS and Android devices.
2 Sources
2 Sources
The Outpost is a comprehensive collection of curated artificial intelligence software tools that cater to the needs of small business owners, bloggers, artists, musicians, entrepreneurs, marketers, writers, and researchers.
© 2025 TheOutpost.AI All rights reserved