Curated by THEOUTPOST
On Thu, 31 Oct, 4:05 PM UTC
6 Sources
[1]
OpenAI ChatGPT Advanced Voice Mode Arrives on Desktop
OpenAI's ChatGPT has taken a significant leap forward with the introduction of Advanced Voice Mode for desktop users. This feature represents a major milestone in AI communication, giving users a more natural and intuitive way to interact with artificial intelligence. With seamless voice-based conversations, ChatGPT enables engagement that closely resembles human interaction, maintaining context and flow throughout the dialogue. Advanced Voice Mode is designed to make interactions with AI more intuitive and inclusive, capable of understanding various accents and speech patterns. Whether you're navigating complex software issues or simply looking for a more engaging way to tell a bedtime story, this mode aims to transform how we communicate with technology. This feature goes beyond simple conversation. Advanced Voice Mode offers practical capabilities that cater to both technical and creative needs. The AI remembers past interactions, providing personalized advice and resuming conversations where they left off. From assisting with coding challenges to suggesting creative ways to integrate email newsletters into Slack, this tool is built to make your digital life smoother and more productive. The Advanced Voice Mode is built on enhanced voice recognition technology that supports a wide range of accents and speech patterns. This inclusivity ensures that users from diverse linguistic backgrounds can effectively communicate with the AI, breaking down barriers and creating a more accessible platform for global users. To utilize Advanced Voice Mode on your desktop: Please note that Advanced Voice Mode is currently not supported in the web browser version of ChatGPT. To access this feature, the desktop application is required. One of the standout features of the Advanced Voice Mode is its ability to switch between multiple accents. This capability adds a new dimension to AI interaction, particularly in the realm of storytelling and narrative experiences. Whether you're: The AI adapts its vocal output to match your chosen accent, creating a more immersive and engaging experience. This feature not only enhances the quality of interactions but also opens up new possibilities for creative and educational applications. As AI becomes increasingly integrated into our daily lives, the importance of ethical AI interactions cannot be overstated. The Advanced Voice Mode places a strong emphasis on providing ethically sound responses, particularly when addressing unusual or potentially problematic AI behavior. Users are encouraged to: This focus on ethical AI use helps ensure that interactions remain beneficial and aligned with human values, fostering trust between users and AI systems. Below are more guides on Voice recognition from our extensive range of articles. Beyond enhancing conversational experiences, the Advanced Voice Mode offers practical assistance for a variety of desktop-based tasks. Users can benefit from: For instance, the AI can suggest creative ways to integrate email newsletters into Slack channels, potentially streamlining communication and boosting team productivity. This practical approach ensures that the Advanced Voice Mode is not just a novelty but a valuable tool for everyday use. A key feature of the Advanced Voice Mode is its ability to remember details from previous conversations. This memory function allows for highly personalized interactions, as the AI can: By building a profile of your technical and creative pursuits, the AI ensures that each interaction is relevant, engaging, and tailored to your unique needs. The Advanced Voice Mode caters to a wide spectrum of user interests, from technical problem-solving to creative exploration. It can engage in discussions about innovative technology while also exploring abstract creative concepts. For example, it might help you brainstorm ideas for visual art that combines themes of food, sensuality, and danger, pushing the boundaries of AI-assisted creativity. For users looking to improve their language skills, the Advanced Voice Mode offers valuable pronunciation assistance. It can: This feature is particularly beneficial for non-native speakers, as it helps enhance language learning and comprehension in a practical, interactive manner. OpenAI's ChatGPT Advanced Voice Mode for desktop represents a significant advancement in AI communication technology. By combining natural language processing with voice interaction, personalized responses, and practical applications, it offers users a powerful tool for both personal and professional use. As this technology continues to evolve, it has the potential to reshape how we interact with AI in our daily lives, making digital assistance more intuitive, accessible, and valuable than ever before.
[2]
You Can Now Use ChatGPT Advanced Voice on Mac and Windows Apps
Users can also set custom instructions for the Advanced Mode ChatGPT Advanced Voice Mode, a feature that first started rolling out in September, is now being added to the artificial intelligence (AI) chatbot's desktop apps. Announced on Thursday, OpenAI's native chatbot will now offer a human-like voice chat experience to Mac and Windows users. The feature was first unveiled at the OpenAI Spring Updates event in May and it can express emotions, modulate the voice, and react to what the user is saying. So far, only the paid subscribers of the platform have access to the feature. In a post on X (formerly known as Twitter), the official handle of OpenAI announced that the Advanced Voice Mode has been rolled out to macOS and Windows desktop apps. The move is interesting as major AI firms have started focusing their attention towards the desktop to offer more powerful and comprehensive AI capabilities to users. On the same day, Anthropic released its desktop apps for Mac and Windows, paving the way for the Computer Use tool. Google is also reportedly working on a new agentic AI browser tool that will be able to complete tasks such as booking movie tickets and purchasing a product. Now, with OpenAI's Advanced Voice, users can finally utilise the full capability of voice-based AI in a desktop environment. Notably, so far the feature was only available to Android and iOS apps. Some of the ways users can take advantage of the ChatGPT Advanced Voice Mode is by verbally prompting the AI to write a code, or having a back-and-forth while writing a research paper or college assignment. Users can also upload data files and then have a two-way conversation about its analysis and insights. ChatGPT app users will find the option to turn on the Advanced Voice Mode by tapping the waveform icon placed next to the text field. Tapping on the icon activates the new voice mode. Users now have five new voices to choose from -- Vale, Spruce, Arbor, Maple, and Sol. Each of these voices has a different pitch, tonality, and regional accent. However, the feature is still only available to the ChatGPT Teams and Plus users. Additionally, those residing in the EU, the UK, Switzerland, Iceland, Norway, and Liechtenstein will not get the new feature.
[3]
ChatGPT Advanced Voice is now on macOS and Windows
OpenAI has rolled out its Advanced Voice mode for the desktop versions of ChatGPT, available on macOS and Windows. Previously exclusive to mobile versions, this feature expands the capabilities of the desktop ChatGPT app, allowing users to have voice conversations with the AI in a more natural way. The announcement came with the tagline, "Big day for desktops," highlighting the importance of this update for desktop users. While the macOS version of the ChatGPT app has been available for some time, the Windows version has just been launched, bringing the convenience of ChatGPT directly to PC users. Advanced Voice mode, however, was not part of the browser-based ChatGPT experience, making this new feature a significant addition to the desktop versions. The voice functionality on desktop closely mirrors that of the mobile versions. Users can click the Advanced Voice icon next to the prompt bar, opening a new window that shows the familiar floating blue orb, which pulses as ChatGPT listens. This feature allows users to hold conversations with the AI using any of the nine available voices. To change voices, users simply click an icon in the top right corner of the screen. ChatGPT Advanced Voice mode uses OpenAI's latest ChatGPT-4o model, ensuring interactions are as seamless as possible. Users can interrupt the AI whenever needed, prompting it to stop talking and listen, which helps keep conversations efficient and focused. This function is particularly useful when responses become lengthy or if the conversation takes an unexpected turn. Accessing Advanced Voice mode on both desktop and mobile platforms requires a ChatGPT Plus subscription, which costs $20 per month. There is, however, an option for free-tier users, though it limits voice interactions to ten minutes per month. The voice feature has been available in the U.S. for a while and recently became available in Europe, broadening its accessibility to a larger user base. A new version of ChatGPT, named Orion, is rumored to be released soon, though OpenAI CEO Sam Altman has dismissed these claims as "fake news." The recent launch of the Windows version of ChatGPT brought with it some notable limitations. Unlike the macOS version, which had Advanced Voice mode integrated earlier, the initial release of the Windows app did not include this feature. The absence of Voice mode meant that Windows users were unable to use one of the most anticipated functions of ChatGPT, leaving the experience feeling somewhat incomplete. OpenAI made sure to create an intuitive experience for Windows users despite the missing features. To get started, users need to download the app from OpenAI's official website, which then redirects them to the Microsoft Store for the actual installation. Once installed, users can summon ChatGPT by pressing Alt + Space, providing a quick and easy way to access the chatbot without leaving their current tasks. This functionality is designed to embed ChatGPT deeper into users' workflows, allowing for easier and more direct interaction. Advanced Voice mode in ChatGPT offers a way for users to communicate with the AI more naturally. You can use it to set reminders, ask questions about anything from work-related topics to general knowledge, or simply chat about daily matters. The ability to interrupt the AI mid-sentence allows for a more dynamic conversation, unlike many voice assistants that require a set question-and-answer format. A significant advantage of using Advanced Voice mode on desktop is its integration into daily tasks. By pressing Alt + Space on Windows or simply clicking the ChatGPT icon on macOS, users can instantly access the voice feature without needing to switch contexts or open a browser. This reduces friction, making ChatGPT more of an integrated productivity tool rather than a standalone service. Another notable aspect of the desktop version is its similarity to the mobile experience. Users accustomed to using voice mode on their phones will find the desktop version intuitive and easy to use. The floating blue orb, the option to change voices, and the interaction flow are all designed to offer a consistent experience across devices, ensuring that users can transition between platforms.
[4]
ChatGPT Advanced Voice is now on Mac and Windows -- how to get access
OpenAI is finally bringing Advanced Voice mode to the desktop. It will be available in both the Windows and Mac versions of the ChatGPT app and works the same as the mobile release. This means you can finally have a conversation with your computer. Not in the way that you can talk to Siri or Alexa (and yes, they were both triggered as I dictated this copy), but a full conversation as if you were talking to another human being. Advanced Voice is native speech-to-speech. This means that OpenAI's voice bot can understand everything you say, how you say it, and even the pauses between your words. It responds just as naturally, including adding vocal tics such as "ums" and breathing sounds between each sentence. We still don't quite have the full promise made during OpenAI's spring update of screen sharing and live video with ChatGPT, but it is coming eventually and this is still a major upgrade on other voice models. You access Advanced Voice in the desktop app in the same way you would in iOS or Android -- click the icon in the chat bar. Once you click the button, it will open a new view with that now infamous gradiating blue circle. You can continue talking to the AI while you get on with other tasks. And while it can't see what you're doing, it can respond to descriptions of the task or your performance. So for example, if you're using it while playing Minecraft, you could describe the scene, and it could propose a building or block type to use. Bringing Advanced Voice to the desktop is the next logical step for OpenAI and further cements ChatGPT as more than just a gimmick, but a full productivity platform. Being able to hold a conversation with an AI allows you to brainstorm ideas or perform tasks that you might not be able to do alone. In the future, you'll be able to also share your screen with Advanced Voice so it can watch what you're doing. And one day, as AI agents take off, you may even be able to have it take control of your screen and talk you through a process. While Advanced Voice is an incredibly useful tool, what's more powerful is the underlying real-time API. This is the back end of Advanced Voice used by developers to build their own versions or build them into their own tools. During a recent briefing I had with the OpenAI team, the company's developer liaison lead, Romain Huet, showed this impressive demo of the solar system. You could instruct the voice to move between planets, and it was able to offer insights into the nature of each of the worlds that we visited in real-time and answer questions in a conversational style. In another demo, he showed off using it as a virtual travel agent to help you not just book a flight but find the best deal. You could tell it your explicit requirements, and it could ask questions or follow up with feedback based on what was available, rather than the logic tree approach that we see from automated calls at the moment. All of these features are going to start to roll out, not just in OpenAI's apps but in apps from other developers over the coming months and years. I think voice is going to become the new way that we all interact with our computers. Now I just need to find a better dictation software that doesn't require me to spend hours going back over everything that I typed with my voice to fix the glaring errors.
[5]
OpenAI launch ChatGPT Advanced Voice mode on desktop and now PCs and Macs can join the conversation
ChatGPT Advanced Voice mode arrives on desktop for Windows and Mac. OpenAI has just announced ChatGPT Advanced Voice mode is now available for the Mac and PC versions of its chatbot, in addition to the mobile versions. The update was revealed with the phrase "Big day for desktops" in a tweet on X.com. While the Mac version of the ChatGPT app has been out for a while now, the Windows version only just launched. Until now, however, Advanced Voice mode wasn't available as it currently does not work in the browser-based version of ChatGPT. We've tried the desktop version of Advanced Voice mode on PC and the experience is refreshingly similar to the mobile version: You click the Advanced Voice icon that's on the right of the prompt bar and a new window pops up with the familiar floating blue orb that pulses as ChatGPT listens. You can immediately start having a free-flowing, natural conversation with ChatGPT using one of its nine different voices about pretty much any subject you like. To change voice you simply need to hit the icon in the top right of the screen and you can switch between its nine different voices. Advanced Voice mode uses ChatGPT-4o, which is OpenAI's most accessible current model, for all interactions. A key feature of ChatGPT Advanced Voice mode is that you can interrupt the AI at any time and it should stop talking and start listening to what you're saying. This is particularly handy when you find its answers are going on a bit too long, and it also helps keep the conversation going. As with the mobile version, you need to be a ChatGPT Plus subscriber ($20, £16, AU$30) to access Advanced Voice Mode, but there is an option for people to use it on the free tier, although it's limited to just 10 minutes a month of talk time. ChatGPT Advanced Voice mode has been available in the US for some time now but recently launched in Europe. A new version of ChatGPT called Orion is rumored to be released before the end of the year, but Open AI CEO Sam Altman has dismissed the rumor as 'fake news'.
[6]
One of the most impressive features of ChatGPT finally arrives on its desktop app
OpenAI has released a long-awaited update for the ChatGPT app on Mac and Windows: the advanced voice feature. This feature, which was previously only available in the mobile ChatGPT apps, represents a new, much more natural method of interaction, allowing us to converse with the language model in a fluid manner. It also adds a more realistic voice and the option to pause during the conversation. Hello, I'm ChatGPT, what would you like to talk about? Although this update marks a significant advancement for OpenAI, it is not intended to replace the assistants we already know. The coexistence of ChatGPT with other assistants like Siri on Mac or Microsoft Copilot on Windows is clear, as their functioning is, in fact, entirely different. Although in the case of the Mac, ChatGPT is integrated into Siri thanks to Apple Intelligence (which we can now use on Windows), the goal of each of our assistants is different. While ChatGPT can discuss particle physics with us, it is thanks to Siri or Microsoft Copilot that we can adjust the screen brightness, the volume, or perform a search in our files. Beyond this, the updates in the ChatGPT desktop app come at a time when Apple and other manufacturers are renewing their devices to enhance their performance with artificial intelligence tools. Good news that, along with OpenAI's new advanced voice mode, allow us to have a conversation with our computer in a way very similar to how we would with any person. Quite impressive, if we stop to think about it.
Share
Share
Copy Link
OpenAI brings ChatGPT's Advanced Voice Mode to Windows and Mac desktop apps, offering users a more natural and intuitive way to interact with AI through voice conversations.
OpenAI has made a significant leap in AI communication by introducing ChatGPT's Advanced Voice Mode to desktop applications for both Windows and macOS [1][2]. This expansion marks a pivotal moment in human-AI interaction, offering users a more natural and intuitive way to engage with artificial intelligence through voice-based conversations.
The Advanced Voice Mode on desktop closely mirrors its mobile counterpart, allowing users to have seamless voice interactions with ChatGPT. Key features include:
The feature is designed to be inclusive and easily accessible:
Advanced Voice Mode utilizes OpenAI's latest ChatGPT-4o model, ensuring high-quality interactions [3]. However, there are some limitations:
The desktop version of Advanced Voice Mode opens up new possibilities for AI assistance in various tasks:
This development signifies a shift towards more integrated AI experiences in daily computing:
The rollout of Advanced Voice Mode on desktop platforms positions OpenAI competitively in the AI market:
As AI continues to evolve, ChatGPT's Advanced Voice Mode on desktop represents a significant step towards more intuitive and accessible AI interactions, potentially reshaping how we communicate with and utilize AI in our daily lives and work environments.
Reference
[1]
[2]
[3]
OpenAI has finally released its advanced voice feature for ChatGPT Plus and Team users, allowing for more natural conversations with the AI. The feature was initially paused due to concerns over potential misuse.
14 Sources
OpenAI launches a new voice-based interaction feature for ChatGPT Plus subscribers, allowing users to engage in conversations with the AI using voice commands and receive spoken responses.
29 Sources
OpenAI introduces an advanced voice mode for ChatGPT, allowing users to have spoken conversations with the AI. This feature is currently available for Plus and Enterprise users on iOS and Android devices.
2 Sources
OpenAI has rolled out ChatGPT's Advanced Voice Mode for web browsers, allowing users to have voice conversations with the AI chatbot directly from their desktop. Initially available for paid subscribers, this feature marks a significant step in AI interaction and accessibility.
5 Sources
OpenAI has rolled out an advanced voice mode for ChatGPT, allowing users to engage in verbal conversations with the AI. This feature is being gradually introduced to paid subscribers, starting with Plus and Enterprise users in the United States.
12 Sources
The Outpost is a comprehensive collection of curated artificial intelligence software tools that cater to the needs of small business owners, bloggers, artists, musicians, entrepreneurs, marketers, writers, and researchers.
© 2024 TheOutpost.AI All rights reserved