5 Sources
[1]
Google adds voice-based prompting to Docs and Keep | TechCrunch
At the Google I/O developer conference, Google announced it's bringing a voice-based prompting feature to Workspace apps such as Docs, Keep, and Gmail. These features can help you create drafts, take notes, and search for emails. In Docs, you can create a draft document using your voice. For instance, in a demo shown by TechCrunch, Google showed that a user can fetch resume details from Drive, add event logistics from an email on top of the document, and even include some humorous anecdotes. Previously, users would have to type all this out, and it's possible that, while typing, they might write short sentences and then various follow-ups, resulting in a multi-turn conversation that took time. Google's idea is that with voice, you can use long sentences or ask for multiple tasks in one go. Plus, the feature understands when you change your mind and ask for a detail to be changed in the same conversation turn. Google CEO Sundar Pichai said that in the future, users will be able to create and edit documents using their voice. What's more, Google is adding a way to use your voice to dump your thoughts into Keep, and the app will use AI to turn that transcription into a structured note or a list. Note-taking apps like Voicenote.com and AudioPen added this kind of feature a few years ago. Lately, dictation apps such as Wispr Flow, Monolouge, and Aqua voice have built this functionality into their voice-based typing products. Earlier this month, Google released its own dictation product called Rambler, which is built into Gboard and works across apps. Besides Docs and Keep, the company is adding voice-based functionality to Gmail. With the new feature, users will be able to converse with Gemini and ask for details like their next flight, code to their Airbnb booking, or time for the physician appointment. Tech companies are cramming AI into all products and features, and as a result, users are getting attuned to asking lengthier queries. In some cases, voice is an easier input method to blurt out long sentences and queries describing complex multi-step requests. What's more, the current generation of models is good enough to understand if users change their mind about something in the middle of a sentence and output the final query accordingly. Google is paying attention to this trend and adding more voice-based features across its apps.
[2]
Google brings more conversational features to Gmail, Docs and Keep - Engadget
Google thinks that it's good to speak out loud and so, to that end, is bringing more conversational features to Gmail, Docs and Keep in the form of Live. Gmail Live, for instance, will let you ask your inbox a natural language question to save you the turmoil of having to search for the keyword yourself. In its example, say you're rushing to the airport and need to know your gate number, you can now just ask the system "What's my flight's gate number?" while hoping the system isn't hallucinating. In many ways, it feels a lot like Ask Gemini but with each one plugged in to the specific app in question. Similarly, Docs Live enables you to talk through your initial ideas for a document which the system will then organize for you. With your permission, it will then take that structure and delve into your Gmail and Drive to add in any relevant details from your own notes and communications. If you're feeling brave enough, you can even ask the system to pull any relevant information from the internet. It's the same situation in Keep, where you can dictate reminders to the system for it to impose some form of structure upon. Whatever you vomit forth from your mind, Google says Keep will now be able to understand, curate and create reminders and prompts for whatever's relevant. As for when you'll be able to take advantage of these features, Google says they'll roll out to AI Pro and Ultra subscribers this Summer. Around the same time, it'll be available as a preview to Google Workspace business customers.
[3]
Google brings voice prompting to Docs, Keep, and Gmail at I/O 2026
Google is betting that the future of productivity software starts with your voice, not your keyboard. At its I/O 2026 developer conference on Monday, the company unveiled voice-based prompting features for Docs, Keep, and Gmail, all powered by its Gemini AI models. The headline feature is Docs Live, which lets users create and edit documents entirely by speaking. In a demo, Google showed a user verbally instructing the tool to pull résumé details from Drive, layer in event logistics from an email thread, and sprinkle in a few humorous anecdotes, all in a single, unscripted stream of speech. The idea is that voice enables longer, more complex prompts than most people would bother typing out, and that current models are now good enough to follow along even when a speaker changes direction mid-sentence. CEO Sundar Pichai framed the shift as inevitable, saying that users will soon create and edit documents using voice as a matter of course. It is a bold claim, but the technical groundwork is arguably already in place. Google recently launched a standalone dictation product called Rambler, built into its Gboard keyboard, which strips filler words and handles multilingual code-switching on the fly. Rambler shipped earlier this month for Samsung Galaxy and Google Pixel devices. Keep is getting a similar voice overhaul. Users will be able to dump a stream of unstructured thoughts, jumping from gift ideas to grocery lists to home renovation plans, and the AI will sort the transcription into separate, neatly organised notes. The concept is not new. Apps such as Voicenotes and AudioPen have offered voice-to-structured-text workflows for years, and desktop dictation tools like Wispr Flow, Monologue, and Aqua Voice have built loyal followings. What Google brings to the table is scale: Keep is already baked into the broader Workspace ecosystem, meaning voice notes can flow straight into Docs, Sheets, and the rest of the suite. Gmail, meanwhile, is gaining what Google calls Gmail Live, a conversational voice interface for your inbox. Instead of typing search queries, you can ask Gmail to surface specific details, flight confirmation codes, Airbnb check-in instructions, or your child's school schedule, and get spoken answers drawn from your messages. It is essentially an AI agent for your email, one that understands context well enough to handle multi-step requests. The broader trend here is clear. Users are asking increasingly complex, multi-part questions of AI tools, and voice is simply a more natural interface for that kind of interaction than a text box. Google is not the only company to notice. Its own Cloud Next conference last month showcased a wave of agentic AI features across Workspace, and rivals from OpenAI to Apple are racing to embed voice-first AI into their own productivity stacks. The new voice features will roll out this summer for Google AI Premium subscribers and Google Workspace business users. Whether talking to your documents catches on as a mainstream habit remains to be seen, but Google is clearly convinced that the keyboard's monopoly on productivity is overdue for a challenge.
[4]
Your favorite Google apps are getting a massive Gemini Live-like upgrade
Gmail Live will roll out this summer on Android and iOS to Google AI Pro and Google AI Ultra subscribers in the US, in English. It will also roll out in preview to Google Workspace business customers this summer. Keep's AI voice capabilities are very similar to Gboard's Rambler upgrade, but arguably even better. Users can just say what's on their mind, and Keep will handle the rest. AI voice capabilities will intelligently capture your spoken, free-flowing thoughts and organize them into neatly written notes in Keep. You could even talk about completely different things, like buying a gift, making a grocery list, and painting a spare room, and Keep will make separate notes for each topic.
[5]
Docs Live and voice AI in Google Keep let you talk to create, 'Google Pics' announced
Besides Gmail announcements, Google at I/O 2026 announced AI voice features for Docs and Keep. The company also unveiled "Google Pics" as the newest Workspace app. In Keep, you'll find a new floating action button with Google's Live icon above the "Create a note" FAB. This launches a fullscreen experience, which matches Gmail Live with a waveform that hugs the perimeter. Just talk, or ramble, to create notes. Google Keep will organize free-flowing thoughts into concise notes. Notably, your conversation can be about multiple notes, and Google will split them up for you. Notes are generated in real time with users able to edit them via voice before tapping "Save to Keep" at the bottom of the screen. This goes beyond transcription as the underlying Gemini model understands intent and what you're actually asking for. This feature will be coming to Google Keep on Android this summer for Google AI Pro and Ultra subscribers in English in the US. Similarly, Docs Live can be used to create and edit documents. It can organize your thoughts, structure the document, and grab relevant details from your Gmail, Drive, Chat, and the web. For example, you could ask for a travel itinerary that automatically pulls information from Gmail and Drive. This will roll out to Google Docs for Android and iOS this summer with Google AI Pro and Ultra in English globally. After Vids, the newest Workspace app is "Google Pics." This is an AI image generation and design app. The name is definitely too close to Google Photos, but it's clearly meant to be in the same ballpark as Google Vids. This is more so aimed at creating graphics that you'd use in business (presentations) or personal (making an event invite or poster) use cases. Advanced Gemini reasoning is leveraged to treat every element in an image as an editable object. Of note is how the interface lets you select a portion and edit as if you're leaving a Docs comment. Google Pics is fully integrated into Workspace, with future updates letting you access this editor in any application. It's first launching as a standalone website this summer for Google AI Ultra subscribers in English in the US.
Share
Copy Link
Google unveiled voice-based prompting features for Docs, Keep, and Gmail at its I/O 2026 developer conference. The new capabilities let users create documents, organize notes, and search emails entirely by speaking. Rolling out this summer to Google AI Pro and Ultra subscribers, the features mark a shift toward voice as the primary interface for productivity software.
Google is betting that productivity software starts with your voice, not your keyboard. At the Google I/O developer conference on Monday, the company unveiled voice-based prompting features for Docs, Keep, and Gmail, all powered by its Gemini AI models
1
3
. The new Google AI features represent a fundamental shift in how users interact with Google Workspace, moving from typed commands to natural language conversations that can handle complex, multi-step requests in a single breath.
Source: Android Authority
The headline feature is Docs Live, which lets users create and edit documents entirely by speaking. In a demo, Google showed a user verbally instructing the tool to pull résumé details from Drive, layer in event logistics from an email thread, and sprinkle in humorous anecdotes, all in a single, unscripted stream of speech
3
. Previously, users would have to type all this out, potentially writing short sentences and various follow-ups that resulted in a multi-turn conversation taking considerable time1
. Google's idea is that with voice, you can use long sentences or ask for multiple tasks in one go, and the feature understands when you change your mind and ask for a detail to be changed in the same conversation turn.
Source: Engadget
Gmail is gaining what Google calls Gmail Live, a conversational voice interface for your inbox
3
. Instead of typing search queries, you can ask Gmail to surface specific details like flight confirmation codes, Airbnb check-in instructions, or your child's school schedule, and get answers drawn from your messages. Say you're rushing to the airport and need to know your gate number—you can now just ask the system "What's my flight's gate number?" using natural language2
. It's essentially an AI agent for your email, one that understands context well enough to handle multi-step requests. In many ways, it feels similar to Ask Gemini but with each feature plugged into the specific app in question.Google Keep is getting a similar voice overhaul with AI voice capabilities that let users dump a stream of unstructured thoughts
5
. You could talk about completely different things, like buying a gift, making a grocery list, and painting a spare room, and Keep will make separate notes for each topic. In Keep, you'll find a new floating action button with Google's Live icon above the "Create a note" FAB, launching a fullscreen experience with a waveform that hugs the perimeter5
. This goes beyond transcription as the underlying Gemini AI model understands intent and what you're actually asking for. Notes are generated in real time with users able to edit them via voice before tapping "Save to Keep" at the bottom of the screen.The concept isn't entirely new. Apps like Voicenotes and AudioPen have offered voice-to-structured-text workflows for years, and desktop dictation tools like Wispr Flow, Monologue, and Aqua Voice have built loyal followings
3
. What Google brings to the table is scale: Keep is already baked into the broader Google Workspace ecosystem, meaning voice notes can flow straight into Google Docs, Sheets, and the rest of the suite. Earlier this month, Google released its own dictation product called Rambler, which is built into Gboard and works across apps1
.The new voice features will roll out this summer for Google AI Premium subscribers and Google Workspace business users
3
. Gmail Live will roll out this summer on Android and iOS to Google AI Pro and Google AI Ultra subscribers in the US, in English, and will also roll out in preview to Google Workspace business customers. Docs Live will roll out to Google Docs for Android and iOS this summer with Google AI Pro and Ultra in English globally5
. Keep's AI voice capabilities will be coming to Google Keep on Android this summer for Google AI Pro and Ultra subscribers in English in the US5
.Related Stories
Beyond voice features, Google also unveiled Google Pics as the newest Google Workspace app
5
. This is an AI image generation and design app aimed at creating graphics for business presentations or personal use cases like making an event invite or poster. Advanced Gemini reasoning is leveraged to treat every element in an image as an editable object, with the interface letting you select a portion and edit as if you're leaving a Docs comment. Google Pics is fully integrated into Workspace, with future updates letting you access this editor in any application. It's first launching as a standalone website this summer for Google AI Ultra subscribers in English in the US.CEO Sundar Pichai framed the shift as inevitable, saying that users will soon create and edit documents using voice as a matter of course
3
. The broader trend is clear: users are asking increasingly complex, multi-part questions of AI tools, and voice is simply a more natural interface for that kind of interaction than a text box. Tech companies are cramming AI into all products and features, and as a result, users are getting attuned to asking lengthier queries1
. In some cases, voice is an easier input method to blurt out long sentences and queries describing complex multi-tasking requests. What's more, the current generation of models is good enough to understand if users change their mind about something in the middle of a sentence and output the final query accordingly. Whether talking to your documents catches on as a mainstream habit remains to be seen, but Google is clearly convinced that the keyboard's monopoly on productivity is overdue for a challenge. This Gemini Live-like upgrade across core productivity apps signals Google's commitment to making voice the default interface for work.Summarized by
Navi
[4]
Yesterday•Technology

10 Apr 2025•Technology

Yesterday•Technology

1
Technology

2
Technology

3
Technology
