The Outpost is a comprehensive collection of curated artificial intelligence software tools that cater to the needs of small business owners, bloggers, artists, musicians, entrepreneurs, marketers, writers, and researchers.
© 2025 TheOutpost.AI All rights reserved
Curated by THEOUTPOST
On Sat, 22 Mar, 12:01 AM UTC
4 Sources
[1]
Google will let you make AI podcasts from Gemini's Deep Research
Emma Roth is a news writer who covers the streaming wars, consumer tech, crypto, social media, and much more. Previously, she was a writer and editor at MUO. Google's Gemini app now lets you generate Audio Overviews based on Deep Research. That means you can turn the in-depth reports generated by Gemini into a conversational podcast featuring two AI "hosts." Since launching Audio Overviews within its AI note-taking app NotebookLM last September, Google has been steadily adding to the feature by letting you guide and interact with the hosts. It also brought Audio Overviews to the Gemini app for free users and Advanced subscribers earlier this week, allowing you to transform slides and documents into AI podcast-like conversations. The feature should be even more helpful for Deep Research, Google's "agentic" AI feature that lets you call upon Gemini to explore a specific topic by scanning the web and generating a detailed report based on its findings. When Gemini finishes generating a report, you can select a new "Generate Audio Overview" option to listen to an Audio Overview based on the research.
[2]
Google Gemini Can Now Turn Almost Anything Into a Podcast
Summary Google Gemini introduces Audio Overviews, allowing users to create podcasts from uploaded documents. Audio Overviews utilize AI to generate realistic voices and engaging discussions on document content. Audio Overviews provide a convenient way to extract information from documents in a podcast format. They say that you're never more than six feet from a rat, and these days, the same is probably true of podcasters. It seems that almost everyone on the planet either has a podcast or is going to start one. With Google Gemini you can now create your own bespoke podcasts using a feature called Audio Overviews. All you need to do is upload a document, and Gemini will create a short podcast deep dive into the contents of the document with two AI hosts. What are Audio Overviews in Google Gemini? Audio Overviews is a new feature in Gemini that was previously available in Google's NotebookLM AI-powered note-taking app. The feature is able to summarize information in a unique way. Instead of giving you a bland text summary of the information, Audio Overviews generates an audio file of a podcast with the two AI-generated hosts discussing the information that you want summarized. The hosts have a back-and-forth conversation discussing the topic of whatever they are summarizing and asking questions of each other to glean more information on specific key points. The overall result is what sounds like a real podcast with two informed people discussing the topic at hand. In my testing, Gemini generated Audio Overviews that ranged between five and fifteen minutes in length, depending on how much content was in the uploaded documents. The 15-minute podcast, for example, was generated from a 146-page manual for an SLR camera, while even a single-page PDF of a garbage collection schedule generated a podcast that was five minutes long. Related How to Use GarageBand to Record a Podcast Plus, some recommendations for better tools when you need more flexibility. Posts What Can You Use to Generate Audio Overviews? You can create Audio Overviews from a wide range of different sources in Gemini. You can upload a document, and Gemini will turn whatever information the document holds into your own bespoke podcast. These don't just have to be text documents, either; you can upload a Google Slides presentation, and Gemini will create an Audio Overview based on the content of the slides. Another really useful option is that you can generate Audio Overviews from a Deep Research report. Deep Research is a feature that generates a report on any topic you choose by coming up with a plan of what to research, finding the appropriate content on the web, and then collating the information that it finds into a report. The results are in the form of a detailed written response that breaks down everything that was discovered, but these reports can often be quite long and fairly dry. Once you've generated a Deep Research report, however, you can get Gemini to turn it into an Audio Overview. Then, instead of having to read through the entire report, you can sit back and listen to two AI-generated podcasters discuss it in detail. It can make it easier to digest the information from a Deep Research report, without having to read through all the details. Audio Overviews seem like they would be a great way to get information from web pages with a lot of information on them, but currently, there's no way to generate an Audio Overview from a web link. However, you can copy the content to a text file, or save the contents of the web page as a PDF, and then Gemini will happily create an Audio Overview from the content. I saved the Wikipedia page on the history of Brazil as a PDF, and Gemini created a podcast from the file discussing Brazil's history, which was useful and informative. Related Reddit Is the Wikipedia of the Human Experience There's a reason so many people append their Google searches with "Reddit" Posts You can't generate Audio Overviews from most image files, either, but I found that if I saved images as PDF, it would at least try to generate an Audio Overview from the file. If there's no readable text in the image, however, then the Audio Overview generation will fail. If the image does contain text, it will work; I was able to get the AI-generated podcast hosts to have an enthusiastic and in-depth discussion about the PDF image of my local waste collection schedule. How to Create an Audio Overview When you upload a document to Gemini by clicking the "+" icon, you should see a suggestion pop up above the prompt window that you can click to generate your Audio Overview. If it doesn't pop up, however, all you need to do is ask Gemini to generate an Audio Overview from the document and, as long as it's a valid document with readable text, an Audio Overview will be generated. You can upload a wide range of files, although not all of them may be suitable for generating an Audio Overview. Supported file types include the following: * C, CPP, PY, JAVA, PHP, and SQL files * TXT, DOC, DOCX, PDF, RTF, DOT, DOTX, HWP, and HWPX files * PPTX, XLS, and CSV files * Google Docs and Google Slides If you have a Gemini Advanced subscription, you can also upload HTML, XLSX, TSV, and Google Sheets files. As mentioned above, you can upload images to Gemini, but you won't be able to generate an Audio Overview from image files. However, if you save an image as a PDF, it's possible to create an Audio Overview, as long as the image contains some readable text. Generating an Audio Overview from a Deep Research report is also easy to do; once you've generated the Deep Research report, you should see an option to generate an Audio Overview for the report. However, I found that this doesn't always happen. If the option doesn't appear, you can just ask Gemini to generate an Audio Overview, and it will create one for you. How Good Are Gemini's Audio Overviews? Since AI chatbots burst onto the scene, a lot of the things they can do have felt a little bit like magic. It still blows my mind that, in a matter of moments, AI can generate images of things that have never existed in images before, such as a unicorn with three legs rollerblading at a disco. Gemini's Audio Overviews can also feel a little like magic, too. That's because the results are genuinely impressive. For a start, the voices are very realistic, and make it feel like you're listening to real people talking. The way they interact is also really well done, with interruptions and the hosts talking over each other on occasions. In trying out the feature, my results have usually been very good at picking out the key points of the documents and discussing them in a very accessible way. I tried uploading the manual for an old Canon EOS 3 film camera I own, and the hosts had a highly informative discussion about the eye-tracking autofocus feature. I also uploaded an unpublished screenplay and the hosts talked through the key points of the plot in a very entertaining way, picking up a lot of the humor, and most of the central parts of the plot. The results aren't always perfect, however; the screenplay summary missed a key part of the plot which is required to understand both the title of the screenplay and its poignant last line. Audio Overviews Are (Mostly) a Great Way to Access Information Some AI features can feel like companies showcasing what the AI can do rather than genuinely useful features. The Audio Overview feature doesn't feel like that, however. Reading through a long document isn't always the easiest way to extract the key information from it. Listening to two people discussing the information can make it easier to distill the key facts without having to sift through it all yourself. Having two people discussing it is a clever touch, as often one of the hosts will ask the question you've been thinking about yourself. In particular, I found Audio Overviews to be very useful for Deep Research reports. These reports are often long walls of text, and while they are packed with useful information, reading through the entire report can feel like something of a chore. An Audio Overview of the report is far easier to digest, and the AI does a pretty good job of extracting the important information rather than waffling on about less important facts. That's not to say that Audio Overviews are perfect. I found that I often have the same problem with Audio Overviews that I have when listening to audiobooks: I start to tune out and miss what's being said. I then have to rewind the Audio Overview to catch up on what I've missed. This isn't the fault of the Audio Overviews, of course, but I'm sure I'm not the only person who suffers from this problem. For me, they work best when I have no other distractions, such as if I'm going for a walk with headphones on, but your mileage may vary. The podcasts don't always feature all the information that you might want to extract, either. The Audio Overview for the screenplay did extract most of the central plot points, but it missed something that wasn't necessarily central to the plot but was certainly a central theme of the script. If you don't enjoy reading through large amounts of text to extract the information that you want, then Audio Overviews can be a useful alternative. You can turn almost anything you want into your own bespoke podcast and have other people explain the key information to you, rather than having to read it for yourself. Hopefully, Google will add the ability to generate Audio Overviews of content from websites at some point, because right now you still have to jump through a few hoops to make it happen.
[3]
Gemini app rolling out Audio Overviews to Android, iOS
As announced earlier this week, the Gemini app is getting the ability to generate Audio Overview podcasts, with Android and iOS users beginning to see this capability today. Audio Overviews have widely rolled out on the web for free Gemini and paid Advanced subscribers in English, with more languages coming soon. On mobile, uploading a document or slideshow will show a new "Generate Audio Overview" suggestion chip alongside "Talk Live about this" (which has not fully launched on Android). You can also open the overflow menu in Deep Research reports to generate a podcast. After tapping, it takes a few minutes for the conversation between two AI hosts to generate. Google will send a notification when it's ready. The other way to access is by going to the Chats history in the top-left corner of the homescreen. Before today's launch, you could not access past Audio Overviews in your history on mobile. Something unexpected about this capability is how the Gemini app has no built-in audio player. Instead, tapping the "Gemini Audio Overview" directly opens the audio file in a browser tab. As such, you're interacting with the default Chrome, iOS, etc. audio player, though it does allow for straightforward downloads. This is strange since gemini.google.com has an inline player for Audio Overviews. These podcasts can be up to several minutes in length, with the aim being to "summarize the material, draw connections between topics, engage in a dynamic back-and-forth and provide unique perspectives." One thing to keep in mind with file-based Audio Overviews is that the conversation is based on that source and not real-world knowledge. It's different with Deep Research-derived podcasts.
[4]
You Can Now Generate Podcast-Style 'Audio Overviews' Directly in Gemini AI
Last year, Google released a niche feature in its experimental NotebookLM product that turned out to be a sleeper hit. You could upload any document to NotebookLM, and it would use AI to generate a two-person conversational podcast out of it. These aren't meant to be uploaded online for viewership, but for you to listen to as a learning ad. The idea is that there are times when it's easier to understand a concept when two people are just casually conversing about it. If that sounds appealing to you, good news: you no longer need to dive into the unfamiliar NotebookLM interface to try it out. Google's AI-generated podcasts, called Audio Overviews, are now available for free directly in the Gemini app and website. And now that Audio Overviews are integrated directly into Gemini, you can even use Gemini's own Deep Research reports as sources for your podcast. Personally, I've found it useful to first prompt Gemini to create a Deep Research report on a topic, then directly generate an Audio Overview from it (skipping past having to read it myself). To get started, use the Gemini website or the Gemini app. To upload your own document or slide show, click the Plus button and add in your desired source file. As soon as the file is processed, you'll see a button for Generate Audio Overview. Press the button and wait for Gemini to begin its work. It can take anywhere between 3-5 minutes to generate your podcast audio, depending on the depth of the material. Alternatively, you can prompt Gemini to create a report for you using the Deep Research option below the text box. Here, too, you'll have to wait a couple of minutes until the research is ready. Then, open the Deep Research document, click the down-arrow button, and then use the Generate Audio Overview button. You can also just enter "Generate Audio Overview" in the text box. When the processing is done, you'll get a notification from the Gemini website or the app. You'll now see a player in the chat box. Hit the Play button to start playback, and use the seek bar to jump to any point in the recording. Yes, there's speed control too, but you can only increase the playback speed up to 1.5x. I got 10 minutes of podcast overview from a 12-page document, so the detail isn't bad. If you don't want to listen to your podcast right away, or if you'd like to share it, you can also download the audio for offline payback. Click the three-dot menu button in the audio player and choose the Download option. From here, you can also choose the Share conversation option to share your Gemini chat and recording using a link. And while you're in Gemini, you might want to try creating your own custom AI Bots, called Gems, since those are now free for everyone to use.
Share
Share
Copy Link
Google's Gemini app now offers Audio Overviews, an AI-powered feature that transforms documents, presentations, and Deep Research reports into podcast-style conversations, enhancing user engagement with content.
Google has rolled out a new feature called Audio Overviews in its Gemini app, allowing users to transform various types of content into AI-generated podcast-style conversations. This innovative tool, previously available in Google's NotebookLM, is now accessible to both free and Advanced Gemini subscribers 12.
Audio Overviews utilize artificial intelligence to create engaging discussions between two AI-generated hosts based on the content of uploaded documents, presentations, or Deep Research reports. The feature aims to summarize material, draw connections between topics, and provide unique perspectives in a conversational format 34.
Users can generate Audio Overviews from various sources:
To create an Audio Overview, users can:
The generated podcasts typically range from 5 to 15 minutes in length, depending on the content volume. For instance, a 146-page camera manual produced a 15-minute podcast, while a single-page PDF resulted in a 5-minute discussion 2.
A particularly useful application of Audio Overviews is its integration with Gemini's Deep Research feature. Users can prompt Gemini to create a comprehensive report on any topic and then generate an Audio Overview based on that research. This allows for easy digestion of complex information without the need to read through lengthy reports 24.
While the feature is widely available on web browsers for both free and paid Gemini users, the mobile app experience has some limitations. The Gemini app lacks a built-in audio player, instead opening audio files in a browser tab. This allows for easy downloads but creates a somewhat disjointed user experience 3.
Currently, Audio Overviews are available in English, with support for more languages planned in the future. The feature is accessible on both Android and iOS devices, as well as through the Gemini website 3.
Audio Overviews present numerous possibilities for content consumption and learning:
As this technology continues to evolve, it could potentially reshape how we consume information, making it more accessible and engaging for a wider audience.
Reference
[2]
[3]
Google introduces an AI-powered feature that converts text notes into engaging podcast-style discussions. This innovative tool, part of the NotebookLM app, uses artificial intelligence to generate conversations between two AI hosts based on user-provided notes.
13 Sources
13 Sources
Google's NotebookLM, an AI tool for creating podcast-like audio summaries, might be integrated into the Gemini app on Android, potentially revolutionizing how users consume and learn from digital content.
3 Sources
3 Sources
Google introduces new personalization features for Gemini AI, allowing it to access users' search history for more tailored responses, alongside improvements in research capabilities and content creation tools.
54 Sources
54 Sources
Google's new AI experiment transforms text into professional-sounding podcasts, sparking excitement and ethical debates. While it offers accessibility and efficiency, concerns about authenticity and potential misuse arise.
3 Sources
3 Sources
Google has updated its NotebookLM AI tool with new customization features for its viral podcast generator, allowing users to guide AI-generated conversations and focus on specific topics.
7 Sources
7 Sources