2 Sources
[1]
Gemini's image analysis feature could soon get a time-saving addition (APK teardown)
With a future update, Google will allow users to upload up to ten images in a single prompt to Gemini, saving time and enabling better context-building. Google is hard at work improving Gemini's functionality. Users can already feed in images and other files to add context to their search queries or have them analyzed by Gemini. However, this functionality is limited to one image upload at a time, vastly restricting the utility of the AI digital assistant. Google could be looking to upgrade this functionality, allowing users to upload multiple images to Gemini in the near future.
[2]
Google Gemini's image toolset might add important new feature
Google Gemini is constantly enhancing its features. One area that may see improvements soon is the ability to import images. Currently, Gemini allows users to import only one image or file at a time when providing context for search queries. If users try to add another file, they are prompted to replace the existing one instead of being able to upload multiple files. According to Android Authority, Google is working on an upgrade to this functionality. Soon, users might be able to upload multiple images to Gemini simultaneously. Recommended Videos The site has successfully enabled the new feature in the latest beta version of the Google app (v16.11.32), which allows users to upload multiple images for analysis. With this feature activated, users can attach up to 10 images in a single prompt. Google can then analyze all the uploaded photos and provide contextual answers based on them. This isn't the first instance where Gemini's multi-image upload feature has been discovered. Last year, TestingCatalog found it in the web-based version of Gemini; however, the feature has not yet been activated. Please enable Javascript to view this content Now that Google appears to be testing the multi-image upload functionality on Android, it may soon be available across multiple platforms. This would be a significant advancement and would expand Gemini's capabilities. Google Gemini, which arrived last year, is a notable development in artificial intelligence, designed as a multimodal model capable of handling various forms of information, including text, code, images, audio, and video. This multimodality enables it to engage in diverse interactions and perform tasks requiring the integration of different data types. Recent efforts have focused on enhancing its reasoning abilities through techniques such as "Flash Thinking," improving personalization by utilizing user history, and strengthening its coding and data analysis capabilities. Additionally, Google is exploring integrating Gemini into its products and services to enhance user experiences. With ongoing development and expanding features, Gemini has the potential to influence how technology is interacted with and how information is accessed.
Share
Copy Link
Google is testing a new feature for Gemini that will allow users to upload up to ten images simultaneously, significantly improving the AI's context-building and analysis capabilities.
Google is on the verge of introducing a significant upgrade to its AI digital assistant, Gemini, with a new multi-image upload feature. This development promises to enhance the AI's ability to analyze and provide context-based responses, marking a notable advancement in user interaction with AI technology 1.
At present, Gemini's functionality allows users to upload only one image or file at a time when providing context for search queries or requesting image analysis. This limitation has restricted the utility of the AI assistant, as users are prompted to replace existing files rather than add multiple ones 2.
According to reports, Google has successfully enabled the new feature in the latest beta version of the Google app (v16.11.32) on Android. This update allows users to attach up to 10 images in a single prompt, significantly expanding Gemini's analytical capabilities 2.
Interestingly, this isn't the first time the multi-image upload feature has been spotted. Last year, it was discovered in the web-based version of Gemini by TestingCatalog, although it wasn't activated at that time. The current testing on Android suggests that Google may be preparing to roll out this feature across multiple platforms, which would represent a substantial improvement in Gemini's functionality 2.
Launched last year, Google Gemini is a multimodal AI model designed to handle various forms of information, including text, code, images, audio, and video. This versatility allows it to engage in diverse interactions and perform tasks that require the integration of different data types 2.
Google continues to enhance Gemini's capabilities, focusing on improving its reasoning abilities through techniques like "Flash Thinking," enhancing personalization by utilizing user history, and strengthening its coding and data analysis capabilities. The company is also exploring ways to integrate Gemini into its various products and services to enhance user experiences 2.
OpenAI CEO Sam Altman reveals Meta's aggressive recruitment tactics, offering $100 million signing bonuses to poach AI talent. Despite the lucrative offers, Altman claims no top researchers have left OpenAI for Meta.
34 Sources
Business and Economy
18 hrs ago
34 Sources
Business and Economy
18 hrs ago
YouTube announces integration of Google's advanced Veo 3 AI video generator into Shorts format, potentially revolutionizing content creation and raising questions about the future of user-generated content.
7 Sources
Technology
2 hrs ago
7 Sources
Technology
2 hrs ago
Pope Leo XIV, the first American pope, has made artificial intelligence's threat to humanity a key issue of his papacy, calling for global regulation and challenging tech giants' influence on the Vatican.
3 Sources
Policy and Regulation
2 hrs ago
3 Sources
Policy and Regulation
2 hrs ago
Google introduces Search Live, an AI-powered feature enabling back-and-forth voice conversations with its search engine, enhancing user interaction and multitasking capabilities.
11 Sources
Technology
2 hrs ago
11 Sources
Technology
2 hrs ago
OpenAI CEO Sam Altman announces GPT-5's summer release, hinting at significant advancements and potential shifts in AI model deployment. Meanwhile, OpenAI renegotiates with Microsoft and expands into new markets.
2 Sources
Technology
2 hrs ago
2 Sources
Technology
2 hrs ago