2 Sources
[1]
Gemini's image analysis feature could soon get a time-saving addition (APK teardown)
With a future update, Google will allow users to upload up to ten images in a single prompt to Gemini, saving time and enabling better context-building. Google is hard at work improving Gemini's functionality. Users can already feed in images and other files to add context to their search queries or have them analyzed by Gemini. However, this functionality is limited to one image upload at a time, vastly restricting the utility of the AI digital assistant. Google could be looking to upgrade this functionality, allowing users to upload multiple images to Gemini in the near future.
[2]
Google Gemini's image toolset might add important new feature
Google Gemini is constantly enhancing its features. One area that may see improvements soon is the ability to import images. Currently, Gemini allows users to import only one image or file at a time when providing context for search queries. If users try to add another file, they are prompted to replace the existing one instead of being able to upload multiple files. According to Android Authority, Google is working on an upgrade to this functionality. Soon, users might be able to upload multiple images to Gemini simultaneously. The site has successfully enabled the new feature in the latest beta version of the Google app (v16.11.32), which allows users to upload multiple images for analysis. With this feature activated, users can attach up to 10 images in a single prompt. Google can then analyze all the uploaded photos and provide contextual answers based on them. This isn't the first instance where Gemini's multi-image upload feature has been discovered. Last year, TestingCatalog found it in the web-based version of Gemini; however, the feature has not yet been activated. Now that Google appears to be testing the multi-image upload functionality on Android, it may soon be available across multiple platforms. This would be a significant advancement and would expand Gemini's capabilities. Google Gemini, which arrived last year, is a notable development in artificial intelligence, designed as a multimodal model capable of handling various forms of information, including text, code, images, audio, and video. This multimodality enables it to engage in diverse interactions and perform tasks requiring the integration of different data types. Recent efforts have focused on enhancing its reasoning abilities through techniques such as "Flash Thinking," improving personalization by utilizing user history, and strengthening its coding and data analysis capabilities.
Additionally, Google is exploring integrating Gemini into its products and services to enhance user experiences. With ongoing development and expanding features, Gemini has the potential to influence how technology is interacted with and how information is accessed.
Google is testing a Gemini feature that would let users upload up to ten images in a single prompt, improving the assistant's context-building and analysis capabilities.
Google appears to be preparing a significant upgrade to Gemini, its AI digital assistant: a multi-image upload feature. The change should improve the assistant's ability to analyze images and give context-based responses [1].
At present, Gemini lets users upload only one image or file at a time when providing context for search queries or requesting image analysis. Users who try to add a second file are prompted to replace the existing one, which limits the assistant's usefulness [2].
According to reports, Android Authority managed to enable the new feature in the latest beta version of the Google app (v16.11.32) on Android. With the feature activated, users can attach up to 10 images in a single prompt, and Gemini can analyze all of them and give contextual answers based on them [2].
This isn't the first time the multi-image upload feature has been spotted. Last year, TestingCatalog discovered it in the web-based version of Gemini, although it wasn't activated at that time. The current testing on Android suggests that Google may be preparing to roll out the feature across multiple platforms, which would represent a substantial improvement in Gemini's functionality [2].
Launched last year, Google Gemini is a multimodal AI model designed to handle various forms of information, including text, code, images, audio, and video. This versatility allows it to engage in diverse interactions and perform tasks that require the integration of different data types [2].
Google continues to enhance Gemini's capabilities, focusing on improving its reasoning abilities through techniques like "Flash Thinking," enhancing personalization by utilizing user history, and strengthening its coding and data analysis capabilities. The company is also exploring ways to integrate Gemini into its various products and services to enhance user experiences [2].