Curated by THEOUTPOST
On Fri, 30 Aug, 4:06 PM UTC
2 Sources
[1]
Gemini relaunches image generation: Custom bots, better accuracy
What's the news: Google will resume its image generation service for Gemini's Advanced, Business, and Enterprise users in English, as per a blog post by the company. Senior Director of Product Management and Gemini Experiences Dave Citron informed via the blog that users can use image generation model Imagen 3 over the coming days. In February 2024, Google put a hold on the image generation feature in Gemini following user complaints of 'historical inaccuracies' in the generated images. Six months later, Google will now resume the service. "With Imagen 3, we've made significant progress in providing a better user experience when generating images of people. We don't support the generation of photorealistic, identifiable individuals, depictions of minors or excessively gory, violent or sexual scenes," said Google in the blog post, adding that the new version uses SynthID for watermarking AI-generated images. Apart from the resumption of image generation, Gemini will soon allow Gemini Advanced, Business and Enterprise subscribers to create Gems, custom versions of Gemini, on desktop and mobile devices. Users can customize Gems to act as experts on specific topics and for specific goals. The bot can also remember a set of instructions for repetitive tasks. As part of the launch, Google announced the following pre-made Gems: Around the same time as the image generation controversy, a user had flagged that when asked whether Indian Prime Minister Narendra Modi is a fascist, the chatbot responded by saying that the man "has been accused of implementing policies that some experts have characterized as fascist. These accusations are based on a number of factors including the BJP's Hindu Nationalist ideology, its crackdown on dissent, and its use of violence against religious minorities." However, when asked the same question about Trump, Gemini called these "complex topics." The user who shared the screenshots of the images accused the company of bias when dealing with information regarding Americans, their allies and non-allies. Minister of State for Electronics and Information Technology Rajeev Chandrasekhar accused Gemini of violating Rule 3(1)(b) of the Information Technology (IT) Rules, 2021 for the same. Here's what MediaNama Founder Nikhil Pahwa had to say about the allegations of bias. Similarly, in May 2024, users accused Google's AI Overview of providing misleading and false information to search queries. Many claimed that AI Overview was unable to differentiate between fact and fiction, humour or satire, and could not understand the context in case of certain references. Also Read: STAY ON TOP OF TECH NEWS: Our daily newsletter with the top story of the day from MediaNama, delivered to your inbox before 9 AM. Click here to sign up today!
[2]
Google Gemini Gets Imagen 3 Image Generation and More
AI image generation has transformed the way we create visual content, opening up new possibilities for artists, designers, and content creators alike and now this feature is coming to Google Gemini. With the latest advancements in AI technology, generating high-quality images has become more accessible and efficient than ever before. Google Gemini's new Imagen 3 model is at the forefront of this innovation, offering users the ability to create stunning, diverse images with just a few descriptive words. This innovative technology leverages the power of machine learning to interpret and generate images based on textual input, allowing for unprecedented levels of creativity and customization. Google Gemini's Imagen 3 model represents a significant leap forward in AI image generation capabilities. This advanced model can produce an impressive array of image styles, ranging from photorealistic landscapes to textured oil paintings and even whimsical claymation scenes. The model's versatility and attention to detail enable users to bring their wildest imaginations to life with ease. Whether you're a professional artist looking for inspiration or a casual user exploring creative possibilities, Imagen 3 offers a powerful tool for visual expression. One of the key advantages of Imagen 3 is its ability to generate images with remarkable coherence and contextual understanding. The model has been trained on a vast dataset of images and their corresponding descriptions, allowing it to grasp the relationships between objects, scenes, and styles. This means that users can input complex, detailed prompts and expect Imagen 3 to generate images that accurately reflect their intended vision. From "a majestic mountain landscape at sunset with a lone hiker in the foreground" to "a surreal, Dali-esque dreamscape with melting clocks and floating elephants," Imagen 3 can handle a wide range of creative challenges. In addition to the powerful Imagen 3 model, Google Gemini now offers Advanced subscribers the ability to create custom AI experts called Gems. These Gems can be tailored to provide personalized assistance on a wide range of subjects, from coding and data analysis to creative writing and career guidance. Users can name their Gems, provide detailed instructions, and interact with them to receive expert advice and support whenever they need it. The introduction of custom Gems marks a significant step forward in the realm of AI-assisted learning and problem-solving. By allowing users to create their own specialized AI assistants, Google Gemini empowers individuals to access the knowledge and skills they need to succeed in their chosen fields. Whether you're a student looking for tutoring support, a professional seeking advice on a complex project, or a hobbyist exploring a new area of interest, custom Gems can provide the personalized guidance you need to achieve your goals. As with any powerful technology, AI image generation raises important questions about responsible use and potential misuse. Google Gemini has taken proactive steps to address these concerns by implementing built-in safeguards and adhering to strict product design principles. These measures help to prevent the generation of harmful, offensive, or misleading content, ensuring that users can explore the creative possibilities of Imagen 3 and custom Gems with confidence. Google Gemini's commitment to ethical AI extends beyond just technical safeguards. The company actively engages with the broader AI research community to explore the societal implications of this technology and develop best practices for its use. By fostering open dialogue and collaboration, Google Gemini aims to ensure that AI image generation and personalized assistance are developed and deployed in a manner that benefits society as a whole. The new features, including custom Gems and the Imagen 3 image generation model, are rolling out to Gemini Advanced, Business, and Enterprise users. These features are available on both desktop and mobile devices in over 150 countries and in most languages. Pricing details for Gemini subscriptions can be found on the official Google Gemini website. Google Gemini's new Imagen 3 model and custom Gems represent a major milestone in the field of AI-assisted creativity and learning. By harnessing the power of advanced machine learning techniques, these tools offer users unprecedented opportunities for visual expression, personalized assistance, and skill development. As AI technology continues to evolve, we can expect to see even more innovative applications emerge, transforming the way we create, learn, and solve problems. With Google Gemini at the forefront of this revolution, the possibilities are truly endless.
Share
Share
Copy Link
Google has relaunched its Gemini AI with significant upgrades, including image generation powered by Imagen 3, custom bot creation, and expanded language support. These enhancements aim to improve user experience and compete with other AI platforms.
Google has unveiled a major update to its Gemini AI platform, introducing a host of new features and improvements that promise to enhance user experience and expand the platform's capabilities 1. This relaunch marks a significant step forward in Google's AI offerings, positioning Gemini as a strong competitor in the rapidly evolving artificial intelligence landscape.
One of the most notable additions to Gemini is the integration of Imagen 3, Google's advanced image generation technology. This feature allows users to create high-quality, photorealistic images from text descriptions 2. The image generation capability is now available to Gemini Advanced users in most English-speaking countries, with plans to expand to other regions and languages in the future.
Gemini's relaunch introduces the ability for users to create custom AI bots tailored to specific tasks or interests. This feature, reminiscent of OpenAI's GPTs, enables users to design personalized AI assistants that can be shared with others or kept private [1]. The custom bot creation tool is currently available to Gemini Advanced users in the US, with a global rollout expected in the coming weeks.
Google has significantly broadened Gemini's language capabilities, now supporting over 40 languages for text-based interactions. This expansion includes voice conversations in nine languages, making the AI more accessible to a global audience [1]. The increased language support is a crucial step in Google's efforts to make AI technology more inclusive and widely available.
The Gemini app for Android and iOS has received substantial updates, improving its functionality and user interface. Notable additions include the ability to upload images for analysis and the option to change Gemini's voice and speaking style [2]. These enhancements aim to make the mobile AI experience more engaging and personalized.
Google has further integrated Gemini with its Workspace suite, allowing users to leverage AI capabilities within popular applications like Docs, Sheets, and Slides. This integration enables features such as text generation, summarization, and proofreading directly within these productivity tools [1].
While many of the new features are exclusive to Gemini Advanced subscribers, Google continues to offer a free tier with access to core AI functionalities. The Advanced subscription, priced at $19.99 per month, provides access to the full range of new features and capabilities [2].
As Google continues to innovate and expand Gemini's capabilities, the AI landscape becomes increasingly competitive. With these latest enhancements, Google aims to solidify Gemini's position as a leading AI platform, offering users a comprehensive suite of tools for creativity, productivity, and problem-solving in the digital age.
Reference
[2]
Google has unveiled 'Gems,' a new feature for Gemini subscribers that allows users to create personalized AI chatbots. The update also includes improvements to image generation capabilities with Imagen 3 integration.
14 Sources
Google is set to reintroduce the feature of generating images of people on its Gemini AI model, following a temporary pause due to inaccuracies in historical representations. The company has addressed the issues and plans to roll out the improved version soon.
8 Sources
Google's AI chatbot Gemini receives a significant update to its image generation capabilities, introducing Imagen 3 and potential resizing options, enhancing user experience and creative possibilities.
10 Sources
Google updates Gemini with streamlined image sharing on Android and develops inline image editing features, aiming to improve user experience and compete with other AI assistants.
8 Sources
Google is set to relaunch its AI image generation tool, addressing previous controversies and inaccuracies. The improved version promises enhanced accuracy and diversity in human depictions.
4 Sources
The Outpost is a comprehensive collection of curated artificial intelligence software tools that cater to the needs of small business owners, bloggers, artists, musicians, entrepreneurs, marketers, writers, and researchers.
© 2024 TheOutpost.AI All rights reserved