Google Unveils Whisk: A Novel AI Image Generation Tool Using Visual Prompts

Google introduces Whisk, an experimental AI tool that generates images using other images as prompts, streamlining the creative process for visual exploration.

Google Introduces Whisk: A New Approach to AI Image Generation

Google has unveiled Whisk, an experimental AI tool that takes a different approach to prompting AI-generated images. Unlike traditional text-based prompt systems, Whisk lets users generate images using other images as prompts, offering a more intuitive and accessible approach to visual creation 123.

How Whisk Works

Whisk uses a three-step process to generate images:

  1. Users input up to three images as prompts: one for the subject, one for the scene, and one for the style.
  2. Google's Gemini AI model analyzes these images and automatically generates detailed captions.
  3. The captions are then fed into Google's Imagen 3 image generation model to create the final output 13.

This process allows Whisk to capture the essence of the input images rather than creating exact replicas, enabling users to remix subjects, scenes, and styles in novel ways 3.
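
The image-to-caption-to-image flow described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not Google's implementation: `WhiskInputs`, `build_prompt`, `whisk_like_pipeline`, `fake_caption`, and `fake_generate` are hypothetical names, and the two `fake_*` functions are stand-ins for the Gemini captioning step and the Imagen 3 generation step, whose real interfaces the article does not describe.

```python
from dataclasses import dataclass
from typing import Callable, Optional

# Hypothetical type aliases: a captioner turns image bytes into text,
# and a generator turns a text prompt into image bytes.
Captioner = Callable[[bytes], str]
Generator = Callable[[str], bytes]


@dataclass
class WhiskInputs:
    """Up to three optional image prompts, one per role."""
    subject: Optional[bytes] = None
    scene: Optional[bytes] = None
    style: Optional[bytes] = None


def build_prompt(inputs: WhiskInputs, caption: Captioner) -> str:
    """Caption each provided image and merge the captions into one text prompt."""
    parts = []
    for role in ("subject", "scene", "style"):
        image = getattr(inputs, role)
        if image is not None:
            parts.append(f"{role}: {caption(image)}")
    return "; ".join(parts)


def whisk_like_pipeline(
    inputs: WhiskInputs, caption: Captioner, generate: Generator
) -> bytes:
    """Run the sketch end to end: images -> captions -> generated image."""
    prompt = build_prompt(inputs, caption)
    if not prompt:
        raise ValueError("at least one image prompt is required")
    # Only text reaches the generator; the source pixels are never passed on.
    return generate(prompt)


def fake_caption(image: bytes) -> str:
    """Stand-in for the captioning model (assumption, not a real API)."""
    return f"an image of {len(image)} bytes"


def fake_generate(prompt: str) -> bytes:
    """Stand-in for the image model (assumption, not a real API)."""
    return prompt.encode("utf-8")
```

Because only the captions are handed to the generator in this sketch, the output can reflect what the captions describe but cannot reproduce the source images exactly, which matches the article's point about capturing the essence rather than creating replicas.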

User Interface and Features

Whisk offers both simple and advanced interfaces:

  • The basic interface allows users to choose from predefined styles like sticker, enamel pin, and plushie 2.
  • The advanced editor enables users to upload their own images or use text prompts for subject, scene, and style 24.
  • Users can refine results by editing the underlying text prompts or adding more details 13.
  • A "dice" feature provides AI-generated images as prompts for those without source images 3.
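
One way to picture how these options could combine is a simple precedence rule per slot: a user-edited prompt wins, otherwise the automatically generated caption is used, otherwise a "dice" pick fills the gap. The sketch below is a guess at that logic, not Whisk's actual behavior. `resolve_prompts`, `ROLES`, and `DICE_POOL` are hypothetical, and the dice step is simplified to choosing a text description, whereas the article says the real feature supplies AI-generated images.

```python
import random
from typing import Dict, Optional

ROLES = ("subject", "scene", "style")

# Hypothetical fallback descriptions; the style entries reuse the
# predefined styles named in the article.
DICE_POOL: Dict[str, list] = {
    "subject": ["a robot", "a fox"],
    "scene": ["a city street", "a forest clearing"],
    "style": ["sticker", "enamel pin", "plushie"],
}


def resolve_prompts(
    auto_captions: Dict[str, str],
    user_edits: Dict[str, str],
    rng: Optional[random.Random] = None,
) -> Dict[str, str]:
    """Pick the text used for each role: user edit, then caption, then dice."""
    rng = rng or random.Random()
    resolved = {}
    for role in ROLES:
        if user_edits.get(role):
            resolved[role] = user_edits[role]      # explicit user refinement
        elif auto_captions.get(role):
            resolved[role] = auto_captions[role]   # caption from an uploaded image
        else:
            resolved[role] = rng.choice(DICE_POOL[role])  # simplified "dice"
    return resolved
```

For example, calling `resolve_prompts({"subject": "a tabby cat"}, {"style": "watercolor"})` keeps the caption for the subject, uses the user's text for the style, and draws the scene from the fallback pool.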

Applications and Limitations

Google positions Whisk as a tool for "rapid visual exploration" rather than for creating production-ready content or pixel-perfect edits 135. It's designed to help users brainstorm and visualize ideas quickly, allowing for easy experimentation with different concepts 1.

However, users should be aware that Whisk may sometimes produce unexpected results:

  • Generated images might differ from the source material in details like height, weight, hairstyle, or skin tone 13.
  • The tool may focus on different aspects of an image than the user intended 1.

Availability and Future Implications

Currently, Whisk is available only to users in the United States through Google Labs 14. It's free to use and requires a Google account 5. As an experimental tool, data from user interactions will be used to refine and develop future AI products 1.

Whisk is a step toward making AI image generation more accessible to visual creators and could influence future AI-assisted creative tools 15. Its approach of using images as prompts may open up new possibilities for visual ideation and concept exploration in creative fields.

TheOutpost.ai
© 2025 Triveous Technologies Private Limited