Google Unveils Whisk: A Novel AI Image Generation Tool Using Visual Prompts

16 Sources

Share

Google introduces Whisk, an experimental AI tool that generates images using other images as prompts, streamlining the creative process for visual exploration.

News article

Google Introduces Whisk: A New Approach to AI Image Generation

Google has unveiled Whisk, an experimental AI tool that revolutionizes the process of creating AI-generated images. Unlike traditional text-based prompt systems, Whisk allows users to generate images using other images as prompts, offering a more intuitive and accessible approach to visual creation

1

2

3

.

How Whisk Works

Whisk utilizes a unique two-step process to generate images:

  1. Users input up to three images as prompts: one for the subject, one for the scene, and one for the style.
  2. Google's Gemini AI model analyzes these images and automatically generates detailed captions.
  3. The captions are then fed into Google's Imagen 3 image generation model to create the final output

    1

    3

    .

This process allows Whisk to capture the essence of the input images rather than creating exact replicas, enabling users to remix subjects, scenes, and styles in novel ways

3

.

User Interface and Features

Whisk offers both simple and advanced interfaces:

  • The basic interface allows users to choose from predefined styles like sticker, enamel pin, and plushie

    2

    .
  • The advanced editor enables users to upload their own images or use text prompts for subject, scene, and style

    2

    4

    .
  • Users can refine results by editing the underlying text prompts or adding more details

    1

    3

    .
  • A "dice" feature provides AI-generated images as prompts for those without source images

    3

    .

Applications and Limitations

Google positions Whisk as a tool for "rapid visual exploration" rather than for creating production-ready content or pixel-perfect edits

1

3

5

. It's designed to help users brainstorm and visualize ideas quickly, allowing for easy experimentation with different concepts

1

.

However, users should be aware that Whisk may sometimes produce unexpected results:

  • Generated images might differ from the source material in details like height, weight, hairstyle, or skin tone

    1

    3

    .
  • The tool may focus on different aspects of an image than the user intended

    1

    .

Availability and Future Implications

Currently, Whisk is available only to users in the United States through Google Labs

1

4

. It's free to use and requires a Google account

5

. As an experimental tool, data from user interactions will be used to refine and develop future AI products

1

.

Whisk represents a significant step in making AI image generation more accessible to visual creators and could potentially influence future developments in AI-assisted creative tools

1

5

. Its unique approach of using images as prompts may open up new possibilities for visual ideation and concept exploration in various creative fields.

TheOutpost.ai

Your Daily Dose of Curated AI News

Don’t drown in AI news. We cut through the noise - filtering, ranking and summarizing the most important AI news, breakthroughs and research daily. Spend less time searching for the latest in AI and get straight to action.

© 2025 Triveous Technologies Private Limited
Instagram logo
LinkedIn logo