Google launches Gemini 3 Flash as default AI model, delivering speed with Pro-grade reasoning

Reviewed byNidhi Govil

19 Sources

Share

Google has released Gemini 3 Flash, making it the default in the Gemini app and AI Mode in Google Search globally. The new AI model combines the powerful reasoning capabilities of Gemini 3 Pro with faster performance and lower operational costs. Early benchmarks show significant improvements over its predecessor, with companies like JetBrains, Figma, and Cursor already deploying the model.

Google Releases Gemini 3 Flash as Default AI Model

Google has launched Gemini 3 Flash, positioning the new AI model as the default in the Gemini app and AI Mode in Google Search globally

1

2

. The release arrives just one month after Gemini 3 Pro debuted and six months following the Gemini 2.5 Flash announcement, marking an accelerated push in Google's competition with OpenAI and other AI developers

4

. Tulsee Doshi, Senior Director and Head of Product for Gemini Models at Google DeepMind, describes the transition as a "huge upgrade" for most users, promising faster turnaround from a latency perspective alongside more detailed, nuanced answers compared to the previous generation

5

.

Source: Google

Source: Google

The faster and more efficient AI model is now available for developers through the Gemini API, Vertex AI, AI Studio, Google Antigravity, Gemini CLI, and Android Studio

1

4

. Companies including JetBrains, Figma, Cursor, Harvey, and Latitude are already using Gemini 3 Flash in production environments

2

.

Improved Performance Across Benchmarks Shows Significant Gains

Gemini 3 Flash demonstrates substantial improvements in testing, with particularly striking gains in specialized assessments. On Humanity's Last Exam, which tests advanced domain-specific knowledge, the model scored 33.7% without tool useβ€”a threefold increase over Gemini 2.5 Flash's 11% score and competitive with GPT-5.2's 34.5%

2

. The model achieved 81.2% on the MMMU-Pro multimodality and reasoning benchmark, outperforming all competitors including Gemini 3 Pro

2

.

Coding capabilities show marked advancement, with Gemini 3 Flash gaining almost 20 points on the SWE-Bench Verified test compared to the 2.5 branch

1

. The model outperformed both Gemini 3 Pro and Claude Sonnet 4.5 on this agentic coding benchmark

4

. In the GPQA Diamond test requiring PhD-level reasoning, Gemini 3 Flash achieved 90.4%, approaching Gemini 3 Pro's 91.9%

3

. General knowledge accuracy also improved dramatically, with Simple QA Verified scores jumping from 28.1% in Gemini 2.5 Flash to 68.7% in the new model

1

.

Multimodal Capabilities and Cost Structure for Developers

Gemini 3 Flash retains the powerful reasoning capabilities and multimodal capabilities that defined Gemini 3 Pro, including the ability to process images, text, videos, and audio simultaneously

1

4

. Users can upload pickleball videos for technique analysis, submit sketches for identification, or provide audio recordings for detailed assessment

2

. The model can generate app prototypes directly within the Gemini app using natural language prompts and create plans based on series of videos and images in just seconds

4

5

.

Source: TechRadar

Source: TechRadar

For developers, pricing is set at $0.50 per 1 million input tokens and $3.00 per 1 million output tokensβ€”a modest increase from Gemini 2.5 Flash's $0.30 and $2.50 respectively, but substantially lower than Gemini 3 Pro's $2 and $12 rates

1

2

. Google claims Gemini 3 Flash outperforms Gemini 2.5 Pro while operating three times faster and using 30% fewer tokens on average for thinking tasks, potentially offsetting the higher per-token cost

1

2

. Doshi positioned the model as "more of your workhorse model" suitable for bulk tasks and quick, repeatable workflows like customer support agents or in-game assistants

2

4

.

Competitive Landscape and Strategic Implications

The rapid deployment reflects intensifying competition in the AI sector. Since releasing Gemini 3, Google has processed over 1 trillion tokens per day through its API

2

. Reports indicate OpenAI CEO Sam Altman sent an internal "Code Red" memo after ChatGPT traffic declined as Google's market share rose, prompting OpenAI to release GPT-5.2 and a new image generation model

2

. OpenAI claims ChatGPT message volume has grown 8x since November 2024 [2](https://techcrunch.com/2025/12/17/google-launches-gemini-3-flash-makes-it-the-default-model-in-the- Gemini-app/).

Google's decision to immediately embed Gemini 3 Flash into its cornerstone products signals strong internal confidence, contrasting with the cautious, gradual rollouts that characterized earlier phases of the AI race

4

. Gemini 3 Pro and Nano Banana Pro, Google's image model, are also expanding availability in AI Mode for all US-based users, with free access subject to unspecified limits while Pro and Ultra subscribers receive higher usage allowances

1

3

. Users can still select Gemini 3 Pro from the model picker for specialized math and coding questions

2

.

Source: Digital Trends

Source: Digital Trends

Today's Top Stories

TheOutpost.ai

Your Daily Dose of Curated AI News

Don’t drown in AI news. We cut through the noise - filtering, ranking and summarizing the most important AI news, breakthroughs and research daily. Spend less time searching for the latest in AI and get straight to action.

Β© 2025 Triveous Technologies Private Limited
Instagram logo
LinkedIn logo