Google launches Gemini 3 Flash as default AI model, delivering speed with Pro-grade reasoning

Reviewed byNidhi Govil

37 Sources

Share

Google has released Gemini 3 Flash, making it the default in the Gemini app and AI Mode in Google Search globally. The new AI model combines the powerful reasoning capabilities of Gemini 3 Pro with faster performance and lower operational costs. Early benchmarks show significant improvements over its predecessor, with companies like JetBrains, Figma, and Cursor already deploying the model.

Google Releases Gemini 3 Flash as Default AI Model

Google has launched Gemini 3 Flash, positioning the new AI model as the default in the Gemini app and AI Mode in Google Search globally

1

2

. The release arrives just one month after Gemini 3 Pro debuted and six months following the Gemini 2.5 Flash announcement, marking an accelerated push in Google's competition with OpenAI and other AI developers

4

. Tulsee Doshi, Senior Director and Head of Product for Gemini Models at Google DeepMind, describes the transition as a "huge upgrade" for most users, promising faster turnaround from a latency perspective alongside more detailed, nuanced answers compared to the previous generation

5

.

Source: Google

Source: Google

The faster and more efficient AI model is now available for developers through the Gemini API, Vertex AI, AI Studio, Google Antigravity, Gemini CLI, and Android Studio

1

4

. Companies including JetBrains, Figma, Cursor, Harvey, and Latitude are already using Gemini 3 Flash in production environments

2

.

Improved Performance Across Benchmarks Shows Significant Gains

Gemini 3 Flash demonstrates substantial improvements in testing, with particularly striking gains in specialized assessments. On Humanity's Last Exam, which tests advanced domain-specific knowledge, the model scored 33.7% without tool use—a threefold increase over Gemini 2.5 Flash's 11% score and competitive with GPT-5.2's 34.5%

2

. The model achieved 81.2% on the MMMU-Pro multimodality and reasoning benchmark, outperforming all competitors including Gemini 3 Pro

2

.

Coding capabilities show marked advancement, with Gemini 3 Flash gaining almost 20 points on the SWE-Bench Verified test compared to the 2.5 branch

1

. The model outperformed both Gemini 3 Pro and Claude Sonnet 4.5 on this agentic coding benchmark

4

. In the GPQA Diamond test requiring PhD-level reasoning, Gemini 3 Flash achieved 90.4%, approaching Gemini 3 Pro's 91.9%

3

. General knowledge accuracy also improved dramatically, with Simple QA Verified scores jumping from 28.1% in Gemini 2.5 Flash to 68.7% in the new model

1

.

Multimodal Capabilities and Cost Structure for Developers

Gemini 3 Flash retains the powerful reasoning capabilities and multimodal capabilities that defined Gemini 3 Pro, including the ability to process images, text, videos, and audio simultaneously

1

4

. Users can upload pickleball videos for technique analysis, submit sketches for identification, or provide audio recordings for detailed assessment

2

. The model can generate app prototypes directly within the Gemini app using natural language prompts and create plans based on series of videos and images in just seconds

4

5

.

Source: TechRadar

Source: TechRadar

For developers, pricing is set at $0.50 per 1 million input tokens and $3.00 per 1 million output tokens—a modest increase from Gemini 2.5 Flash's $0.30 and $2.50 respectively, but substantially lower than Gemini 3 Pro's $2 and $12 rates

1

2

. Google claims Gemini 3 Flash outperforms Gemini 2.5 Pro while operating three times faster and using 30% fewer tokens on average for thinking tasks, potentially offsetting the higher per-token cost

1

2

. Doshi positioned the model as "more of your workhorse model" suitable for bulk tasks and quick, repeatable workflows like customer support agents or in-game assistants

2

4

.

Competitive Landscape and Strategic Implications

The rapid deployment reflects intensifying competition in the AI sector. Since releasing Gemini 3, Google has processed over 1 trillion tokens per day through its API

2

. Reports indicate OpenAI CEO Sam Altman sent an internal "Code Red" memo after ChatGPT traffic declined as Google's market share rose, prompting OpenAI to release GPT-5.2 and a new image generation model

2

. OpenAI claims ChatGPT message volume has grown 8x since November 2024 [2](https://techcrunch.com/2025/12/17/google-launches-gemini-3-flash-makes-it-the-default-model-in-the- Gemini-app/).

Google's decision to immediately embed Gemini 3 Flash into its cornerstone products signals strong internal confidence, contrasting with the cautious, gradual rollouts that characterized earlier phases of the AI race

4

. Gemini 3 Pro and Nano Banana Pro, Google's image model, are also expanding availability in AI Mode for all US-based users, with free access subject to unspecified limits while Pro and Ultra subscribers receive higher usage allowances

1

3

. Users can still select Gemini 3 Pro from the model picker for specialized math and coding questions

2

.

Source: Digital Trends

Source: Digital Trends

Today's Top Stories

TheOutpost.ai

Don’t drown in AI news. We cut through the noise - filtering, ranking and summarizing the most important AI news, breakthroughs and research daily. Spend less time searching for the latest in AI and get straight to action.

Instagram logo
LinkedIn logo
Youtube logo
© 2026 TheOutpost.AI All rights reserved