2 Sources
[1]
Is Google Delaying Gemini 3.5 Pro Launch to July for Further Testing?
The company launched Gemini 3.5 Flash as the first model in the new series. Google described Flash as a model built for coding, AI agents and complex workflows. Gemini 3.5 Pro is expected to offer greater computing power for demanding tasks. The company has reportedly used feedback from Flash while developing the Pro model. Some users said Flash consumed tokens too quickly, which could raise costs when handling long prompts or completing extended tasks. Google is reportedly examining that issue while preparing Gemini 3.5 Pro. Token use affects how much text a model processes and generates. Higher usage can increase costs for developers and companies that run large numbers of AI requests. The upcoming model is also expected to improve long-horizon work. These tasks require an AI system to plan, use tools and maintain context over several stages. They include software development, data analysis and automated business processes. Nevertheless, Google has not published full technical details or benchmark results for Gemini 3.5 Pro. Claims about its speed, efficiency and coding performance will remain unverified until Google releases official testing data or gives users wider access.
[2]
Google reportedly postpones Gemini 3.5 Pro launch, here is why
The company is reportedly collecting more feedback from early testers and making changes based on their experience. Google has reportedly postponed the launch of its next advanced AI model called Gemini 3.5 Pro. For those who don't know, the company was expected to release the model this month, but a new report from Business Insider claims that the launch has now been pushed to July. According to the report, Google decided to delay the release to improve the model before making it available to everyone. The company is reportedly collecting more feedback from early testers and making changes based on their experience. Google likely wants to make sure that the AI model performs better in real-world situations. Google first teased Gemini 3.5 Pro during its I/O developer conference in May. At the event, CEO Sundar Pichai said the model would arrive in June. Do note that Google has not officially confirmed the delay. Also read: OpenAI introduces its first AI chip designed for LLM workloads: Check details The delay comes at a time when competition in the AI industry is growing. Companies like OpenAI and Anthropic are launching powerful AI models, especially for coding tasks. Coding has become one of the most important uses of AI for businesses. Gemini 3.5 Pro is expected to offer better performance on long and complex tasks. Google is also said to have used feedback from its Gemini Flash 3.5 model while working on Gemini 3.5 Pro. One of the main complaints about Flash 3.5 was that it used tokens too quickly. If the latest report is true, the extra development time could help Google launch a more polished and refined AI model. Also read: GTA 6 pre orders now live: India price, benefits and other details Meanwhile, Google introduced a new AI model called Gemma 4 12B earlier this month. The tech giant describes Gemma 4 12B as a "unified transformer" which is designed to bring agentic multimodal intelligence directly to laptops. One of the biggest highlights of Gemma 4 12B is that it can run locally on devices with just 16GB of RAM or VRAM. According to Google, the model delivers advanced reasoning abilities while maintaining a relatively small memory footprint. Google also claimed that Gemma 4 12B is its first mid-sized model with native audio input support.
Share
Copy Link
Google has reportedly pushed back the launch of Gemini 3.5 Pro from June to July. The delay follows feedback from early testers of Gemini 3.5 Flash, which revealed issues with token consumption that could increase costs for developers. The company is refining the advanced AI model to improve real-world performance before its public release.
Google has reportedly delayed the launch of Gemini 3.5 Pro, its next advanced AI model, from June to July. According to a Business Insider report, the Gemini 3.5 Pro launch delay stems from the company's decision to collect more feedback from early testers and implement changes based on their experience
2
. The tech giant first announced the AI model during its Google I/O developer conference in May, where CEO Sundar Pichai indicated a June release. However, Google has not officially confirmed the postponement2
.
Source: Digit
The delay arrives at a critical moment as competition intensifies in the AI industry. Companies like OpenAI and Anthropic continue launching powerful models, particularly for coding tasks, which have become essential for businesses
2
. The extra development time could allow Google to deliver a more polished product that meets the demands of developers and enterprises relying on AI for software development and data analysis workflows.Google is reportedly using feedback from Gemini 3.5 Flash while developing the Pro model. The company launched Gemini 3.5 Flash as the first model in the new series, describing it as built for coding, AI agents and complex workflows
1
. However, some users reported that Flash consumed tokens too quickly, which could raise costs when handling long prompts or completing extended tasks .
Source: Analytics Insight
Token consumption affects how much text a model processes and generates. Higher usage can increase costs for developers and companies that run large numbers of AI requests . Google is reportedly examining this issue while preparing Gemini 3.5 Pro, suggesting the company wants to ensure better real-world performance before the model reaches a wider audience. One of the main complaints about Flash 3.5 was precisely this rapid token usage, making cost efficiency a priority for the upcoming release
2
.Gemini 3.5 Pro is expected to offer greater computing power for demanding tasks and improved performance on long and complex tasks
1
2
. The upcoming model is designed to improve long-horizon work, which requires an AI system to plan, use tools and maintain context over several stages. These tasks include software development, data analysis and automated business processes1
.Google has not published full technical details or benchmark results for Gemini 3.5 Pro. Claims about its speed, efficiency and coding performance will remain unverified until Google releases official testing data or gives users wider access
1
. The company's cautious approach suggests it aims to avoid a problematic launch that could damage its competitive position in the rapidly evolving AI landscape.While Google refines Gemini 3.5 Pro, the company continues advancing its AI portfolio. Google introduced Gemma 4 12B earlier this month, describing it as a "unified transformer" designed to bring agentic multimodal intelligence directly to laptops. The model can run locally on devices with just 16GB of RAM or VRAM and features native audio input support
2
. This release demonstrates Google's multi-pronged strategy, targeting both cloud-based enterprise solutions and on-device AI capabilities. As July approaches, developers and businesses should monitor whether Google can deliver on the promised improvements while maintaining competitive pricing and performance against rivals in coding tasks and enterprise workflows.Summarized by
Navi
[1]
1
Policy and Regulation

2
Policy and Regulation

3
Technology
