Alibaba unveils Qwen3.5 AI model with visual agentic capabilities, claims edge over GPT-5.2

Reviewed by Nidhi Govil

9 Sources

Alibaba released Qwen3.5, its latest AI model, featuring visual agentic capabilities that enable autonomous task execution across applications. The company claims the model outperforms leading models from OpenAI, Anthropic, and Google DeepMind on multiple benchmarks while costing 60% less to operate than its predecessor. The launch intensifies competition in China's AI market as companies race to develop advanced AI agents.

Alibaba Releases Qwen3.5 with Advanced AI Agent Features

Alibaba unveiled its Qwen3.5 AI model series on Monday, timing the release to coincide with the eve of the Chinese Lunar New Year [1]. The launch represents a significant push into the emerging AI agents market, with the company positioning the model as purpose-built for autonomous task execution. Qwen3.5 arrives as an open-weight AI model, allowing developers to download, run, fine-tune, and deploy it on their own infrastructure, alongside a hosted version running on Alibaba Cloud's Model Studio platform [1].

Source: Market Screener

The flagship model features visual agentic capabilities that enable it to take actions across phone and computer applications without requiring constant user supervision [4]. These AI agents can independently complete multi-step tasks on behalf of users, a capability that has garnered intense attention following Anthropic's recent release of new agent tools [1]. The potential for these systems to replace traditional software-as-a-service companies has already begun to affect markets.

Mixture of Experts Architecture Delivers Cost-Effective Development

The Qwen3.5-397B-A17B model employs a mixture of experts architecture with 397 billion total parameters but activates only 17 billion per token [2]. This architectural approach marks a direct evolution from last September's experimental Qwen3-Next, scaling aggressively from 128 experts in previous models to 512 experts in the new release [2].
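
To make the gap between total and active parameters concrete, the following is a minimal sketch of top-k expert routing in plain Python. The 397B, 17B, and 512-expert figures come from the reporting above; the `top_k` value, the toy router scores, and the function names are illustrative assumptions and do not describe Qwen3.5's actual configuration.

```python
import math

TOTAL_PARAMS_B = 397   # total parameters, in billions (from the article)
ACTIVE_PARAMS_B = 17   # parameters activated per token, in billions (from the article)
NUM_EXPERTS = 512      # expert count (from the article)


def active_fraction(total_b: float, active_b: float) -> float:
    """Share of parameters that take part in one token's forward pass."""
    return active_b / total_b


def route_top_k(scores: list[float], top_k: int) -> dict[int, float]:
    """Select the top_k highest-scoring experts and softmax-normalize their weights.

    top_k is a hypothetical value: the article does not say how many
    experts Qwen3.5 activates per token.
    """
    chosen = sorted(range(len(scores)), key=lambda i: scores[i], reverse=True)[:top_k]
    peak = max(scores[i] for i in chosen)  # subtract the max for numerical stability
    exp_scores = {i: math.exp(scores[i] - peak) for i in chosen}
    norm = sum(exp_scores.values())
    return {i: w / norm for i, w in exp_scores.items()}


if __name__ == "__main__":
    print(f"active share: {active_fraction(TOTAL_PARAMS_B, ACTIVE_PARAMS_B):.1%}")
    toy_scores = [math.sin(i) for i in range(NUM_EXPERTS)]  # made-up router logits
    weights = route_top_k(toy_scores, top_k=8)
    print(f"experts used for this token: {len(weights)} of {NUM_EXPERTS}")
```

With the article's figures, roughly 4.3% of the parameters are active for any one token, which is where the inference savings of a sparse model come from.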

The engineering decisions translate into substantial operational advantages. Alibaba claims the model is 60% cheaper to operate than its predecessor and eight times better at handling large concurrent workloads [2][4]. The model also runs at approximately 1/18th the inference cost of Google's Gemini 3 Pro [2]. At 256K context lengths, Qwen3.5 decodes 19 times faster than Qwen3-Max and 7.2 times faster than Qwen3's 235B-A22B model [2].
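
As a rough illustration of what these ratios mean in practice, the sketch below applies the claimed 60% cost reduction and the 19x and 7.2x decode speedups to baseline numbers. The baseline cost and decode times are hypothetical placeholders chosen for easy arithmetic, not measured values.

```python
def cost_after_reduction(baseline_cost: float, reduction: float = 0.60) -> float:
    """Cost after a fractional reduction; 0.60 reflects the '60% cheaper' claim."""
    return baseline_cost * (1.0 - reduction)


def decode_time(baseline_seconds: float, speedup: float) -> float:
    """Decode time after an N-times speedup (19.0 or 7.2 in the article)."""
    return baseline_seconds / speedup


if __name__ == "__main__":
    # Hypothetical baselines: neither figure comes from the article.
    predecessor_cost = 100.0      # arbitrary currency units
    qwen3_max_seconds = 38.0      # assumed decode time at 256K context
    qwen3_235b_seconds = 36.0     # assumed decode time at 256K context

    print(f"cost after reduction: {cost_after_reduction(predecessor_cost):.2f}")
    print(f"vs Qwen3-Max baseline: {decode_time(qwen3_max_seconds, 19.0):.2f} s")
    print(f"vs Qwen3-235B-A22B baseline: {decode_time(qwen3_235b_seconds, 7.2):.2f} s")
```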

The model operates within a 256K context window in the open-weight version, expandable to 1 million tokens in the hosted Qwen3.5-Plus variant [2][3]. Alibaba equipped the model with hybrid attention mechanisms combining standard quadratic attention heads with linear attention heads, which require considerably less memory [3].
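
The memory difference between the two kinds of attention can be sketched with back-of-the-envelope arithmetic. Standard attention caches keys and values for every token, so its cache grows with context length, while linear attention variants typically keep a fixed-size state per head. The layer count, head counts, and head dimension below are hypothetical values chosen for illustration; the article does not give Qwen3.5's actual dimensions or its ratio of the two head types.

```python
def kv_cache_bytes(seq_len: int, layers: int, kv_heads: int,
                   head_dim: int, bytes_per_value: int = 2) -> int:
    """Standard attention: one key and one value vector per token, per head, per layer."""
    return 2 * layers * kv_heads * head_dim * seq_len * bytes_per_value


def linear_state_bytes(layers: int, heads: int, head_dim: int,
                       bytes_per_value: int = 2) -> int:
    """Linear attention: one head_dim x head_dim state per head, independent of seq_len."""
    return layers * heads * head_dim * head_dim * bytes_per_value


if __name__ == "__main__":
    # Hypothetical model shape, not taken from the article.
    layers, heads, head_dim = 48, 8, 128
    for seq_len in (8_192, 262_144):
        kv_gib = kv_cache_bytes(seq_len, layers, heads, head_dim) / 2**30
        state_mib = linear_state_bytes(layers, heads, head_dim) / 2**20
        print(f"{seq_len:>7} tokens: KV cache {kv_gib:.2f} GiB, "
              f"linear state {state_mib:.2f} MiB")
```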

Native Multimodal Understanding Sets New Standard

Unlike previous iterations that attached vision encoders to language models, Qwen3.5 features native multimodal understanding trained from scratch on text, images, and video simultaneously [2]. This approach weaves visual reasoning into the model's core representations rather than grafting it on afterward. The model can process prompts with up to 262,144 tokens by default, including text in more than 201 languages and dialects along with images such as data visualizations [3][5].

Source: Inc.

The expanded language support represents a significant jump from Qwen3's 119 languages [2], and now covers dialects used in South Asia, Oceania, and Africa [5]. The model's vocabulary grew to 250K tokens from 150K in prior generations, making it comparable to Google's roughly 256K-token vocabulary [2]. This tokenizer upgrade reduces token counts by 15-40% for non-Latin scripts, translating directly to lower costs and faster response times for global deployments.
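
Because hosted models are typically billed per token, a smaller token count lowers cost in direct proportion. The sketch below applies the reported 15-40% range to a hypothetical prompt; the old token count and the per-token price are made-up placeholders, not figures from Alibaba.

```python
def tokens_after_upgrade(old_tokens: int, reduction: float) -> int:
    """Token count after a fractional reduction (0.15 to 0.40 per the article)."""
    return round(old_tokens * (1.0 - reduction))


def request_cost(tokens: int, price_per_million: float) -> float:
    """Cost of a request under simple per-token billing."""
    return tokens / 1_000_000 * price_per_million


if __name__ == "__main__":
    old_tokens = 10_000        # hypothetical prompt length under the old tokenizer
    price_per_million = 1.0    # hypothetical price, arbitrary currency units
    for reduction in (0.15, 0.40):
        new_tokens = tokens_after_upgrade(old_tokens, reduction)
        print(f"{reduction:.0%} fewer tokens: {new_tokens} tokens, "
              f"cost {request_cost(new_tokens, price_per_million):.4f}")
```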

Qwen3.5 Outperforms Leading Models on Multiple Benchmarks

Alibaba claims Qwen3.5 delivers competitive performance against GPT-5.2, Claude Opus 4.5, and Gemini 3 Pro across numerous benchmarks [1][5]. The model outperformed both OpenAI and Anthropic on IFBench, which measures how well models follow user instructions [3]. On MathVista, it scored 90.3, and on MMMU, it achieved 85.0 [2].

Notably, Alibaba claims benchmark wins for the 397B-A17B model against its own previous flagship, Qwen3-Max, which exceeded one trillion parameters [2]. The model also outperformed Qwen3-VL, built specifically for image analysis tasks, across several visual reasoning and coding benchmarks [3]. While CNBC could not independently verify these claims [1], the self-reported results point to a model aimed at enterprises that want to balance performance with operational efficiency.

Intensifying Competition in China's AI Market

The Qwen3.5 release escalates rivalry within China's AI market, where Alibaba currently trails ByteDance's Doubao chatbot [5]. QuestMobile data from late December shows Doubao leads with 155 million weekly active users, while DeepSeek holds 81.6 million [5]. ByteDance launched Doubao 2.0 over the weekend, also targeting the agent era, while Zhipu AI released upgraded models aimed at supporting more agent capabilities [1][4].

Alibaba has deployed aggressive marketing to gain traction, including a 3-billion-yuan ($433 million) campaign allowing users to buy food and beverages via the Qwen chatbot, resulting in a seven-fold increase in active users [5]. Lin Junyang, technical lead of Alibaba Cloud's Qwen team, indicated the company expects to release more open-weight models during the Chinese New Year period [1]. Google DeepMind head Demis Hassabis told CNBC last month that Chinese AI models were just "months" behind Western rivals [1], suggesting the competitive gap continues to narrow as Chinese firms accelerate development cycles.

Source: Seeking Alpha

TheOutpost.ai

© 2026 Triveous Technologies Private Limited