Alibaba bets $290 million on ShengShu's world model AI to move beyond text-based chatbots

Alibaba Cloud led a $290 million investment in ShengShu Technology, the Chinese AI startup behind video generator Vidu. The funding signals a strategic shift from large language models to world models that process visual, audio, and tactile data. ShengShu plans to build a general world model bridging digital and physical environments for applications in robotics and autonomous driving.

Alibaba Investment Fuels ShengShu's Ambitious World Model Vision

Alibaba Cloud has led a 2 billion yuan ($290 million) Series B funding round for ShengShu Technology, one of the largest recent investments in China's competitive AI landscape [1][2]. The round also drew participation from TAL Education Group, Baidu Ventures, and Luminous Ventures, while existing investors including LINK-X CAPITAL and Delta Capital increased their stakes [2]. The capital injection comes just two months after the Chinese AI startup secured 600 million yuan from Qiming Venture Partners and other backers, a sign of rapidly growing investor confidence in ShengShu's technology trajectory [1][3].

From Language Models to Physical Intelligence

The ShengShu Technology funding represents a strategic pivot in AI development, shifting focus from large language models trained primarily on text to world model systems built on multimodal data including vision, audio, and touch [3][5]. Zhu Jun, the Tsinghua University professor who founded ShengShu in March 2023 and serves as chief scientist, explained that "a general world model, built on multimodal data such as vision, audio, and touch, more naturally captures how the physical world works than large language models" [5]. The general world model processes sensory information to simulate human perception and interaction, which ShengShu describes as a step toward artificial general intelligence in physical environments [2][4]. The company aims to bridge two currently separate domains: the digital world of games and AI video generation, and the physical world of autonomous driving and robots [3].

Vidu's Position in the AI Video Generation Race

ShengShu became the first Chinese company to release a video generation model when it launched Vidu in April 2024, positioning itself as a competitor to OpenAI's Sora, which the U.S. company later discontinued [2][4]. The startup has since released several updated versions, including the Vidu Q3 model announced earlier this year [4]. Vidu currently ranks ninth on the Artificial Analysis chart for text-to-video services, trailing ByteDance's Seedance 2 and the recently released Happy Horse generator [1]. The platform has reached users in over 200 countries and regions worldwide, spanning industries such as animation, advertising, and film [1]. In 2025, ShengShu reported more than tenfold growth in both users and revenue, though specific figures were not disclosed [1].

Expanding Into Robotics Applications Through Motus

ShengShu's ambitions extend beyond AI video generation into robotics applications. In December 2025, the company open-sourced Motus, a new AI model line designed to control robots by processing multimodal data including video and audio [2][4]. The Motus system enables robots and other machine-intelligence systems to better perceive and understand real-world environments [1]. This move positions ShengShu alongside Chinese companies ranging from industry giants like ByteDance to startups such as humanoid robot specialist Unitree, which have begun exploring similar world model technologies [2].

Strategic Implications for Alibaba's Cloud Business

For Alibaba, the investment in ShengShu aligns with its broader strategy as one of the most active backers of Chinese artificial intelligence startups. The company hopes such financing will drive usage of its cloud computing platform, which has overtaken e-commerce as its fastest-growing revenue source [1]. Alibaba is also backing other AI video generation competitors, including PixVerse, highlighting the capital-intensive race that has pulled in tech heavyweights like ByteDance and Kuaishou Technology [1]. Internationally, companies such as Google and startups including Runway are developing similar technologies, intensifying global competition [4]. ShengShu did not provide a timeline for when its general world model system would be commercially available, leaving observers to watch how quickly the company can translate this substantial funding into market-ready products [2].

TheOutpost.ai

© 2026 Triveous Technologies Private Limited