2 Sources
[1]
ByteDance's New AI Video Model Can Make 30-Second Clips From a Single Prompt
Blake has over a decade of experience writing for the web, with a focus on mobile phones, where he covered the smartphone boom of the 2010s and the broader tech scene. When he's not in front of a keyboard, you'll most likely find him playing video games or watching horror movies. The Chinese tech company ByteDance revealed Seedance 2.5, the latest version of its artificial intelligence video generator model, during a conference in Beijing on Tuesday, according to a report from The Information. AI video generation has come a long way since its debut, with each new version becoming increasingly better at producing realistic imagery. We're far from the first time we saw Will Smith eating spaghetti, which was horrifyingly bad. Now, we need watermarking for these AI-generated videos to help identify deepfakes and other synthetic or false content. The latest version of the video model allows users to provide up to 50 reference pieces, whether they're images, videos or audio files -- up from 12 in its predecessor, Seedance 2.0. Increasing the number of references will give you greater control over the video creation process. The model can generate 30-second, 4K videos with a single prompt. ByteDance has consistently released some of the most impressive AI video generation models, rivaling those of OpenAI's now-dead Sora and Google's Veo 3. ByteDance, which previously held a majority stake in TikTok, is said to release the new model in China next month, according to the report. A release time window for other countries was not mentioned. The introduction of the new model may turn some heads, and not in the best way. Seedance 2.0's US rollout was delayed earlier this year under pressure from Hollywood to stop using copyrighted works that appeared to be used for training the model. If the latest model is significantly better than its predecessor, it could see a similar backlash if it can't address legal and copyright issues. ByteDance did not immediately respond to a request for comment.
[2]
ByteDance unveils Seedance 2.5, a 30-second native 4K AI video model that accepts 50 reference inputs
ByteDance announced Seedance 2.5 at its Beijing conference, generating 30-second native 4K video from up to 50 multimodal reference inputs. ByteDance unveiled Seedance 2.5 on Tuesday at its Volcano Engine FORCE conference in Beijing, a video generation model that produces 30-second clips at native 4K resolution from a single prompt. The company skipped four intermediate versions entirely, jumping straight from its predecessor to signal what it described as a generational leap. An enterprise beta is already live, with public launch targeted for early July. CEO Liang Rubo told the conference that climbing the AI summit is the company's top priority, with its model-as-a-service business evolving into a foundational operation backed by long-term investment. The headline upgrade is reference capacity: the model accepts up to 50 multimodal inputs, including images, audio clips, 3D white models, and style references, up from 12 in its predecessor. Those inputs give Seedance 2.5 far more granular control over style, motion, and composition than a text prompt alone. The model generates at 4K natively rather than upscaling from a lower resolution, a distinction that matters for professional production pipelines. It supports 10-bit colour depth for smoother gradients and more room for post-production colour grading. ByteDance also claims 20 percent better prompt adherence, meaning fewer generations before a usable result. Audio is now co-processed within the same latent space as visual signals, producing native synchronisation between onscreen actions and their corresponding sound effects. A new 3D white-box preview function lets creators generate low-fidelity animations before committing to a full-quality render. Together, the features position the model as a production tool rather than a novelty generator. The announcement comes three months after ByteDance was forced to add watermarking and IP guardrails to Seedance 2.0 following cease-and-desist letters from Disney, Warner Bros Discovery, Paramount, and Netflix. A viral deepfake of Tom Cruise fighting Brad Pitt on a rooftop drew a formal complaint from the Motion Picture Association and a rebuke from SAG-AFTRA. ByteDance paused the global rollout in mid-March and did not resume it through CapCut until late March, with face-blocking filters, C2PA watermarks, and copyrighted character detection in place. No timeline has been offered for making the new model available in the United States. The competitive context has shifted dramatically since February. OpenAI shut down Sora in March after the video tool peaked at roughly one million users and reportedly cost about a million dollars a day to operate, generating just over two million dollars in total revenue. Google's Veo 3.1 has filled much of the vacuum, offering native 4K output, audio generation, and up to three reference images for style control. But the new ByteDance model substantially exceeds Veo's reference input capacity, accepting 50 inputs to Veo's three, a gap that matters for professional workflows. The AI video generation market has fragmented rapidly, with Chinese models moving faster on production tooling than Western competitors. Third-party platforms like Reallusion's AI Studio have already built professional pipelines around the predecessor model, and Runway's fourth-generation tool has dropped out of the Artificial Analysis top 10. Whether the new model can reach global markets without reigniting the copyright battles that stalled its predecessor remains the central question. ByteDance has the model, the distribution through CapCut's 400 million monthly active users, and the vertical integration from generation to editing to sharing. What it does not yet have is a settlement with Hollywood, and every feature that makes the model more capable also raises the stakes of that unresolved conflict.
Share
Copy Link
ByteDance revealed Seedance 2.5 at its Beijing conference, an AI video generation model that creates 30-second native 4K videos from a single prompt. The model accepts up to 50 multimodal reference inputs, a significant jump from its predecessor's 12. But copyright concerns loom as Hollywood studios previously forced delays over training data issues.
ByteDance unveiled Seedance 2.5 on Tuesday at its Volcano Engine FORCE conference in Beijing, marking what the company describes as a generational leap in its AI video model capabilities
2
. The latest AI video generation model can produce 30-second 4K videos from a single prompt, skipping four intermediate versions to jump directly from its predecessor2
. CEO Liang Rubo told attendees that climbing the AI summit represents the company's top priority, with its model-as-a-service business evolving into a foundational operation backed by long-term investment2
.
Source: CNET
The headline upgrade centers on reference capacity. Seedance 2.5 accepts up to 50 multimodal reference inputs, including images, audio clips, 3D white models, and style references, up from 12 in Seedance 2.0
1
2
. This dramatic increase gives creators far more granular control over style, motion, and composition than a text prompt alone could provide2
. ByteDance also claims 20 percent better prompt adherence, meaning fewer generations before achieving a usable result2
.The generative AI video model generates at 4K natively rather than upscaling from lower resolution, a distinction that matters for professional production tool workflows
2
. It supports 10-bit colour depth for smoother gradients and more room for post-production colour grading2
. Audio is now co-processed within the same latent space as visual signals, producing native audio synchronization between onscreen actions and their corresponding sound effects2
. A new 3D white-box preview function lets creators generate low-fidelity animations before committing to a full-quality render2
. An enterprise beta is already live, with public launch targeted for early July2
.Related Stories
ByteDance has consistently released impressive AI video generation models, rivaling those of OpenAI Sora and Google Veo
1
. The competitive landscape shifted dramatically after OpenAI shut down Sora in March following peak usage of roughly one million users while reportedly costing about a million dollars a day to operate, generating just over two million dollars in total revenue2
. Google Veo 3.1 has filled much of the vacuum, offering native 4K output, audio generation, and up to three reference images for style control2
. But the new ByteDance model substantially exceeds Veo's reference input capacity, accepting 50 inputs to Veo's three, a gap that matters for professional workflows2
.The announcement comes three months after ByteDance was forced to add watermarking and IP guardrails to Seedance 2.0 following cease-and-desist letters from Disney, Warner Bros Discovery, Paramount, and Netflix
2
. A viral deepfake content incident featuring Tom Cruise fighting Brad Pitt on a rooftop drew a formal complaint from the Motion Picture Association and a rebuke from SAG-AFTRA2
. Seedance 2.0's US rollout was delayed earlier this year under pressure from Hollywood to stop using copyrighted works that appeared to be used for training the model1
. ByteDance paused the global rollout in mid-March and did not resume it through CapCut until late March, with face-blocking filters, C2PA watermarks, and copyrighted character detection in place2
. No timeline has been offered for making the new model available in the United States2
. Whether Seedance 2.5 can reach global markets without reigniting the copyright battles that stalled its predecessor remains the central question, especially as every feature that makes the model more capable also raises the stakes of that unresolved conflict.Summarized by
Navi
26 Mar 2026•Technology

09 Feb 2026•Technology

13 Feb 2026•Policy and Regulation

1
Technology

2
Policy and Regulation

3
Technology
