🚀 NEW RELEASE: Wan2.5 Preview Now Available

Wan2.5: Turn Text or Images into Cinematic Videos, Instantly

Experience the next evolution of AI video generation with Wan2.5's native audio-video synchronization. Create stunning videos of up to 10 seconds with synced dialogue, music, and sound effects, all generated in a single pass.

Native A/V Sync • Up to 10s Videos • < 60s Generation • 1080p HD Output • Multimodal Architecture

Powered by Wan2.5 - Alibaba's Cutting-Edge Multimodal AI

Revolutionary Features of Wan2.5

Experience the next generation of AI video creation with groundbreaking capabilities

EXCLUSIVE

One-Pass A/V Sync

Generate perfectly synchronized dialogue, music, and effects in a single pass

10-Second Videos

Create longer, more engaging content perfect for social media and marketing

Cinematic Control

Define camera movements, lighting, and effects directly in your prompts

Multi-Resolution

Output in 480p, 720p, or 1080p with multiple aspect ratios from one prompt

Smart Understanding

Handle complex scenes with multiple objects, actions, and visual effects

Multilingual Creation

Generate videos with voiceovers in multiple languages for global content

Wan2.5 vs Competition

See how Wan2.5 compares to other leading AI video generators

Feature	Wan2.5 (NEW)	Veo 3	Sora
Max Video Duration	10 seconds	8 seconds	60 seconds
Native Audio Sync
Lip Sync Support
Resolution Support	1080p HD	1080p HD	1080p HD
Generation Speed	< 60s	2-3 min	3-5 min
Text & Image Input
Camera Control
Cost Efficiency	Most Affordable	Higher Cost	Premium Pricing
Batch Processing
Multi-language

Wan2.5 Advantage: Native audio-video sync, a longer maximum duration than Veo 3 (10 seconds vs. 8 seconds), and cost-efficient scaling make it ideal for content creators and businesses.

Stunning Examples Created with Wan2.5

Explore the incredible quality and versatility of Wan2.5 generated content

What Makes Wan2.5 Revolutionary

Wan2.5 represents a fundamental breakthrough in AI video generation, featuring native multimodal architecture that unifies vision, language, and sound into a single generation pipeline.

One-Pass A/V Synchronization
Generate perfectly synchronized dialogue, background music, ambient sounds, and visuals in a single pass—no post-production needed
Unified Multimodal Framework
Deep alignment of text, image, video, and audio modalities through joint training, enabling superior cross-modal understanding
Human Preference Alignment
Enhanced with RLHF (Reinforcement Learning from Human Feedback) to produce outputs that align with human creative preferences

How to Generate Videos with Wan2.5

Create stunning AI videos with native audio in three simple steps

Input Your Content

Upload an image or type a text prompt describing your desired video scene

Customize Settings

Choose resolution, duration, and audio preferences for your video

Generate Video

Click generate and get your AI video with native audio in under 60 seconds
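
The three steps above can be sketched as a small settings object. This is only an illustration: the `GenerationRequest` class, its field names, and the minimum duration of 1 second are hypothetical and not part of any published Wan2.5 interface. Only the resolution options (480p, 720p, 1080p) and the 10-second maximum come from this page.

```python
from dataclasses import dataclass, asdict
from typing import Optional

# Limits taken from the feature list on this page; all names are hypothetical.
SUPPORTED_RESOLUTIONS = ("480p", "720p", "1080p")
MAX_DURATION_SECONDS = 10


@dataclass
class GenerationRequest:
    """Hypothetical container for the three steps: input, settings, generate."""
    prompt: Optional[str] = None
    image_path: Optional[str] = None
    resolution: str = "1080p"
    duration_seconds: int = 10
    with_audio: bool = True

    def validate(self) -> None:
        # Step 1: at least one input (text prompt or image) is required.
        if not self.prompt and not self.image_path:
            raise ValueError("provide a text prompt or an image")
        # Step 2: settings must stay within the advertised limits.
        if self.resolution not in SUPPORTED_RESOLUTIONS:
            raise ValueError(f"unsupported resolution: {self.resolution}")
        # The lower bound of 1 second is an assumption for this sketch.
        if not 1 <= self.duration_seconds <= MAX_DURATION_SECONDS:
            raise ValueError("duration must be between 1 and 10 seconds")

    def to_payload(self) -> dict:
        # Step 3: the validated settings a generate action would submit.
        self.validate()
        return {k: v for k, v in asdict(self).items() if v is not None}


request = GenerationRequest(
    prompt="Slow dolly-in on a rainy street at night, neon lighting, soft jazz",
    resolution="720p",
    duration_seconds=8,
)
payload = request.to_payload()
```

Camera movement, lighting, and audio cues go directly into the prompt text, while resolution, duration, and audio preference are kept as separate settings, which mirrors the split between step one and step two.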

Start creating AI videos with native audio

Wan2.5 Performance Metrics

Industry-leading performance that sets new standards for AI video generation

Generation Speed

Faster than competitors

Video Quality

Professional HD resolution

Success Rate

Generation accuracy

K+

Active Users

Content creators worldwide

* Statistics based on average performance metrics and user feedback

What Creators Are Saying About Wan2.5

Real feedback and amazing creations from users and creators

Wan2.5 is the first to compete with Veo 3

It can produce audio‑embedded videos up to 10 seconds long at 1080p resolution pic.twitter.com/lOodJlXKD2
— Dorksense (@Dork_sense) September 23, 2025

Alibaba dropped Wan2.5 in the same week as Wan2.2 animation and pushing the boundaries

- If properly used, it can be a great use case for anime and brand shots with sound

- Native Audio Support
- Rich textures and sound
- Great for ASMR Videos
- Complex Dancing Motions pic.twitter.com/Zc775SkoWM
— Kunal (@KunalAg45182378) September 24, 2025

🚨 WAKE UP!! Wan2.5 SPEAKS!🚨

You read that right, just released tonight, Wan2.5 has native audio just like VEO3! Capable of 1080p and 10 seconds and Image To Video at launch.

Text To Video Prompt:
camera natural light, 8K. cinematic realistic dramatic zoom in on a a… pic.twitter.com/dK9mrBAHFr
— Brent Lynch (@BrentLynch) September 24, 2025

💥🤯 the new Wan2.5 and it is insane

now it has audio, sound effects and... voices!

check these amazing examples below🧵 pic.twitter.com/zPzjEcgVLL
— Lars_Pragmata (@Lars_pragmata) September 24, 2025

Wan2.5 launched with native audio and 1080p support. I generated the video below and was surprised when I heard the audio. Pretty rad. pic.twitter.com/ubycGCrAtw
— Daniel (@dmigizi) September 25, 2025

Done and done! wan2.5 is definitely great! pic.twitter.com/EPlPWeoc7n
— John (@johnAGI168) September 29, 2025

Join our community and start creating amazing videos

Start Creating with Wan2.5

Frequently Asked Questions

Everything you need to know about Wan2.5 and its capabilities

Wan2.5 is an advanced AI video generation model that supports transforming text or images into video content with synchronized audio, enabling narrative scenes with dialogue, ambient sound, and effects—all generated in a single pass without post-processing.

Start Using Wan2.5 for Free Now!

Join thousands of creators using Wan2.5 to generate professional videos with perfect audio-video synchronization.