
Wan 2.7 Text-to-Video API by Alibaba
Alibaba WAN 2.7 Text-to-Video
Alibaba WAN 2.7 Text-to-Video generates videos from text prompts with built-in audio generation, multi-shot narrative control, and sound-image synchronization.
What makes it stand out?
- Multi-shot storytelling: Describe scene-by-scene shots in the prompt, and the model generates a coherent multi-shot video with natural transitions.
- Built-in audio: Generates matching sound effects, music, and ambient audio automatically based on the prompt content.
- Flexible framing: Supports five aspect ratios (16:9, 9:16, 1:1, 4:3, 3:4) at 720P or 1080P resolution.
- Up to 15 seconds: Generate a video between 2 and 15 seconds long in a single request.
- Super Resolution options: Choose 1080P-SR or 1440P-SR when you need a sharper final video with cleaner edges and improved texture detail.
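The limits listed above can be checked on the client before a request is sent. The following is a minimal sketch, not part of any official SDK: the function name `validate_options` and its argument names are hypothetical, and it assumes that duration is given in whole seconds and that every aspect ratio can be combined with every resolution, which the page does not confirm.

```python
# Values taken from the feature list above; the names are illustrative only.
ASPECT_RATIOS = {"16:9", "9:16", "1:1", "4:3", "3:4"}
NATIVE_RESOLUTIONS = {"720P", "1080P"}
SR_RESOLUTIONS = {"1080P-SR", "1440P-SR"}
MIN_DURATION_S, MAX_DURATION_S = 2, 15


def validate_options(aspect_ratio: str, resolution: str, duration: int) -> list[str]:
    """Return a list of problems; an empty list means the options look valid."""
    problems = []
    if aspect_ratio not in ASPECT_RATIOS:
        problems.append(f"unsupported aspect ratio: {aspect_ratio}")
    if resolution not in NATIVE_RESOLUTIONS | SR_RESOLUTIONS:
        problems.append(f"unsupported resolution: {resolution}")
    if not MIN_DURATION_S <= duration <= MAX_DURATION_S:
        problems.append(
            f"duration must be {MIN_DURATION_S}-{MAX_DURATION_S} s, got {duration}"
        )
    return problems


if __name__ == "__main__":
    print(validate_options("9:16", "1080P", 8))   # no problems expected
    print(validate_options("2:1", "4K", 20))      # three problems expected
```

Returning a list of problems instead of raising on the first one lets a form or CLI report every invalid option at once.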
Designed For
- Short-form video creators producing social media clips, ads, and story reels.
- Teams that need quick video drafts from a text brief without sourcing footage.
- Anyone exploring creative video concepts through prompt iteration.
- Publishing workflows that need a higher-detail final output from a text prompt.
Super Resolution
Set resolution to 1080P-SR or 1440P-SR to use the FlashVSR super-resolution path. The request first generates a source video, then applies video super-resolution before returning the final output.
Use 1080P-SR for sharper HD results when the native 1080P output is not detailed enough. Use 1440P-SR for larger-screen previews, presentation assets, or publishing workflows where extra texture detail matters.
Super Resolution can take longer than native generation because it adds a post-processing step. Final billing depends on the selected model, resolution, and duration, and on the pricing configuration of your account and environment.
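The guidance in this section can be sketched as a small helper that picks a resolution value. This is only an illustration: `pick_resolution`, `is_super_resolution`, and their flags are hypothetical names, and falling back to native 1080P is an assumption, not a documented default.

```python
SR_RESOLUTIONS = {"1080P-SR", "1440P-SR"}


def is_super_resolution(resolution: str) -> bool:
    """True when the value selects the super-resolution path (extra post-processing)."""
    return resolution in SR_RESOLUTIONS


def pick_resolution(large_screen: bool = False, needs_extra_detail: bool = False) -> str:
    """Map the section's guidance to a resolution value.

    large_screen: previews, presentation assets, or detail-sensitive publishing.
    needs_extra_detail: native 1080P output is not detailed enough.
    """
    if large_screen:
        return "1440P-SR"
    if needs_extra_detail:
        return "1080P-SR"
    return "1080P"  # assumed fallback: native generation, no post-processing step


if __name__ == "__main__":
    choice = pick_resolution(needs_extra_detail=True)
    print(choice, is_super_resolution(choice))
```

Because the super-resolution values add a processing step, a caller might use `is_super_resolution` to allow a longer wait before treating a request as stalled.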
How to Use
- Write a detailed prompt describing the scene, characters, actions, and mood.
- For multi-shot videos, describe each shot with time markers (e.g., "Shot 1 [0-3s]: ...").
- Choose the resolution and aspect ratio for your target platform.
- Set the duration based on how long the final video should be.
- Enable prompt extend to improve results from short prompts. Disable it when you want precise control over the output.
- Choose resolution: 720P or 1080P for native generation, 1080P-SR or 1440P-SR for Super Resolution.
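The steps above can be sketched as a prompt builder that writes one time-marked line per shot and collects the remaining options into a request body. Treat this as an assumption-laden sketch: only `resolution` is named on this page, while `prompt`, `aspect_ratio`, `duration`, and `prompt_extend` are hypothetical field names, and no endpoint or client library is implied.

```python
from dataclasses import dataclass


@dataclass
class Shot:
    start_s: int       # shot start, in seconds
    end_s: int         # shot end, in seconds
    description: str   # scene, characters, actions, mood


def build_multishot_prompt(shots: list[Shot]) -> str:
    """Format shots with time markers, e.g. 'Shot 1 [0-3s]: ...'."""
    return "\n".join(
        f"Shot {i} [{shot.start_s}-{shot.end_s}s]: {shot.description}"
        for i, shot in enumerate(shots, start=1)
    )


def build_request(shots: list[Shot], resolution: str = "720P",
                  aspect_ratio: str = "16:9", prompt_extend: bool = False) -> dict:
    """Assemble a request body; all field names except 'resolution' are assumed."""
    return {
        "prompt": build_multishot_prompt(shots),
        "resolution": resolution,
        "aspect_ratio": aspect_ratio,
        "duration": shots[-1].end_s,      # total length = end of the last shot
        "prompt_extend": prompt_extend,   # off here for precise control over shots
    }


if __name__ == "__main__":
    shots = [
        Shot(0, 3, "Wide shot of a lighthouse at dawn, waves crashing."),
        Shot(3, 8, "Close-up of the keeper lighting a lamp, warm mood."),
    ]
    print(build_request(shots, resolution="1080P", aspect_ratio="9:16"))
```

Deriving the duration from the last shot keeps the time markers and the requested length consistent. Prompt extend is left off in this sketch because a shot-by-shot prompt is a case where precise control matters.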