
The Wan API brings Alibaba's open Wan video models to Atlas Cloud through one unified key. Wan 2.2 pioneered a Mixture-of-Experts architecture for video diffusion, lifting capacity and motion control at the same inference cost. It handles text to video, image to video, and video to video, with first-to-last frame control, extend, and upscaling, all reachable alongside 300+ models.
Atlas Cloud provides you with the latest industry-leading creative models.
Match each job to the right Wan 2.2 variant: the cinematic A14B models, the lightweight TI2V-5B, or the turbo builds for speed. Together they cover text to video, image to video, and video to video, all through one key on Atlas Cloud.
| Model | Description |
|---|---|
| Wan 2.2 T2V-A14B (Text to Video) | The flagship text-to-video model, using the Mixture-of-Experts architecture for cinematic motion and fine aesthetic control. Best when you need the strongest fidelity from a pure text prompt. |
| Wan 2.2 I2V-A14B (Image to Video) | Animates a still image into moving footage while preserving subject identity and texture, with an MoE design that holds detail as frames evolve. A fit for product shots and concept art brought to life. |
| Wan 2.2 V2V (Video to Video) | Transforms existing footage: restyle a clip, shift its tone, or apply a new look while keeping the original motion intact. Turns one source video into several variations without a reshoot. |
| Wan 2.2 TI2V-5B (Text and Image to Video) | A lighter model that combines text and image inputs in one checkpoint, generating 720p at 24fps. Efficient enough for consumer GPUs, so it suits quick, cost-aware generation. |
| Wan 2.2 Turbo (Accelerated) | A speed-first path built on an rCM sampler and 4-step distillation, cutting latency while retaining strong fidelity. Built for iteration, batch generation, and prompt testing. |
| Wan 2.2 Video Extend | Lengthens an existing clip into a longer sequence while keeping motion and lighting continuous with the source. Turns a short draft into a fuller deliverable. |
| Wan 2.2 Upscale | Sharpens footage to 1080p or 2K while preserving timing and composition, with 4K planned for a later release. A finishing step for drafts and older clips. |
The Wan API brings Alibaba's open Wan 2.2 to Atlas Cloud with a Mixture-of-Experts architecture, cinematic motion control, first-to-last frame guidance, video extend, and upscaling, all behind one unified key.
Wan 2.2 was the first video diffusion model to adopt a Mixture-of-Experts design, splitting the denoising process across specialized experts. This raises model capacity and detail without increasing inference cost, so you get richer scenes and more accurate motion at the same compute budget.
Trained on curated data labeled for lighting, composition, contrast, and color, the Wan API gives fine control over the look of a shot. Wan 2.2 handles complex motion, dynamic camera moves, and fluid transitions, so prompts like an aerial orbit or a handheld tracking shot render as intended.
Wan 2.2 covers text to video, image to video, and video to video in one model family, so you can start from a prompt, animate a still, or transform existing footage. Switching modes is a parameter or endpoint change, which keeps a mixed pipeline on a single integration.
The Wan API supports first-and-last frame guidance, letting you set the opening and closing frames and have the model generate the motion between them. This gives directors precise control over how a shot begins and resolves, useful for product reveals and scripted beats.
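As a rough sketch of what a first-and-last frame request might look like, the helper below assembles a JSON-style payload from a prompt and two frame URLs. The model ID and every field name (`first_frame`, `last_frame`, `duration`) are assumptions made for illustration, not the documented Atlas Cloud schema, so check the API reference before relying on them.

```python
from typing import Any, Dict


def build_first_last_frame_payload(
    prompt: str,
    first_frame_url: str,
    last_frame_url: str,
    duration_seconds: int = 5,
) -> Dict[str, Any]:
    """Assemble a request body that pins both the opening and closing frame.

    All keys and the model ID are hypothetical placeholders.
    """
    if not first_frame_url or not last_frame_url:
        raise ValueError("both a first and a last frame are required")
    return {
        "model": "wan-2.2-i2v-a14b",    # hypothetical model ID
        "prompt": prompt,                # describes the motion between the frames
        "first_frame": first_frame_url,  # hypothetical field name
        "last_frame": last_frame_url,    # hypothetical field name
        "duration": duration_seconds,    # hypothetical field name
    }


payload = build_first_last_frame_payload(
    "The box lid lifts and the watch rotates into view",
    "https://example.com/closed-box.png",
    "https://example.com/watch-hero.png",
)
```

Validating that both frames are present before sending keeps a malformed request from consuming a generation slot.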
Beyond fresh generation, Wan 2.2 extends existing clips into longer sequences while preserving motion, and upscales footage to 1080p or 2K while keeping timing and composition intact. Together these turn short drafts into polished, longer deliverables without a reshoot.
The Wan API offers a turbo path built on an rCM sampler and 4-step distillation, compressing the denoising steps for low-latency generation. It keeps strong visual fidelity while cutting wait time, which suits iteration, batch runs, and prompt testing.
Run the same prompt through the Wan API and other leading video models on Atlas Cloud, and compare how each handles cinematic motion, restyling, and open-source flexibility in a single scene.
Cinematic multi-shot mood piece in 8 seconds, controlled lighting and color. Shot 1, slow dolly-in: a dim study at night, a single desk lamp pooling warm light over an open book as dust drifts in the beam. Shot 2, hard cut: a wide shot of a lone figure standing at a floor-to-ceiling window, city lights bokeh-soft behind rain on the glass. Shot 3, close-up: fingers trace the rim of a coffee cup, steam curling upward in the cool blue light. Shot 4, low angle: the figure turns toward a doorway where warm hallway light spills in, silhouette sharpening. Shot 5, dramatic wide: the room in soft dawn light, curtains glowing as the lamp clicks off. Rich color grading, deliberate lighting shifts, shallow depth of field, cinematic, crisp 1080p.
Models compared: Wan 2.2, Pixverse v6, Veo 3.1
Restyle and motion showcase in 8 seconds, hard cuts. Shot 1: ordinary daytime street footage of a person walking through a market, transformed into a warm, painterly illustrated style while the walking motion stays natural and continuous. Shot 2, hard cut: the same scene restyled into a cool, cinematic teal-and-orange film look, crowd movement preserved. Shot 3: a slow push-in on a fruit stall as the style shifts to soft watercolor, colors bleeding gently at the edges. Consistent underlying motion across every restyle, smooth transitions, clean detail, crisp 1080p.
Models compared: Wan 2.2, Pixverse v6, Veo 3.1
From social clips and product videos to previs, restyling, and self-hosted pipelines, the Wan API turns Alibaba's open Wan 2.2 into production features through one unified key on Atlas Cloud.
Turn a prompt or a single image into short, cinematic clips for TikTok, Reels, and Shorts, with the motion control and aesthetic grading Wan 2.2 is built for. Creator tools and social teams can produce polished vertical content without a shoot.
Animate a product photo into a moving showcase with the Wan API, using image to video and first-to-last frame control to script how the reveal opens and closes. Marketing teams get repeatable ad clips from stills, sized for each channel.
Generate quick motion references from scripts and concept art before committing to a full shoot. Wan 2.2's camera and motion control let directors test staging, pacing, and shot transitions cheaply, then iterate before production.
Use the Wan API's video to video and extend modes to restyle clips, shift tone, or lengthen a scene while keeping motion continuous. One source clip becomes several variations or a longer cut without reshooting.
Sharpen and lengthen older or low-resolution footage with Wan 2.2's upscaling to 1080p or 2K and its extend mode, preserving timing and composition. This gives archives and rough drafts a clean, deliverable finish.
Because Wan 2.2 is open under Apache 2.0, you can self-host the weights or reach the Wan API on Atlas Cloud to skip the GPU setup. Wire it into an automated pipeline that turns rows of data into finished clips at scale, all through one integration.
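One way to picture the rows-to-clips pipeline is a small step that turns each data row into a generation job. The sketch below assumes a CSV with `name` and `style` columns, a made-up prompt template, and a hypothetical model ID. It only builds the job list and leaves submission to whatever client you use.

```python
import csv
import io
from typing import Dict, List

# Hypothetical prompt template; adapt the wording to your own catalogue.
PROMPT_TEMPLATE = "Product showcase of {name}, {style}, slow orbit camera"


def rows_to_jobs(csv_text: str, model: str = "wan-2.2-turbo") -> List[Dict[str, str]]:
    """Turn each CSV row into one generation job (model ID is a placeholder)."""
    reader = csv.DictReader(io.StringIO(csv_text))
    jobs = []
    for row in reader:
        prompt = PROMPT_TEMPLATE.format(
            name=row["name"].strip(),
            style=row["style"].strip(),
        )
        jobs.append({"model": model, "prompt": prompt})
    return jobs


sample = "name,style\nCeramic mug,warm studio light\nTrail shoe,outdoor dawn\n"
jobs = rows_to_jobs(sample)
```

Keeping job construction separate from submission makes it easy to review or deduplicate prompts before any credits are spent.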
See how the Wan API lines up against other leading video models on architecture, inputs, and licensing, so you can pick the model that fits, all reachable on Atlas Cloud.
| Model | Provider | Architecture | Inputs | Open Weights | Best For |
|---|---|---|---|---|---|
| Wan 2.2 | Alibaba | Mixture-of-Experts diffusion | Text, image, video | Yes (Apache 2.0) | Cinematic open-source video with self-host option |
| Seedance 2.0 | ByteDance | Proprietary | Text, image, video, audio | No | Multimodal, reference-driven cinematic video |
| Kling 3.0 | Kuaishou | Proprietary | Text, image, video | No | AI Director storytelling, multilingual dialogue |
| Hailuo | MiniMax | Proprietary | Text, image | No | Lifelike physics motion and anime styles |
| Veo 3.1 | Google | Proprietary | Text, image | No | Cinematic, prompt-faithful short clips |
Get started in minutes: follow these steps to integrate and deploy models through Atlas Cloud's platform.
Sign up at atlascloud.ai and complete verification. New users receive free credits to explore the platform and test models.
Combining the advanced Wan models with Atlas Cloud's GPU-accelerated platform provides unmatched performance, scalability, and developer experience.
Low Latency: GPU-optimized inference for real-time reasoning.
Unified API: Run Wan, GPT, Gemini, and DeepSeek with one integration.
Transparent Pricing: Predictable per-token billing with serverless options.
Developer Experience: SDKs, analytics, fine-tuning tools, and templates.
Reliability: 99.99% uptime, RBAC, and compliance-ready logging.
Security & Compliance: SOC 2 Type II, HIPAA alignment, and data sovereignty in the US.
The Wan API gives developers Alibaba's open Wan video models on Atlas Cloud through one unified key. This page covers Wan 2.2, a foundational video model that pioneered a Mixture-of-Experts architecture for video diffusion and handles text to video, image to video, and video to video with cinematic motion control. It sits alongside 300+ other models on the same account, so you reach it through OpenAI-compatible endpoints with no separate setup.
Wan 2.2 ships in a few variants: the A14B models built for cinematic-quality text to video and image to video, and the lighter TI2V-5B model that combines both modes and runs at 720p on consumer GPUs. Atlas Cloud also offers turbo variants that trade a little fidelity for faster, lower-latency generation. Pick A14B for maximum quality, TI2V-5B for efficiency, and turbo for rapid iteration.
The Wan API covers text to video, image to video, and video to video, so you can start from a prompt, animate a still, or transform existing footage. It also supports first-to-last frame control, video extend to lengthen a clip, and upscaling to sharpen resolution. Each mode is a parameter or endpoint change, which keeps a mixed pipeline on one integration.
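To illustrate how a single integration might serve several modes, the sketch below maps each mode to the inputs it needs and validates them before building a payload. The mode names and input keys are hypothetical stand-ins rather than the actual Atlas Cloud parameters.

```python
from typing import Any, Dict

# Hypothetical mode names and the inputs each one is assumed to require.
REQUIRED_INPUTS = {
    "text-to-video": {"prompt"},
    "image-to-video": {"prompt", "image"},
    "video-to-video": {"prompt", "video"},
}


def build_payload(mode: str, **inputs: Any) -> Dict[str, Any]:
    """Validate the inputs for a mode and return a request body."""
    if mode not in REQUIRED_INPUTS:
        raise ValueError(f"unknown mode: {mode}")
    missing = REQUIRED_INPUTS[mode] - set(inputs)
    if missing:
        raise ValueError(f"{mode} is missing inputs: {sorted(missing)}")
    return {"mode": mode, **inputs}


t2v = build_payload("text-to-video", prompt="Aerial orbit over a lighthouse")
i2v = build_payload(
    "image-to-video",
    prompt="The sneaker rotates slowly",
    image="https://example.com/sneaker.png",
)
```

Because only the `mode` value and its inputs change, the rest of the calling code (auth, submission, polling) can stay identical across modes.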
Mixture-of-Experts, or MoE, splits the denoising process across specialized expert models that each handle part of the work. Wan 2.2 was the first video diffusion model to use this design, which raises overall model capacity and detail without increasing inference cost per step. In practice that means richer scenes and more accurate motion at the same compute budget.
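The toy sketch below shows why routing between experts need not raise per-step cost: each denoising step calls exactly one expert, chosen here by noise level. The two stand-in functions, the routing rule, and the 0.5 boundary are all assumptions for illustration. Real experts are large diffusion networks, and this is not Wan 2.2's actual code.

```python
from typing import Callable, List, Tuple

Expert = Callable[[float, float], float]


def high_noise_expert(x: float, t: float) -> float:
    """Stand-in for an expert that handles early, noisy steps."""
    return x * 0.5


def low_noise_expert(x: float, t: float) -> float:
    """Stand-in for an expert that handles late, detail-refining steps."""
    return x * 0.9


def pick_expert(t: float, boundary: float = 0.5) -> Expert:
    """Route by noise level t in (0, 1]; the boundary is an assumed value."""
    return high_noise_expert if t >= boundary else low_noise_expert


def denoise(x: float, num_steps: int = 10) -> Tuple[float, List[str]]:
    """Run a toy denoising loop, recording which expert ran at each step."""
    used: List[str] = []
    for i in range(num_steps):
        t = 1.0 - i / num_steps  # noise level falls as steps progress
        expert = pick_expert(t)
        x = expert(x, t)         # exactly one expert call per step
        used.append(expert.__name__)
    return x, used


result, experts_used = denoise(1.0, num_steps=10)
```

Total capacity is the sum of both experts, but the work done at any single step is that of one expert, which is the sense in which capacity grows without a matching rise in per-step compute.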
Wan 2.2 generates natively at 480p and 720p, with higher resolution available through upscaling to 1080p or 2K. Clips are short, typically a few seconds per generation. The lighter TI2V-5B variant targets 720p at 24 fps.
Yes. Wan 2.2 is released by Alibaba under the Apache 2.0 license, with model weights and inference code published on Hugging Face and GitHub, so you can download, modify, and self-host it. Running it locally needs a GPU with at least 16 GB of VRAM, ideally more for the larger A14B models. Atlas Cloud hosts the Wan API so you can skip the hardware and scaling work entirely.
Yes. The Apache 2.0 license allows commercial use, modification, and redistribution, so video generated with Wan 2.2 can go into commercial projects. Review Atlas Cloud's terms of service for the specifics of your plan, and note the usual restrictions around generating content that depicts real, identifiable people without their consent.
The turbo variants apply an rCM sampler and 4-step distillation, which compress the denoising process into far fewer steps. This lowers latency and cost per clip while keeping strong visual fidelity, which makes turbo a good fit for iteration, batch runs, and prompt testing. Move to the standard A14B models when a final render needs maximum detail.
Generation is asynchronous: each request returns a prediction ID that you poll until the clip is ready, which fits queues and high-volume runs. Add exponential backoff and a retry on a 429 response, and use the turbo variants for drafts to keep throughput high. Contact support to raise concurrency limits as your workload grows.
Create an account on Atlas Cloud, generate an API key, and send a request to the Wan model with your prompt or input image through the OpenAI-compatible endpoint. Poll the prediction endpoint for the finished clip, then scale up as needed. Because the same key reaches 300+ models, you can test other video and image models without any extra setup.
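A minimal sketch of the first request, using only the Python standard library, might look like the following. The endpoint URL, model ID, and body fields are placeholders rather than Atlas Cloud's real values. Bearer-token auth is assumed because the endpoints are described as OpenAI-compatible. The function builds the request without sending it, so substitute the real values from the API reference before calling `urllib.request.urlopen`.

```python
import json
import urllib.request
from typing import Optional

# Placeholder URL: replace with the endpoint from the Atlas Cloud API docs.
HYPOTHETICAL_ENDPOINT = "https://api.example.com/v1/video/generations"


def build_generation_request(
    api_key: str,
    prompt: str,
    model: str = "wan-2.2-t2v-a14b",  # hypothetical model ID
    image_url: Optional[str] = None,
) -> urllib.request.Request:
    """Build (but do not send) a POST request for a video generation."""
    payload = {"model": model, "prompt": prompt}
    if image_url is not None:
        payload["image"] = image_url  # hypothetical field for image to video
    return urllib.request.Request(
        HYPOTHETICAL_ENDPOINT,
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "Authorization": f"Bearer {api_key}",  # assumed auth scheme
            "Content-Type": "application/json",
        },
        method="POST",
    )


request = build_generation_request("test-key", "A slow dolly-in on a desk lamp")
```

Sending the request with `urllib.request.urlopen(request)` would then return a response containing the prediction ID to poll.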