alibaba/wan-2.2/i2v-720p

Image-to-Video

Open and Advanced Large-Scale Video Generative Models.


Each run costs $0.3. $10 covers about 33 runs.
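The run count quoted above follows from whole-number division of the budget by the per-run price. A minimal sketch of that arithmetic, assuming the 0.3 figure is in US dollars (the page gives the currency only for the $10 budget) and using a hypothetical helper name `runs_for_budget`:

```python
from decimal import Decimal


def runs_for_budget(budget: str, cost_per_run: str) -> int:
    """Return how many whole runs fit into the budget.

    Decimal is used instead of float so that 0.3 is represented exactly.
    """
    cost = Decimal(cost_per_run)
    if cost <= 0:
        raise ValueError("cost_per_run must be positive")
    return int(Decimal(budget) // cost)


# Figures taken from the page: 0.3 per run, $10 budget.
runs = runs_for_budget("10", "0.3")
leftover = Decimal("10") - runs * Decimal("0.3")
print(runs, leftover)
```

With the page's figures this gives 33 full runs, with 0.1 of the budget left over.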


Input Schema

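The following parameters are accepted in the request body.

Total: 0, Required: 0, Optional: 0

No parameters are currently available.

Example request body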

json

{
  "model": "alibaba/wan-2.2/i2v-720p"
}
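The request body above can also be built programmatically. The sketch below only serializes the body, because the page does not give the endpoint URL, the authentication scheme, or any input parameters; `build_request_body` is a hypothetical helper name, not part of any documented client.

```python
import json

# Model identifier exactly as shown on the page.
MODEL_ID = "alibaba/wan-2.2/i2v-720p"


def build_request_body(model: str = MODEL_ID) -> str:
    """Serialize the minimal request body shown in the example.

    Only the "model" field is included, because the input schema on the
    page lists no other parameters.
    """
    if not model:
        raise ValueError("model must be a non-empty string")
    return json.dumps({"model": model}, indent=2)


body = build_request_body()
print(body)
```

Sending the body would require an endpoint and credentials, which are not stated on the page and are left out here.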


Wan 2.2 AI Video Model

Wan 2.2 is a new-generation multimodal generative model from WAN AI. It adopts a Mixture-of-Experts (MoE) architecture made up of a high-noise expert model and a low-noise expert model. Each denoising timestep is assigned to one of the two experts, so each expert specializes in its own stage of the denoising process, which yields higher-quality video.
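The timestep-based split between the two experts can be illustrated with a toy routing function. This is a sketch only: the normalized timestep scale, the `BOUNDARY` value, and the function names are assumptions made for illustration, and the page does not state where the real model switches experts.

```python
from typing import List

# Hypothetical switch point on a normalized scale where 1.0 is pure noise.
# The page does not say where Wan 2.2 actually changes experts.
BOUNDARY = 0.5


def pick_expert(t: float, boundary: float = BOUNDARY) -> str:
    """Return the expert responsible for normalized timestep t in [0, 1]."""
    if not 0.0 <= t <= 1.0:
        raise ValueError("t must lie in [0, 1]")
    return "high_noise" if t >= boundary else "low_noise"


def expert_schedule(num_steps: int, boundary: float = BOUNDARY) -> List[str]:
    """List the expert used at each step when denoising from t=1 toward t=0."""
    if num_steps < 1:
        raise ValueError("num_steps must be at least 1")
    timesteps = [1.0 - i / num_steps for i in range(num_steps)]
    return [pick_expert(t, boundary) for t in timesteps]


print(expert_schedule(4))
```

Exactly one expert is selected at every step, which is why adding a second expert increases total capacity without increasing the compute spent per step.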

Wan2.2 focuses on the following innovations:

Effective MoE Architecture: Wan2.2 introduces a Mixture-of-Experts (MoE) architecture into video diffusion models. By splitting the denoising process across timesteps among specialized expert models, it enlarges overall model capacity while keeping the computational cost the same.
Cinematic-level Aesthetics: Wan2.2 incorporates meticulously curated aesthetic data with detailed labels for lighting, composition, contrast, color tone, and more. This allows more precise and controllable cinematic style generation, so videos can be created to match customizable aesthetic preferences.
Complex Motion Generation: Compared to Wan2.1, Wan2.2 is trained on significantly more data, with 65.6% more images and 83.2% more videos. This expansion notably improves the model's generalization across dimensions such as motion, semantics, and aesthetics, achieving top performance among all open-source and closed-source models.
Efficient High-Definition Hybrid TI2V: Wan2.2 open-sources a 5B model built on the Wan2.2-VAE, which achieves a compression ratio of 16×16×4. This model supports both text-to-video and image-to-video generation at 720P resolution and 24fps, and it can run on consumer-grade graphics cards such as the 4090. It is one of the fastest 720P@24fps models currently available and can serve both industrial and academic users.
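To get a feel for what a 16×16×4 compression ratio means, the sketch below estimates the latent grid for a 720P clip. It assumes the factor of 16 applies to height and width and the factor of 4 to time, that 720P means 1280×720, and that sizes are simply divided and rounded up; the real VAE may treat the first frame or padding differently, and the channel dimension is ignored. The 5-second clip length is a hypothetical example.

```python
import math
from typing import Tuple


def latent_shape(frames: int, height: int, width: int,
                 t_factor: int = 4, s_factor: int = 16) -> Tuple[int, int, int]:
    """Estimate the (frames, height, width) of the latent grid.

    Uses plain ceiling division, which is an assumption rather than the
    documented behavior of the Wan2.2-VAE.
    """
    return (math.ceil(frames / t_factor),
            math.ceil(height / s_factor),
            math.ceil(width / s_factor))


frames = 24 * 5  # hypothetical 5-second clip at 24fps
shape = latent_shape(frames, 720, 1280)
pixel_positions = frames * 720 * 1280
latent_positions = shape[0] * shape[1] * shape[2]
print(shape, pixel_positions // latent_positions)
```

Under these assumptions the clip maps to a 30×45×80 latent grid, which has 1024 times fewer positions than the pixel grid (16 × 16 × 4 = 1024).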

Key Features of Wan 2.2

Cinematic-level aesthetic control: deeply integrates professional film-industry aesthetic standards and supports multi-dimensional visual control over lighting, color, and composition.
Large-scale complex motion: reproduces a wide range of complex motions and improves the smoothness and controllability of motion.
Precise semantic compliance: excels at complex scenes and multi-object generation, and follows the user's creative intent more closely.

The model supports multiple generation modes, such as text-to-video and image-to-video, and is suited to content creation, artistic creation, education and training, and other applications.

Model Highlights

Cinematic-level Aesthetic Control: Professional camera language, with multi-dimensional visual control over lighting, color, and composition
Large-scale Complex Motion: Smoothly reproduces a wide range of complex motions, with better motion controllability and naturalness
Precise Semantic Compliance: Complex scene understanding and multi-object generation that follow creative intent more closely
