HunyuanVideo

HunyuanVideo is an advanced image-to-video generation model that creates high-quality videos guided by text descriptions. Its framework integrates joint image-video model training with efficient infrastructure for large-scale model training and inference.

Model Description

The model is trained in a spatio-temporally compressed latent space and uses a large language model for text encoding. In professional human evaluations, HunyuanVideo outperforms previous state-of-the-art models in text alignment, motion quality, and visual quality.

Key features

🎨 High-quality video generation from text descriptions
📐 Support for various aspect ratios and resolutions
✍️ Advanced prompt handling with a built-in rewrite system
🎯 Stable motion generation and temporal consistency

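Detailed specifications

Overview:

Model provider: OTHERS

Model type: image-to-video

Deployment: Inference API; Playground

Pricing: $0.08

Key parameters:

Size limit: maximum width × height (user-configurable)

LoRA support: No

Seed options: N/A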

Explore similar models

Text-to-video

Van-2.6 Text-to-video

A speed-optimized text-to-video option that prioritizes lower latency while retaining strong visual fidelity. Ideal for iteration, batch generation, and prompt testing.

$0.068/second

Image-to-video

Van-2.6 Image-to-video

A speed-optimized image-to-video option that prioritizes lower latency while retaining strong visual fidelity. Ideal for iteration, batch generation, and prompt testing.

$0.068/second

Image-to-video

Ltx-Video v097 i2v 720p

Open and Advanced Large-Scale Video Generative Models.

$0.3/second

Image-to-video

Magi-1 24b

Open and Advanced Large-Scale Video Generative Models.

$0.32/second