HunyuanVideo

HunyuanVideo is a text-to-video generation model that creates high-quality videos from text descriptions. Its framework combines joint image-video model training with infrastructure built for efficient large-scale training and inference.

Model Description

The model is trained in a spatio-temporally compressed latent space and uses a large language model as its text encoder. In professional human evaluations, HunyuanVideo is reported to outperform previous state-of-the-art models in text alignment, motion quality, and visual quality.
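To make "spatio-temporally compressed latent space" concrete, the sketch below computes the latent tensor shape for a given video size. The compression factors (4x in time, 8x in space, 16 latent channels) are assumptions taken from the publicly described HunyuanVideo 3D VAE; this page does not state them, and the hosted deployment may differ. The function name `latent_shape` is hypothetical.

```python
from typing import Tuple

# Assumed compression factors (not stated on this page):
# 4x temporal, 8x spatial, 16 latent channels.
TEMPORAL_FACTOR = 4
SPATIAL_FACTOR = 8
LATENT_CHANNELS = 16


def latent_shape(num_frames: int, height: int, width: int) -> Tuple[int, int, int, int]:
    """Return (channels, latent_frames, latent_height, latent_width).

    Assumes a causal video VAE in which the first frame is encoded on its
    own, so num_frames must have the form 4k + 1.
    """
    if num_frames < 1 or (num_frames - 1) % TEMPORAL_FACTOR != 0:
        raise ValueError("num_frames must be of the form 4k + 1")
    if height % SPATIAL_FACTOR != 0 or width % SPATIAL_FACTOR != 0:
        raise ValueError("height and width must be multiples of 8")
    latent_frames = (num_frames - 1) // TEMPORAL_FACTOR + 1
    return (
        LATENT_CHANNELS,
        latent_frames,
        height // SPATIAL_FACTOR,
        width // SPATIAL_FACTOR,
    )


if __name__ == "__main__":
    # A 129-frame clip at 1280x720.
    print(latent_shape(129, 720, 1280))
```

Under these assumptions, a 129-frame 1280x720 clip maps to a 16 x 33 x 90 x 160 latent, which is why generation runs in the latent space rather than in pixel space.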

Key features

🎨 High-quality video generation from text descriptions
📐 Support for various aspect ratios and resolutions
✍️ Advanced prompt handling with a built-in rewrite system
🎯 Stable motion generation and temporal consistency

Specifications in Depth

Overview:

Model Provider: OTHERS

Model Type: text-to-video

Deployment: Inferencing API; Playground

Pricing: $0.4

Key Specs:

Size Cap: up to width × height (user-configurable)

LoRA Support: No

Seed Options: N/A

Explore Similar Models

text-to-video

Van-2.6 Text-to-video

A speed-optimized text-to-video option that prioritizes lower latency while retaining strong visual fidelity. Ideal for iteration, batch generation, and prompt testing.

$0.068/SEC

image-to-video

Van-2.6 Image-to-video

A speed-optimized image-to-video option that prioritizes lower latency while retaining strong visual fidelity. Ideal for iteration, batch generation, and prompt testing.

$0.068/SEC

image-to-video

Ltx-Video v097 i2v 720p

Open and Advanced Large-Scale Video Generative Models.

$0.3/SEC

image-to-video

Magi-1 24b

Open and Advanced Large-Scale Video Generative Models.

$0.32/SEC
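The per-second prices listed for these models can be compared for a clip of a given length with a small calculation. This is a rough sketch that assumes billing is strictly linear in output duration, with no minimum charge or rounding; the page does not confirm this. The helper `estimate_cost` is hypothetical, and HunyuanVideo itself is left out because its listed price ($0.4) carries no unit.

```python
from decimal import Decimal

# Per-second rates in USD, copied from the listings above.
PER_SECOND_RATES = {
    "Van-2.6 Text-to-video": Decimal("0.068"),
    "Van-2.6 Image-to-video": Decimal("0.068"),
    "Ltx-Video v097 i2v 720p": Decimal("0.3"),
    "Magi-1 24b": Decimal("0.32"),
}


def estimate_cost(model: str, seconds: int) -> Decimal:
    """Estimate the cost in USD of generating `seconds` of video.

    Assumes linear per-second billing (an assumption, not a documented rule).
    """
    if seconds <= 0:
        raise ValueError("seconds must be positive")
    return PER_SECOND_RATES[model] * seconds


if __name__ == "__main__":
    for name in PER_SECOND_RATES:
        print(f"{name}: ${estimate_cost(name, 5)} for a 5-second clip")
```

Decimal is used instead of float so that the prices multiply exactly.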
