atlascloud/hunyuan-video/t2v

Open and Advanced Large-Scale Video Generative Models.

TEXT-TO-VIDEO
Home
Explore
atlascloud/hunyuan-video/t2v
text-to-video

Open and Advanced Large-Scale Video Generative Models.

HunyuanVideo

HunyuanVideo is an advanced text-to-video generation model that can create high-quality videos from text descriptions. It features a comprehensive framework that integrates image-video joint model training and efficient infrastructure for large-scale model training and inference.

Model Description

This model is trained on a spatial-temporally compressed latent space and uses a large language model for text encoding. According to professional human evaluation results, HunyuanVideo outperforms previous state-of-the-art models in terms of text alignment, motion quality, and visual quality.

Key features

  • 🎨 High-quality video generation from text descriptions

  • 📐 Support for various aspect ratios and resolutions

  • ✍️ Advanced prompt handling with a built-in rewrite system

  • 🎯 Stable motion generation and temporal consistency

Specifications in Depth

Overview:

Model Provider:OTHERS
Model Type:text-to-video
Deployment:Inferencing API; Playground
Pricing:$0.4

Key Specs:

Size Cap:up to width × height (user-configurable)
LoRA Support:No
Seed Options:N/A

Create Your Next Masterpiece

Inizia con Oltre 300 Modelli,

Esplora tutti i modelli