bytedance/avatar-omni-human

Hình ảnh-Video

Open and Advanced Large-Scale Video Generative Models.

OmniHuman

OmniHuman is a cutting-edge end-to-end AI framework developed by ByteDance, designed to generate highly realistic human videos from just a single image and an audio input, with advanced features such as lip sync, facial animation, and gesture synthesis. Whether you provide a portrait, half-body, or full-body photo, OmniHuman brings it to life with natural movements, expressive gestures, accurate lip synchronization to audio, and remarkable attention to detail. By combining multiple input types—such as images and audio—OmniHuman creates vivid, high-quality video results. The model is highly adaptable, supporting not only real human portraits but also animated or cartoon characters, making it suitable for a wide range of applications including content creation, singing, lip sync videos, and performance scenarios. 0.12$ per second.

OmniHuman Avatar Effect

Requirements

Number of Image

Only one image can be uploaded per generation.

Image Requirements

Only human portrait images are supported.
For best results, use clear, front-facing portraits with good lighting.
Supported formats: PNG, JPEG, JPG, WebP.
Maximum file size: 50MB.

Output Characteristics

Produces natural human motion, facial expressions, and accurate lip sync to audio.
Works best with clear, well-lit portrait photos.
May not perform optimally with extreme poses or poor lighting.

Best Practices

Use a clear, front-facing portrait photo.
Ensure the image is well-lit.
Avoid extreme angles or poses.
Make sure the face is clearly visible.
Avoid images with multiple people.

Keywords

lip sync
facial animation
gesture synthesis
ortrait animation
audio-driven video generation

Thông số kỹ thuật Chi tiết

Tổng quan:

Nhà cung cấp Mô hình:BYTEDANCE

Loại Mô hình:image-to-video

Triển khai:API Suy luận; Playground

Giá cả:$0.12

Thông số chính:

Giới hạn Kích thước:Chiều rộng × chiều cao tối đa (tùy chỉnh)

Hỗ trợ LoRA:Không

Tùy chọn Seed:N/A

Tạo Kiệt tác Tiếp theo của Bạn

Khám phá Các Mô hình Tương tự

NEW

HOT

Hình ảnh-Video

Seedance v1.5 Pro Image-to-Video

Native audio-visual joint generation model by ByteDance. Supports unified multimodal generation with precise audio-visual sync, cinematic camera control, and enhanced narrative coherence.

Seedance v1.5 Pro Text-to-Video

Native audio-visual joint generation model by ByteDance. Supports unified multimodal generation with precise audio-visual sync, cinematic camera control, and enhanced narrative coherence.

Seedance v1.5 Pro Image-to-Video Fast

Native audio-visual joint generation model by ByteDance. Supports unified multimodal generation with precise audio-visual sync, cinematic camera control, and enhanced narrative coherence.

$0.022/GIÂY

NEW

Văn bản-Video

Seedance v1.5 Pro Text-to-Video Fast

Native audio-visual joint generation model by ByteDance. Supports unified multimodal generation with precise audio-visual sync, cinematic camera control, and enhanced narrative coherence.

FAST

$0.022/GIÂY

Bắt đầu với 300+ Mô hình,

Khám phá tất cả mô hình