Kling Omni Video O3 (Standard) is Kuaishou's advanced unified multi-modal video model, built on MVL (Multi-modal Visual Language) technology. It generates high-quality videos from text prompts, with natural motion and support for audio generation.
Kling Video O3 Standard is Kuaishou's advanced text-to-video model in the O3 family, delivering high-quality cinematic video from text descriptions. With optional synchronized sound generation, multiple aspect ratios, and flexible duration from 3 to 15 seconds, it offers a strong balance of quality and cost.
- O3-level quality: Advanced visual fidelity and motion realism beyond V3.0 models.
- Sound generation: Optional synchronized sound effects generated alongside the video.
- Flexible duration: Generate videos of any length from 3 to 15 seconds.
- Multiple aspect ratios: Support for 16:9, 9:16, and 1:1 to fit any platform.
- Prompt Enhancer: Built-in tool that automatically improves your video descriptions.
| Parameter | Required | Description |
|---|---|---|
| prompt | Yes | Text description of the video scene and motion |
| aspect_ratio | No | Output ratio: 16:9 (default), 9:16, 1:1 |
| duration | No | Video length: 3-15 seconds (default: 5) |
| sound | No | Generate synchronized sound (default: disabled) |
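
The parameter table above can be illustrated with a minimal sketch that checks inputs against the documented ranges and defaults before a request is sent. The field names come from the table. The function name `build_request`, the use of a boolean for `sound`, and the treatment of `duration` as a whole number of seconds are assumptions made for illustration. The endpoint and authentication are not covered here, so the sketch only builds the payload.

```python
import json

# Values taken from the parameter table above.
ALLOWED_ASPECT_RATIOS = ("16:9", "9:16", "1:1")
MIN_DURATION, MAX_DURATION = 3, 15


def build_request(prompt, aspect_ratio="16:9", duration=5, sound=False):
    """Validate the documented parameters and return a payload dict.

    Hypothetical helper: the name and return shape are illustrative only.
    """
    if not isinstance(prompt, str) or not prompt.strip():
        raise ValueError("prompt is required and must be a non-empty string")
    if aspect_ratio not in ALLOWED_ASPECT_RATIOS:
        raise ValueError(f"aspect_ratio must be one of {ALLOWED_ASPECT_RATIOS}")
    # Assumption: duration is an integer number of seconds (bool is rejected).
    if isinstance(duration, bool) or not isinstance(duration, int):
        raise ValueError("duration must be an integer number of seconds")
    if not MIN_DURATION <= duration <= MAX_DURATION:
        raise ValueError(
            f"duration must be between {MIN_DURATION} and {MAX_DURATION} seconds"
        )
    return {
        "prompt": prompt.strip(),
        "aspect_ratio": aspect_ratio,
        "duration": duration,
        # Assumption: sound is sent as a boolean flag.
        "sound": bool(sound),
    }


if __name__ == "__main__":
    payload = build_request(
        "A red fox running through fresh snow at dawn",
        aspect_ratio="9:16",
        duration=8,
        sound=True,
    )
    print(json.dumps(payload, indent=2))
```

Validating on the client side catches out-of-range values such as a 20-second duration before any request is made. Omitting the optional arguments falls back to the defaults listed in the table.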
Only on Atlas Cloud.