Kling Omni Video O3 (Standard) is Kuaishou's advanced unified multi-modal video model with MVL (Multi-modal Visual Language) technology. Generates high-quality videos from text prompts with natural motion and audio generation support.
Kling Omni Video O3 (Standard) is Kuaishou's advanced unified multi-modal video model with MVL (Multi-modal Visual Language) technology. Generates high-quality videos from text prompts with natural motion and audio generation support.
Your request will cost 0.071 per run. For $10 you can run this model approximately 140 times.
Here's what you can do next:
The following parameters are accepted in the request body.
No parameters available.
{
"model": "kwaivgi/kling-video-o3-std/text-to-video"
}You need to be logged in to access your model request history.
Log InKling Video O3 Standard is Kuaishou's advanced text-to-video model in the O3 family, delivering high-quality cinematic video from text descriptions. With optional synchronized sound generation, multiple aspect ratios, and flexible duration from 3 to 15 seconds, it offers a strong balance of quality and cost.
O3-level quality Advanced visual fidelity and motion realism beyond V3.0 models.
Sound generation Optional synchronized sound effects generated alongside the video.
Flexible duration Generate videos from 3 to 15 seconds — any length you need.
Multiple aspect ratios Support for 16:9, 9:16, and 1:1 to fit any platform.
Prompt Enhancer Built-in tool to automatically improve your video descriptions.
| Parameter | Required | Description |
|---|---|---|
| prompt | Yes | Text description of the video scene and motion |
| aspect_ratio | No | Output ratio: 16:9 (default), 9:16, 1:1 |
| duration | No | Video length: 3-15 seconds (default: 5) |
| sound | No | Generate synchronized sound (default: disabled) |