Kling v3.0 Standard Text-to-Video model by Kuaishou. High-quality video generation from text prompts.
Kling v3.0 Standard Text-to-Video model by Kuaishou. High-quality video generation from text prompts.
Your request will cost 0.153 per run. For $10 you can run this model approximately 65 times.
Here's what you can do next:
The following parameters are accepted in the request body.
No parameters available.
{
"model": "kwaivgi/kling-v3.0-std/text-to-video"
}You need to be logged in to access your model request history.
Log InKling V3.0 Standard is Kuaishou's latest text-to-video generation model, delivering cinematic video from text descriptions with optional synchronized sound and voice generation. Support for negative prompts, multiple aspect ratios, and a CFG scale for creative control.
Latest Kling generation V3.0 brings improved motion quality and visual fidelity over V2.6.
Sound generation Optional synchronized sound effects generated alongside the video.
Voice list support Add up to 2 custom voice entries for character dialogue.
Negative prompt support Exclude unwanted elements for precise control over the output.
CFG scale control Fine-tune the balance between prompt adherence and creative freedom.
| Parameter | Required | Description |
|---|---|---|
| prompt | Yes | Text description of the video scene and motion |
| negative_prompt | No | Elements to exclude from generation |
| duration | No | Video length: 5 or 10 seconds (default: 5) |
| aspect_ratio | No | Output ratio: 16:9 (default), 9:16, 1:1 |
| cfg_scale | No | Prompt adherence strength (default: 0.5) |
| sound | No | Generate synchronized sound (default: disabled) |
| voice_list | No | Custom voice entries, up to 2 (click "+ Add Item") |