kwaivgi/kling-v3.0-std/image-to-video

Kling v3.0 Standard Image-to-Video model by Kuaishou. High-quality video generation from images.

IMAGE-TO-VIDEONEW
Home
Explore
Kling Video Models
Kling 3.0 Video Models
kwaivgi/kling-v3.0-std/image-to-video
image-to-video

Kling v3.0 Standard Image-to-Video model by Kuaishou. High-quality video generation from images.

INPUT

Loading parameter configuration...

OUTPUT

Idle
Your generated videos will appear here
Configure your settings and click Run to get started

Your request will cost 0.153 per run. For $10 you can run this model approximately 65 times.

Here's what you can do next:

Parameters

Queue

Integrations

Input Schema

The following parameters are accepted in the request body.

Total: 0Required: 0Optional: 0

No parameters available.

Example Request Body

json
{
  "model": "kwaivgi/kling-v3.0-std/image-to-video"
}

Please log in to view request history

You need to be logged in to access your model request history.

Log In

Kling V3.0 Standard Image-to-Video

Kling V3.0 Standard Image-to-Video is Kuaishou's latest image-to-video generation model. Upload a reference image and describe the motion — the model generates cinematic video with optional synchronized sound, voice support, and start-to-end frame guidance.

Why Choose This?

  • Latest Kling generation V3.0 delivers improved motion quality and visual fidelity over V2.6.
  • Start-end frame guidance Optional end image for controlled transitions between two frames.
  • Sound generation Optional synchronized sound effects generated alongside the video.
  • Voice list support Add up to 2 custom voice entries for character dialogue.
  • CFG scale control Fine-tune the balance between prompt adherence and creative freedom.

Parameters

ParameterRequiredDescription
promptNoText description of the desired motion and action
negative_promptNoElements to exclude from generation
imageYesStart frame image to animate (URL or upload)
end_imageNoEnd frame image for guided transitions
durationNoVideo length: 5 or 10 seconds (default: 5)
cfg_scaleNoPrompt adherence strength (default: 0.5)
soundNoGenerate synchronized sound (default: disabled)
voice_listNoCustom voice entries, up to 2 (click "+ Add Item")

How to Use

  1. Upload your image — provide the reference image to animate.
  2. Write your prompt (optional) — describe the motion, camera movement, and action.
  3. Upload end image (optional) — provide an end frame for guided transitions.
  4. Add negative prompt (optional) — specify what you want to avoid.
  5. Set duration — 5 seconds or 10 seconds.
  6. Adjust cfg_scale (optional) — higher for stricter prompt following, lower for more freedom.
  7. Enable sound (optional) — generate synchronized audio with the video.
  8. Add voices (optional) — add up to 2 voice entries for dialogue.
  9. Run — submit and download your video.

Best Use Cases

  • Photo Animation — Bring portraits, landscapes, and product images to life.
  • Scene Transitions — Use start and end frames for smooth visual transitions.
  • Social Media Content — Create engaging videos with sound from still images.
  • Marketing & Ads — Generate dynamic promotional videos from product photos.
  • Storytelling — Animate scenes with synchronized audio and dialogue.

Pro Tips

  • Use clear, descriptive prompts with specific motion details for best results.
  • Add an end_image for controlled transitions between two visual states.
  • Enable sound for a complete video experience with synchronized audio.
  • Use negative prompts to avoid artifacts (e.g., "blurry, low quality, distorted").
  • Lower cfg_scale for more creative variation, higher for strict prompt adherence.
  • Use high-quality source images for better video results.

Notes

  • Image is the only required field; prompt is optional but recommended.
  • Duration options are 5 or 10 seconds only.
  • Voice list supports a maximum of 2 entries.
  • Ensure uploaded image URLs are publicly accessible.
  • Kling V3.0 Standard Text-to-Video — Generate video from text descriptions with V3.0 quality.
  • Kling V2.6 Standard Image-to-Video — Previous generation image-to-video.
  • Kling V2.6 Standard Text-to-Video — Previous generation text-to-video.

Start From 300+ Models,

Explore all models