
Kling v3.0 Standard Image-to-Video model by Kuaishou. High-quality video generation from images.

Kling v3.0 Professional Image-to-Video model by Kuaishou. Premium quality video generation from images with advanced features.

Kling v3.0 Professional Text-to-Video model by Kuaishou. Premium quality video generation from text prompts with advanced features.

Kling v3.0 Standard Text-to-Video model by Kuaishou. High-quality video generation from text prompts.

Kling V2 AI Avatar Pro generates high-quality AI avatar videos with clean detail, stable motion, and strong identity consistency—ideal for profiles, intros, and social content.

Kling AI Avatar generates high-quality AI avatar videos for profiles, intros, and social content, delivering clean detail and cinematic motion with reliable prompt adherence.

Kling 2.6 Pro Motion Control turns reference motion clips (dance, action, gesture) into smooth, realistic animations. Upload a character image (or source video) and a motion video; the model transfers the movement while preserving identity and temporal consistency.

Kling 2.6 Standard Motion Control transfers motion from reference videos to animate still images. Upload a character image and a motion clip (dance, action, gesture), and the model extracts the movement to generate smooth, realistic video.

Kling Omni Video O3 Video-Edit enables conversational video editing through natural language commands. Professional quality with object removal/replacement, background changes, and effects.

Kling Omni Video O3 Reference-to-Video generates creative videos using character, prop, or scene references. Professional quality with up to 7 reference images and optional video input.

Kling Omni Video O3 Image-to-Video transforms static images into dynamic cinematic videos using MVL technology. Professional quality with first/last frame control and audio generation.

Kling Omni Video O3 is Kuaishou's advanced unified multi-modal video model with MVL (Multi-modal Visual Language) technology. Professional quality with enhanced motion and detail.

Latest text-to-video model from Kuaishou with sound generation, flexible aspect ratios, and cinematic quality.

Latest image-to-video model from Kuaishou with sound generation, enhanced dynamics, and cinematic quality.

Kling Omni Video O3 Video-Edit (Standard) enables natural-language video edits: remove or replace objects, change backgrounds, add effects, and more. Video duration limited to 10s.

Kling Omni Video O3 (Standard) Reference-to-Video generates creative videos using character, prop, or scene references. Supports up to 7 reference images and optional video input.

Kling Omni Video O3 (Standard) Image-to-Video transforms static images into dynamic cinematic videos using MVL technology. Supports first/last frame control and audio generation.

Kling Omni Video O3 (Standard) is Kuaishou's advanced unified multi-modal video model with MVL (Multi-modal Visual Language) technology. Generates high-quality videos from text prompts with natural motion and audio generation support.

Kling Omni Video O1 Image-to-Video transforms static images into dynamic cinematic videos using MVL (Multi-modal Visual Language) technology. Maintains subject consistency while adding natural motion, physics simulation, and seamless scene dynamics. Ready-to-use REST API, best performance, no coldstarts, affordable pricing.

Kling Omni Video O1 is Kuaishou's first unified multi-modal video model with MVL (Multi-modal Visual Language) technology. Text-to-Video mode generates cinematic videos from text prompts with subject consistency, natural physics simulation, and precise semantic understanding. Ready-to-use REST API, best performance, no coldstarts, affordable pricing.

Delivers high-speed text-to-video generation with cinematic motion precision and enhanced temporal stability.

Transforms stills into lifelike video clips at 2× faster speed while preserving fine texture and lighting consistency.

Supports start-to-end frame conditioning for controlled motion continuity and smoother scene transitions.

Generates multi-subject video from images with improved coherence and advanced motion-tracking accuracy.

A cost-efficient option for basic image-to-video generation with balanced speed and detail.

Adds post-processing and stylistic motion effects, expanding creative editing within Kling’s video suite.

Produces cinematic 1080p clips with refined lighting, camera realism, and cross-frame character stability.

Interprets complex text prompts with advanced motion logic and enhanced dynamic-camera rendering.

The foundational cinematic model combining high-fidelity visuals with realistic human motion generation.

Delivers professional-grade image-to-video generation with precise motion continuity and visual depth.

Balances generation speed and fidelity, producing sharp, fluid image-to-video results for general creative use.

Entry-level text-to-video generator offering stable motion and prompt alignment for short-form outputs.

Upgraded image-to-video variant with smoother motion blending and improved texture realism.

A fast, reliable 720p model optimized for quick visual drafts and efficient prototyping.

Lightweight early-generation model providing foundational image-to-video conversion at minimal cost.