Kling API: Kuaishou's Full Video Lineup

The Kling API gives developers Kuaishou's full Kling video lineup on Atlas Cloud through one OpenAI-compatible key. The family spans generations from the efficient v1.6 and v2.1 models to the 3.0 flagship, covering text to video, image to video, and video editing, with tiers from fast drafts to cinematic, high-resolution output. Pick the version and tier that fit each job, and reach them all alongside 300+ models on reliable infrastructure, without managing a separate integration per model.

Explore the Leading Kling

Atlas Cloud provides you with the latest industry-leading creative models.

NEW

image-to-video

TURBO

Kling V3.0 Turbo Image-to-Video

Kling V3.0 Turbo Image-to-Video transforms static images into dynamic cinematic videos using MVL technology. Supports first/last frame control and audio generation.

Kling V3.0 Turbo Text-to-Video

Kling V3.0 Turbo Text-to-Video generates dynamic cinematic videos from text prompts using MVL technology. Supports first/last frame control and audio generation.

Kling Video O3 4K Image-to-Video

Kling Omni Video O3 (4K) Image-to-Video transforms static images into dynamic cinematic videos using MVL technology. Supports first/last frame control and audio generation.

Kling Video O3 4K Text-to-Video

Kling Omni Video O3 (4K) is Kuaishou advanced unified multi-modal video model with MVL (Multi-modal Visual Language) technology. Generates high-quality videos from text prompts with natural motion and audio generation support.

Kling v3.0 4K Image-to-Video

Kling v3.0 4K Image-to-Video model by Kuaishou. High-quality video generation from images.

Kling v3.0 Std Image-to-Video

Kling v3.0 Standard Image-to-Video model by Kuaishou. High-quality video generation from images.

Kling v3.0 Pro Image-to-Video

Kling v3.0 Professional Image-to-Video model by Kuaishou. Premium quality video generation from images with advanced features.

Kling v3.0 Pro Text-to-Video

Kling v3.0 Professional Text-to-Video model by Kuaishou. Premium quality video generation from text prompts with advanced features.

Kling v3.0 4K Text-to-Video

Kling v3.0 4K Text-to-Video model by Kuaishou. High-quality video generation from text prompts.

Kling v3.0 Std Text-to-Video

Kling v3.0 Standard Text-to-Video model by Kuaishou. High-quality video generation from text prompts.

Kling v2.6 Pro Avatar

Kling V2 AI Avatar Pro generates high-quality AI avatar videos with clean detail, stable motion, and strong identity consistency—ideal for profiles, intros, and social content.

Kling v2.6 Std Avatar

Kling AI Avatar generates high-quality AI avatar videos for profiles, intros, and social content, delivering clean detail and cinematic motion with reliable prompt adherence.

Kling v2.6 Pro Motion Control

Kling 2.6 Pro Motion Control turns reference motion clips (dance, action, gesture) into smooth, realistic animations. Upload a character image (or source video) and a motion video; the model transfers the movement while preserving identity and temporal consistency.

Kling v2.6 Std Motion Control

Kling 2.6 Standard Motion Control transfers motion from reference videos to animate still images. Upload a character image and a motion clip (dance, action, gesture), and the model extracts the movement to generate smooth, realistic video.

Kling Video O3 Pro Video-Edit

Kling Omni Video O3 Video-Edit enables conversational video editing through natural language commands. Professional quality with object removal/replacement, background changes, and effects.

Kling Video O3 Pro Reference-to-Video

Kling Omni Video O3 Reference-to-Video generates creative videos using character, prop, or scene references. Professional quality with up to 7 reference images and optional video input.

Kling Video O3 Pro Image-to-Video

Kling Omni Video O3 Image-to-Video transforms static images into dynamic cinematic videos using MVL technology. Professional quality with first/last frame control and audio generation.

Kling Video O3 Pro Text-to-Video

Kling Omni Video O3 is Kuaishou's advanced unified multi-modal video model with MVL (Multi-modal Visual Language) technology. Professional quality with enhanced motion and detail.

Kling v2.6 Pro Text-to-Video

Latest text-to-video model from Kuaishou with sound generation, flexible aspect ratios, and cinematic quality.

Kling v2.6 Pro Image-to-Video

Latest image-to-video model from Kuaishou with sound generation, enhanced dynamics, and cinematic quality.

Kling Video O3 Std Image-to-Video

Kling Omni Video O3 (Standard) Image-to-Video transforms static images into dynamic cinematic videos using MVL technology. Supports first/last frame control and audio generation.

Kling Video O3 Std Reference-to-Video

Kling Omni Video O3 (Standard) Reference-to-Video generates creative videos using character, prop, or scene references. Supports up to 7 reference images and optional video input.

Kling Video O3 Std Video-Edit

Kling Omni Video O3 Video-Edit (Standard) enables natural-language video edits: remove or replace objects, change backgrounds, add effects, and more. Video duration limited to 10s.

Kling Video O3 Std Text-to-Video

Kling Omni Video O3 (Standard) is Kuaishou's advanced unified multi-modal video model with MVL (Multi-modal Visual Language) technology. Generates high-quality videos from text prompts with natural motion and audio generation support.

Kling Video O1 Text-to-video

Kling Omni Video O1 is Kuaishou's first unified multi-modal video model with MVL (Multi-modal Visual Language) technology. Text-to-Video mode generates cinematic videos from text prompts with subject consistency, natural physics simulation, and precise semantic understanding. Ready-to-use REST API, best performance, no coldstarts, affordable pricing.

Kling Video O1 Image-to-video

Kling Omni Video O1 Image-to-Video transforms static images into dynamic cinematic videos using MVL (Multi-modal Visual Language) technology. Maintains subject consistency while adding natural motion, physics simulation, and seamless scene dynamics. Ready-to-use REST API, best performance, no coldstarts, affordable pricing.

Kling v2.5 Turbo Pro Text-to-video

Delivers high-speed text-to-video generation with cinematic motion precision and enhanced temporal stability.

Kling v2.5 Turbo Pro Image-to-video

Transforms stills into lifelike video clips at 2× faster speed while preserving fine texture and lighting consistency.

Kling v2.1 i2v Pro Start-end-frame

Supports start-to-end frame conditioning for controlled motion continuity and smoother scene transitions.

Kling v1.6 Multi i2v Pro

Generates multi-subject video from images with improved coherence and advanced motion-tracking accuracy.

Kling v1.6 Multi i2v Standard

A cost-efficient option for basic image-to-video generation with balanced speed and detail.

Kling Effects

Adds post-processing and stylistic motion effects, expanding creative editing within Kling’s video suite.

kling v2.0 i2v Master

Produces cinematic 1080p clips with refined lighting, camera realism, and cross-frame character stability.

Kling v2.1 t2v Master

Interprets complex text prompts with advanced motion logic and enhanced dynamic-camera rendering.

Kling v2.0 t2v Master

The foundational cinematic model combining high-fidelity visuals with realistic human motion generation.

Kling v2.1 i2v Master

Delivers professional-grade image-to-video generation with precise motion continuity and visual depth.

Kling v2.1 i2v Pro

Balances generation speed and fidelity, producing sharp, fluid image-to-video results for general creative use.

Kling v1.6 t2v Standard

Entry-level text-to-video generator offering stable motion and prompt alignment for short-form outputs.

Kling v1.6 i2v Pro

Upgraded image-to-video variant with smoother motion blending and improved texture realism.

Kling v2.1 i2v Standard

A fast, reliable 720p model optimized for quick visual drafts and efficient prototyping.

Kling v1.6 i2v Standard

Lightweight early-generation model providing foundational image-to-video conversion at minimal cost.

From$0.056/SEC

$0.048/SEC

-15%

Kling API Generations: Every Version and Tier, One Key

Kling API Model Lineup

Generation	Variants and Description
Kling v1.6	Standard and Multi tiers for text to video and image to video, optimized for 720p and 5-second clips. The efficient older generation for cost-sensitive, high-volume drafts and bulk social content.
Kling v2.0	Master-tier text to video and image to video, a step up in motion and fidelity over v1.6 for everyday production that does not need the newest features.
Kling v2.1	The widest mid-generation: Standard, Master, and Pro tiers for image to video, a Master text to video, plus a Pro start-to-last frame variant for controlled transitions. A balanced choice for quality production work.
Kling v2.5 Turbo Pro	Turbo Pro text to video and image to video, tuned for faster generation at Pro-level quality. A fit when speed and fidelity both matter.
Kling v3.0	The flagship, in Standard and Pro tiers for both text to video and image to video, with 5 or 10 second clips, native audio, voice support, multilingual lip-sync, on-screen text, and start-to-end frame guidance. Use it when quality and audio lead.
Kling 3.0 Omni (O3)	The reference and editing generation: subject and voice cloning from a short clip or image, conversational video editing through natural language, and multi-reference control for consistent characters.
Kling V2 AI Avatar Pro	A dedicated avatar model for talking-head and presenter videos, with clean detail, stable motion, and strong identity consistency, a fit for profiles, intros, and social content.

Kling API Features

The Kling API brings Kuaishou's full video lineup to Atlas Cloud, from v1.6 to the 3.0 flagship, with text and image to video, video editing, reference control, native audio, lip-sync, and tiered output through one integration.

One Key for the Full Kling Lineup

Reach every Kling video model, from the v1.6 and v2.1 generations to the 3.0 flagship and Omni O3, through a single integration on Atlas Cloud. Adopt new Kling releases as they land without changing your setup or opening a separate account per version.

Text and Image to Video with the Kling API

The Kling API covers text to video and image to video across the family, so you can start from a prompt or animate a still. Pick the generation and tier that match the job, from fast drafts to the cinematic 3.0 flagship.

Video Editing and Reference Control

Newer Kling models edit existing footage through natural language, removing or replacing objects, swapping backgrounds, and applying effects, and take up to 7 reference images to lock characters, props, and scenes. This keeps serialized and branded content consistent shot to shot.

Native Audio and Multilingual Lip-Sync

On the audio-capable generations, Kling produces synchronized sound in the same pass and matches lip movement across languages, including Chinese and English. Assign dialogue to specific characters so multi-speaker scenes stay clear, with no separate audio step.

First-to-Last Frame Control with the Kling API

The Kling API supports first and last frame conditioning on image to video, letting you set the opening and closing frames and have the model generate the motion between them. This gives precise control over how a shot begins and resolves.

Standard, Pro, and Turbo Tiers

Each Kling generation exposes tiers that trade speed against fidelity: Standard for everyday clips, Pro for cinematic detail, and Turbo for fast, high-volume runs. Move a request between tiers without reworking your integration.

Cinematic Motion Across Every Generation

Every Kling model is built on Kuaishou's motion and physics engine for smooth, realistic movement and stable frames. Earlier generations stay useful for cost-sensitive work while newer ones push fidelity, so the whole family shares a consistent cinematic base.

High-Resolution Output with the Kling API

The Kling API scales from standard clips to high-resolution masters, with 4K available on the dedicated Omni O3 4K models. Draft at a lower resolution, then render the final at the highest the chosen generation supports, all through the same key.

One Prompt Across the Kling API and Beyond

Run the same prompt through the Kling API and other leading video models on Atlas Cloud, and compare how each handles fast, multi-shot action and cinematic motion in a single scene.

Prompt

A single continuous cinematic action scene in 10 seconds, one flowing camera move, no hard cuts. A free-runner sprints out of a crowded night market, the camera tracking alongside as neon signs and steam blur past. She vaults over a fruit stall, slides under a lowering shutter, and the camera swings around her as she leaps across a gap between rooftops. Rain starts to fall, catching the neon light, and she lands and keeps running toward the camera as it pulls back to reveal the glowing city skyline behind her. Continuous energetic motion, dynamic tracking camera, grounded physics, reflections and volumetric light, cinematic grading, crisp 1080p.

Kling V3.0 Turbo

Seedance 2.0

Kling v2.5 Turbo Pro

Prompt

A single continuous cinematic nature scene in 10 seconds, one sweeping camera move, no hard cuts. The camera glides low over a golden savanna at sunrise, following a cheetah as it breaks into a sprint through the tall grass, dust trailing behind. It rises with the cheetah's leap, then cranes up to reveal a herd of gazelles scattering across the plain and a flock of birds lifting off in unison. The camera keeps climbing into a wide aerial as the whole herd streams toward a distant watering hole under warm morning light. Continuous fluid motion, seamless camera movement, volumetric sunlight, dust and atmosphere, cinematic grading, crisp 1080p.

Kling V3.0 Turbo

Seedance 2.0

Kling v2.5 Turbo Pro

What You Can Build with the Kling API

From social clips and cinematic ads to serialized characters and multilingual content, the Kling API lets you match each job to the right Kling generation and tier on Atlas Cloud, all through one key.

Social and Short-Form Video at Any Budget

Draft high-volume vertical clips on a fast, lower-cost Kling generation, then re-render the winners on a higher one for release. The family lets social teams keep output cheap during iteration and cinematic where it counts, without leaving one integration.

Cinematic Ads and Brand Films with the Kling API

Use the Kling API's flagship generations for polished, high-resolution ad and brand work, with strong motion and aesthetic control. When a campaign needs many variants, drop to a faster tier for the cutdowns and keep the hero spot on the top model.

Serialized Characters and Story Content

Build episodic and character-driven series where a subject must stay consistent across shots, using reference control and video editing on the newer Kling models. Keep the same character across scenes and episodes without re-establishing them each time.

Multilingual and Global Campaigns with the Kling API

Generate the same scene with synchronized audio and lip-sync across languages on the audio-capable Kling generations, so one production reaches multiple markets. The Kling API assigns dialogue per character, which keeps multi-speaker localized content clear.

Previs, Concepting, and Iteration

Run quick motion tests and storyboards on a fast Kling tier before committing to a full render, then move the approved shots to a cinematic generation. Cheap, fast drafts keep directors and agencies iterating without burning the production budget.

Mixed-Version Pipelines with the Kling API

Route different jobs to different Kling generations inside one automated pipeline, polling them the same asynchronous way through the Kling API. This lets a product use the fast tier for user-facing generation and a flagship tier for premium exports, all on one key.

How the Kling API Compares

See how the Kling API lines up against other leading video models on Atlas Cloud by provider, model range, inputs, and audio, so you can weigh a full lineup against single-model families, all under one key.

Model	Provider	Model Range	Inputs	Native Audio
Kling	Kuaishou	Full lineup, v1.6 to 3.0 and O3	Text, image, video	Yes
Seedance	ByteDance	Lite, Pro, and 2.0 tiers	Text, image, video, audio	Yes
Wan	Alibaba	Open 2.2 family (MoE)	Text, image, video	No
Hailuo	MiniMax	02 and 2.3 lines	Text, image	No
Veo 3.1	Google	Single flagship line	Text, image	Yes

How to Use Kling on Atlas Cloud

Get started in minutes — follow these simple steps to integrate and deploy models through Atlas Cloud's platform.

Create an Atlas Cloud Account

Sign up at atlascloud.ai and complete verification. New users receive free credits to explore the platform and test models.

Why Use Kling on Atlas Cloud

Combining the advanced Kling models with Atlas Cloud's GPU-accelerated platform provides unmatched performance, scalability, and developer experience.

Performance & flexibility

Low Latency:
GPU-optimized inference for real-time reasoning.

Unified API:
Run Kling, GPT, Gemini, and DeepSeek with one integration.

Transparent Pricing:
Predictable per-token billing with serverless options.

Enterprise & Scale

Developer Experience:
SDKs, analytics, fine-tuning tools, and templates.

Reliability:
99.99% uptime, RBAC, and compliance-ready logging.

Security & Compliance:
SOC 2 Type II, HIPAA alignment, data sovereignty in US.

Kling API FAQ

The Kling API gives developers Kuaishou's full lineup of Kling video models on Atlas Cloud through one OpenAI-compatible key. It spans generations from the efficient v1.6 to the 3.0 flagship and the Omni O3 reference model, covering text to video, image to video, and video editing. Rather than a single model, you get the whole family behind one integration, alongside 300+ other models on the same account.

The lineup covers several generations: v1.6 and v2.0 for efficient everyday work, the wider v2.1 range, v2.5 Turbo Pro for fast high-quality output, the v3.0 flagship, the Omni O3 reference and editing model, and a dedicated avatar model. Across these generations and their Standard, Pro, Turbo, and Master tiers, there are around 15 models to choose from. Each targets a different balance of speed, cost, and fidelity.

Match the generation to what the job needs. Use an older, efficient generation like v1.6 for high-volume drafts and cost-sensitive work, a mid generation like v2.1 for balanced production, and the v3.0 flagship when you need cinematic quality, native audio, or multilingual lip-sync. Because every generation sits behind the Kling API on one key, you can test a few and settle on the best fit without extra setup.

The tiers trade speed, cost, and fidelity within a generation. Standard is the everyday option, Pro raises detail and motion quality for finished work, Turbo prioritizes fast, low-latency generation, and Master denotes the top-quality tier on the generations that offer it. You can move a request between tiers by changing the model name, without reworking your integration.

Native audio and multilingual lip-sync are features of the newer generations, notably v3.0 and Omni O3, which produce synchronized sound in the same pass and match mouth movement across languages. Older generations focus on visual generation without native audio. If a project needs sound or dialogue, route it to a newer generation through the Kling API.

Across the family, the Kling API covers text to video and image to video, with newer generations adding video editing, reference to video, and start-to-last frame guidance. Image inputs use standard JPEG or PNG. The available modes depend on the generation you call, so a mixed pipeline can route each task to the model that supports the mode it needs.

Not always the latest. Older generations remain useful because they generate faster and cost less, which suits drafts, bulk social content, and high-volume testing where the newest fidelity is not required. A common pattern is to draft on an efficient generation, then re-render the approved shots on the flagship for final delivery, keeping both cost and quality in check.

Yes. Every Kling generation and tier sits behind the same OpenAI-compatible endpoint on Atlas Cloud, so switching versions is a change to the model name in your request. You do not need a separate account, SDK, or integration per version, which makes it easy to adopt newer generations as Kuaishou releases them.

Generation is asynchronous: each request returns a prediction ID that you poll until the clip is ready, and this works the same way across every generation. Route different jobs to different versions in one pipeline, add exponential backoff and a retry on a 429 response, and use faster tiers for high-volume work. Contact support to raise concurrency limits as your workload grows.

Create an account on Atlas Cloud, generate an API key, and send a request naming the Kling generation and tier you want, with your prompt or input image, through the OpenAI-compatible endpoint. Poll the prediction endpoint for the finished clip, then scale up as needed. Because the same key reaches every Kling version and 300+ other models, you can switch generations or try other models without extra setup.

Explore More Families

Seedance 2.0

The Seedance 2.0 API gives you production access to ByteDance's multimodal video model — quad-modal inputs (text, image, video, audio) and an industry-leading "Universal Reference" system that locks composition, camera movement, and character actions across shots. Integrate director-level control with one API call, a flat $0.09/s, instant key, and no waitlist — backed by enterprise-grade uptime and compliance. Seedance 2.0 Native 4K is now live!

View Family

Grok Imagine

The Grok Imagine API gives developers xAI's image, video, and audio generation in one suite. It produces up to 2K images with multilingual text rendering, plus video up to 15 seconds with native, synchronized audio and reference-based editing. On Atlas Cloud one key runs every Grok Imagine mode, so you move between image, video, and audio without separate setups, from $0.02 per image and $0.05 per second.

View Family

Gemini Omni Flash

The Gemini Omni API brings Google DeepMind's multimodal video generation and editing model, introduced at Google I/O 2026, to your stack. Gemini Omni fuses Gemini's reasoning engine with generative media, accepting any mix of text, images, video, and audio to produce consistent, knowledge-grounded output. Refine results through natural conversation, swapping objects, rewriting scenes, and shifting styles while physics, characters, and continuity stay intact. Atlas Cloud serves the full Gemini Omni Flash lineup, text-to-video, image-to-video with up to 7 reference images, and reference-to-video, through one unified API with transparent per-second pricing from $0.112 and no subscription. Start building today.

View Family

GPT Image 2

The GPT Image 2 API gives developers access to OpenAI's latest image model, the successor to GPT Image 1.5. It generates and edits images with accurate text rendering across Latin and CJK scripts, plus strong composition for posters, mockups, and infographics. On Atlas Cloud you reach it through one unified API alongside 300+ models, with free credits, 99.99% uptime, and no OpenAI organization verification required.

View Family

Google

Google's most powerful creative models are all available on Atlas Cloud. Veo 3.1 delivers cinematic video generation, Nano Banana 2 powers high-fidelity image creation, and Gemini brings multimodal intelligence to every workflow. Access the full Google model suite through one API key with Day-0 availability and pay-as-you-go pricing.

View Family

Seedance 2.0 Mini

The Seedance 2.0 Mini API is the lightest, lowest-cost tier of ByteDance's Seedance video line, built for teams where throughput and unit cost matter more than maximum polish. Use it for batch generation, rapid prototyping, and draft passes, all through one OpenAI-compatible key on Atlas Cloud.

View Family

ByteDance

From cinematic video generation to high-fidelity image creation, ByteDance's most powerful models are live on Atlas Cloud. Run Seedance and Seedream at scale with the lowest inference pricing and zero infrastructure overhead.

View Family

Alibaba

Atlas Cloud brings together Alibaba's full model lineup under one API: Qwen for language and image tasks, Wan for video generation up to 1080p. Access every model pay-as-you-go with no subscriptions. The Alibaba API is available via a single base URL using your existing OpenAI-compatible client.

View Family

OpenAI

Atlas Cloud gives you access to the full OpenAI API lineup, from GPT Image 2 for image generation to Sora 2 for video. Every model is available pay-as-you-go with no monthly commitment. Plug in with a single base URL swap using the OpenAI-compatible API.

View Family

xAI

Build complete image and video pipelines using the xAI API on Atlas Cloud. Generate at 2K, edit with reference images, and animate images into audio-synced clips.

View Family

Kwaivgi

The Kwaivgi API at 15% off standard rates. Day-0 access to every new Kling release, pay-as-you-go, no seat limits. One account covers the full Kling lineup.

View Family

Seedream 5.0 Pro

Seedream 5.0 Pro API gives developers ByteDance's controllable image editing model on Atlas Cloud. It places edits precisely with anchors and coordinates, separates images into editable layers, fuses multiple references, and matches exact colors and materials, with multilingual text at 2K and 3K. On Atlas Cloud you reach it through one key!

View Family

One API for All Media AI.

Explore all models