
The Kwaivgi API at 15% off standard rates. Day-0 access to every new Kling release, pay-as-you-go, no seat limits. One account covers the full Kling lineup.
Generate cinematic, high-fidelity videos from text and images with the latest AI video generation models on Atlas Cloud.

Kuaishou’s flagship video generation suite, Kling 3.0, features two powerhouse models—Kling 3.0 (Upgraded from Kling 2.6) and Kling 3.0 Omni (Kling O3, Upgraded from Kling O1)—both offering high-fidelity native audio integration. While Kling 3.0 excels in intelligent cinematic storytelling, multilingual lip-syncing, and precision text rendering, Kling O3 sets a new standard for professional-grade subject consistency by supporting custom subjects and voice clones derived from video or image inputs. Together, these models provide a comprehensive solution tailored for cinematic narratives, global marketing campaigns, social media content, and digital skit production.
Explore Kling3.0
Kling AI is a text-to-video model developed by Kuaishou that creates realistic, high-quality videos from text prompts. It focuses on smooth motion, stable frames, and natural-looking scenes. Kling works well for short videos, ads, and marketing content, helping creators save time and reduce production costs. With strong performance in video consistency and realism, Kling AI is becoming a popular choice in the AI video generation space.
Explore KlingCompare standard vs. our pricing across every Kwaivgi model.
| Model | Standard Price (USD) | Our Price (USD) | Discount | |
|---|---|---|---|---|
| Kling v3.0 Std Image-to-Video | $0.084 | Start from$0.071/s video | -15% | View |
| Kling v3.0 Pro Image-to-Video | $0.112 | Start from$0.095/s video | -15% | View |
| Kling v3.0 Pro Text-to-Video | $0.112 | Start from$0.095/s video | -15% | View |
| Kling v3.0 Std Text-to-Video | $0.084 | Start from$0.071/s video | -15% | View |
| Kling Video O3 Pro Video-Edit | $0.168 | Start from$0.143/s video | -15% | View |
| Kling Video O3 Pro Reference-to-Video | $0.112 | Start from$0.095/s video | -15% | View |
Instantly explore and experiment with 300+ production-ready models in the Atlas Playground. Start customizing with one click.
Kling's model lineup covers video generation, motion transfer, and lip sync from a single API. Teams use these capabilities together to automate video production that would otherwise require physical shoots, voice-over studios, or motion capture rigs.
Marketing teams use Kling Motion Control to map a single human performance onto multiple virtual characters, producing on-brand videos at scale without re-shooting. One reference clip captures the gestures, expressions, and pacing, and the model transfers that performance to each character while preserving their appearance. This reduces the cost of producing consistent brand video from thousands of dollars per clip to a few API calls.
Brands re-voice existing video ads for different markets using Kling Lipsync, replacing the original audio with a localized track and regenerating matching lip movements. The result is a localized video that looks natively shot rather than dubbed. Atlas Cloud's pay-as-you-go pricing makes it practical to produce language variants for every market without a per-language production budget.
Development teams use Kling v2.6's text-to-video endpoint to generate large batches of short clips for TikTok, Reels, and YouTube Shorts from script variations. Native audio generation means each clip comes with synchronized sound in a single API call, removing the post-production step. Atlas Cloud's per-second pricing lets teams scale output volume without committing to a monthly plan.
HR and L&D teams build pipelines that turn slide decks and scripts into polished training videos using Kling Lipsync and Kling Video O1. A presenter image paired with a voiceover track produces a consistent on-screen speaker across every module. Production time drops from weeks to hours, and Atlas Cloud handles the API infrastructure without requiring on-premise setup.
Filmmakers and game studios use Kling Motion Control to animate characters by uploading a reference clip of an actor performing a scene. The model transfers the actor's movements to the target character while keeping their appearance intact, supporting outputs up to 30 seconds. This replaces early-stage motion capture work for storyboarding, pre-vis, and cinematic prototyping.
E-commerce and DTC brands use Kling v2.6's image-to-video endpoint to animate product photos into short video clips with automatically generated ambient sound and motion. A single product image becomes a 5-to-10-second clip ready for ads and product pages without a video shoot. The flat per-second pricing on Atlas Cloud scales cleanly across large product catalogs.
Kling v2.6 is the right choice for standard text-to-video and image-to-video generation, and it includes native audio output in a single API call. Kling Video O1 is built for workflows that combine generation and editing: it uses multimodal visual language technology to keep subjects consistent across shots and includes a video-edit endpoint for modifying existing clips. Start with v2.6 for new content creation and reach for Video O1 when subject consistency or post-generation editing is a requirement.
Kling v2.6 generates audio alongside the video in a single API call. The audio layer covers three categories: natural speech in Chinese and English, action-synchronized sound effects, and environmental ambience. No separate audio pipeline or post-production step is needed.
Kling Lipsync takes a reference image or video of a face and an audio file, then generates a video with lip movements synchronized to the audio. It produces expressive, lifelike mouth motion matched frame-by-frame to the spoken content. Check the Atlas Cloud Kling Lipsync model page for the current list of supported audio formats before integrating.
Kling v2.6 Motion Control transfers movements from a reference video onto a static character image. You provide a character photo and a source clip containing the motion you want to apply, and the model maps the movements to your subject while preserving their appearance. It supports outputs from 3 to 30 seconds and works well for dance sequences, gesture replication, and character animation.
For prototyping, Kling v2.5 Turbo Pro generates at 2x speed and costs $0.06 per second on Atlas Cloud, which makes iteration fast and cheap. Kling v1.6 Standard at $0.048 per second is a lower-cost option for basic drafts. For production, Kling v2.1 Master at $0.238 per second delivers cinematic 1080p output with precise motion continuity.
Kling Effects takes a single image and generates a 5-second video with stylistic motion applied to the scene. It adds post-processing and cinematic movement to static images, making it useful for product showcases and social media content where you want motion without a full video shoot. It is priced at $0.212 per second on Atlas Cloud, a 15% discount from the standard rate.
Yes. A common pattern is to generate a clip with Kling v2.6 and pass it to Kling Video O1's video-edit endpoint for instruction-based editing in a second call. You can also feed a character image and a reference motion clip into Kling Motion Control to transfer specific movements to your subject. All Kling endpoints on Atlas Cloud share the same API key, so no additional authentication is needed between steps.
Every Kling model on Atlas Cloud is priced at 15% below the standard Kwaivgi API rate. Prices range from $0.048 per second for Kling v1.6 Standard to $0.238 per second for Kling v2.1 Master. All models are available pay-as-you-go with no monthly minimums or seat commitments.
Guides, tutorials, and product updates to help you get the most out of Atlas Cloud.
Join the Discord community for the latest model updates, prompts, and support.