Van API for Day-0 1080p Video Generation

Van is a flagship AI video series built on the Wan 2.5 and 2.6 frameworks, and the Van API unifies its full text-to-video and image-to-video lineup on Atlas Cloud. Render cinematic clips up to 1080p, then pick speed-optimized 2.6 models for fast iteration or cost-efficient 2.5 variants for large batches. Every model ships on Day-0 behind one OpenAI-compatible key with transparent pay-as-you-go pricing. Start building today.

Explore the Leading Van

Atlas Cloud provides you with the latest industry-leading creative models.

Van-2.6 Text-to-video

A speed-optimized text-to-video option that prioritizes lower latency while retaining strong visual fidelity. Ideal for iteration, batch generation, and prompt testing.

Van-2.6 Image-to-video

A speed-optimized image-to-video option that prioritizes lower latency while retaining strong visual fidelity. Ideal for iteration, batch generation, and prompt testing.

Van-2.5 Image-to-video

Get animated visuals from your images faster without a major quality sacrifice. Perfect for preview workflows at scale or mass production of animated assets.

Van-2.5 Text-to-video

Convert prompts into cinematic video clips with synchronized sound. Van 2.5 generates 720p/1080p outputs with stable motion, native audio sync, and prompt-faithful visual storytelling.

From $0.068/sec

Van API Video Models Compared by Modality, Resolution, and Price

Match each text-to-video and image-to-video endpoint to your resolution, duration, and budget.

Endpoint	Description
Van-2.6 T2V API (Text To Video)	Turn plain text prompts into cinematic video up to 15 seconds long at resolutions reaching 1920x1080. Multi-shot sequencing and native audio generation run in a single pass, and the speed-optimized pipeline holds a standard $0.068 per second that suits fast iteration, batch runs, and prompt testing.
Van-2.6 I2V API (Image To Video)	Van-2.6 image-to-video animates a source image and guiding prompt into 1080p clips of 5, 10, or 15 seconds. With multi-shot control and automatic audio built in, it keeps the same $0.068 per-second rate that makes previews and high-volume asset generation practical.
Van-2.5 I2V API (Image To Video)	Need animated assets at scale without a premium price? This image-to-video endpoint converts a still into 720p or 1080p clips of 5 or 10 seconds at $0.054 per second, the lowest rate in the family. That economy fits preview workflows and mass production of animated content.
Van-2.5 T2V API (Text To Video)	Prompts become cinematic clips at 720p or 1080p with stable motion and synchronized native audio through Van-2.5 text-to-video. Running 5 or 10 seconds at $0.068 per second, it is built for prompt-faithful storytelling and social-ready video drafts.

Multi-Shot Narrative Control using Van-2.6 API

The Van-2.6 API lets storytellers generate multi-shot video sequences that mimic professional cinematic editing within a single generation task. By orchestrating multiple camera angles and shot transitions in one continuous clip of up to 15 seconds, it keeps the narrative consistent while delivering dynamic visual perspectives. It suits automated storyboarding, cinematic storytelling, and high-impact short-form video production.
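One way to keep multi-shot prompts consistent is to assemble them from structured shot data. The sketch below follows the "Scene N, start-end s" layout used by the sample prompts on this page; the `Shot` type and `build_multishot_prompt` helper are hypothetical names for illustration, not part of the Van API.

```python
from dataclasses import dataclass


@dataclass
class Shot:
    """One shot in a multi-shot prompt (hypothetical helper type)."""
    start_s: int
    end_s: int
    description: str


def build_multishot_prompt(intro: str, shots: list[Shot], requirements: list[str]) -> str:
    """Assemble a timestamped multi-shot prompt as plain text."""
    # Each shot must start where the previous one ends, so the timeline
    # has no gaps or overlaps.
    for prev, cur in zip(shots, shots[1:]):
        if cur.start_s != prev.end_s:
            raise ValueError(
                f"shot starting at {cur.start_s}s does not follow a shot ending at {prev.end_s}s"
            )
    lines = [intro]
    for i, shot in enumerate(shots, start=1):
        lines.append(f"Scene {i}, {shot.start_s}-{shot.end_s}s: {shot.description}")
    if requirements:
        lines.append("Requirements:")
        lines.extend(f"- {req}" for req in requirements)
    return "\n".join(lines)


prompt = build_multishot_prompt(
    "Create a 10-second cinematic commercial video for a compact AI translation device.",
    [
        Shot(0, 2, "Clean studio macro close-up of the device on a white desk."),
        Shot(2, 5, "A hand picks up the device and taps the screen."),
        Shot(5, 8, "Medium shot in a bright airport lounge."),
        Shot(8, 10, "Final hero shot; camera slowly pulls back."),
    ],
    ["Maintain the exact same device design across all shots", "1080p"],
)
print(prompt)
```

Building the prompt from data makes it easy to retime or reorder shots without hand-editing timestamps.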

High-Resolution Visual Fidelity using Van Model API

The Van Model API elevates the Wan 2.5 and 2.6 frameworks into stunning high-resolution outputs with absolute content consistency. By refining cinematic 3D VAE textures and pixel-perfect clarity, it delivers professional-grade video quality that surpasses original benchmarks. It is the definitive choice for premium storytelling, sharp detail restoration, and high-end cinematic production.

Extreme Inference Efficiency using Van Model API

The Van Model API leverages proprietary compute distillation to shatter the "quality-to-cost" barrier with lightning-fast inference speeds. By optimizing the Flow Matching architecture, it enables massive-scale generation at a fraction of traditional operational costs. It is the ultimate solution for high-frequency enterprise workflows, budget-conscious scaling, and rapid creative iteration.

Unconstrained Cinematic Dynamics using Van Model API

The Van Model API offers unparalleled creative freedom by reducing model constraints while maintaining complex 3D VAE motion dynamics. By deeply understanding fluid physics and intricate camera language, it enables developers to craft unrestricted, high-impact cinematic sequences. It is the premier engine for innovative visual experimentation, complex scene transitions, and boundary-pushing artistic expression.

Synchronized Sound in a Single Pass

Generate dialogue, ambient sound, and background music alongside the picture in a single request, with audio enabled by default across the 2.5 and 2.6 endpoints. Because sound is aligned to on-screen motion, speaking characters stay in sync and each scene carries its own atmosphere. Supply an external audio track whenever you need to steer timing. The payoff is sound-complete clips ready for social, ads, or product demos without a separate scoring step.

Two Ways In: Text or Image

Start from a written prompt or animate an existing still, since every tier ships both text-to-video and image-to-video endpoints under one key. Text-to-video composes a scene from description alone, while image-to-video preserves the details of your source frame and sets them in motion. Both paths share the same audio, resolution, and duration controls. Teams can storyboard from words, then bring finished artwork or product photography to life through an identical workflow.

Formats for Every Screen

Whether you need a widescreen 1920x1080 master, a 1080x1920 vertical cut, or a 1440x1440 square, output dimensions are set by a single parameter, and clips can run 5, 10, or 15 seconds on the 2.6 tier. Resolution scales from 720p up to full 1080p to balance quality against budget. One prompt can therefore feed a widescreen pre-roll, a vertical story, and a square feed post without re-editing, keeping multi-platform campaigns on one pipeline.
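As a rough illustration of the format options above, the following sketch maps a layout name to the dimensions quoted on this page and checks the clip length against each tier's listed durations. The `size` and `duration` field names and the "WIDTHxHEIGHT" string format are assumptions; check the parameter schema of each model for the real names.

```python
# Output dimensions named in the text above. The "WIDTHxHEIGHT" string
# format is an assumption, not a documented API value.
SIZES = {
    "landscape": "1920x1080",
    "portrait": "1080x1920",
    "square": "1440x1440",
}

# Clip lengths per tier, as listed on this page.
DURATIONS = {
    "2.6": (5, 10, 15),
    "2.5": (5, 10),
}


def output_settings(tier: str, layout: str, duration_s: int) -> dict:
    """Return hypothetical output parameters after validating them."""
    if tier not in DURATIONS:
        raise ValueError(f"unknown tier: {tier}")
    if layout not in SIZES:
        raise ValueError(f"unknown layout: {layout}")
    if duration_s not in DURATIONS[tier]:
        raise ValueError(f"Van {tier} clips run {DURATIONS[tier]} seconds, got {duration_s}")
    return {"size": SIZES[layout], "duration": duration_s}


# One concept, three cuts: widescreen, vertical, and square.
variants = {layout: output_settings("2.6", layout, 10) for layout in SIZES}
print(variants)
```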

Van vs Other Models - One Prompt

The same prompts, rendered by Van and other leading video models, covering commercial advertisement and cinematic video.

Prompt

Create a 10-second cinematic commercial video for a compact AI translation device.

Scene 1, 0-2s: Clean studio macro close-up of a small handheld translation device on a white desk. Smooth matte metal body, tiny screen, rounded edges, soft daylight reflections. Slow push-in camera movement.

Scene 2, 2-5s: A hand picks up the device and taps the screen. The screen lights up with a simple waveform animation. Add a soft touch sound and a subtle startup chime. Keep the device design, screen position, size, and material exactly consistent.

Scene 3, 5-8s: Medium shot in a bright airport lounge. A traveler holds the same device while speaking to another person. The device shows a glowing translation waveform. Natural hand motion, clean background, realistic ambient airport sound.

Scene 4, 8-10s: Final hero shot: the device stands upright on the desk again, screen glowing softly, with a clean product tagline space above it. Camera slowly pulls back. Minimal, premium technology commercial look.

Requirements:
- Maintain the exact same translation device design across all shots
- Keep the screen shape, button position, metal texture, and product scale consistent
- Use readable cinematic camera control: macro close-up, hand interaction, lifestyle usage shot, final hero shot
- Bright clean lighting, realistic reflections, stable product geometry
- Synchronize audio with action: screen tap, startup chime, soft voice waveform, airport ambience
- No warped hands, no changing device shape, no inconsistent screen layout
- High-end tech commercial style, clean composition, 1080p

[Video outputs compared: Van 2.6, HappyHorse 1.0, Wan 2.7]

Prompt

Create a 10-second cinematic commercial video for an AI cloud platform dashboard.

Scene 1, 0-2s: Close-up of a laptop screen showing a clean AI cloud dashboard interface. Floating data cards display model usage, GPU status, and video generation progress. Bright modern office lighting, shallow depth of field, slow push-in camera movement.

Scene 2, 2-5s: A developer’s hand types a short command on the keyboard. The dashboard instantly updates: a progress bar moves smoothly from 20% to 100%, and small preview thumbnails appear on the screen. Add soft keyboard sounds and a subtle interface notification chime.

Scene 3, 5-8s: Medium shot of the developer looking at the generated video preview on the laptop. The office background remains clean and consistent. The same dashboard layout, laptop design, and UI color structure must stay stable.

Scene 4, 8-10s: Final hero shot: the laptop is centered on the desk, showing the completed AI video result on the dashboard. Camera slowly pulls back. Leave clean empty space on the right side for a product slogan or logo.

Requirements:
- Maintain the same laptop, dashboard layout, UI panels, and office setting across all shots
- Use cinematic camera control: screen close-up, keyboard interaction, medium user shot, final hero pullback
- Keep UI elements readable and visually consistent, without random text distortion
- Synchronize audio with action: keyboard typing, progress completion chime, soft office ambience
- Clean tech commercial style, realistic reflections, stable screen geometry
- No warped hands, no changing laptop shape, no inconsistent dashboard design
- Bright modern SaaS advertising look, polished composition, 1080p

[Video outputs compared: Van 2.6, HappyHorse 1.0, Wan 2.7]

Where Teams Put the Van API to Work

From sound-synced social clips and high-volume ad production to cinematic previsualization, the Van API turns text and images into fast, low-cost, high-resolution video across every stage of a creative pipeline.

Sound-Synced Social Content with the Van API

Van 2.5 converts prompts into 720p and 1080p clips with native audio sync, producing voiceover, music, and ambient effects in a single pass. Social teams launch campaign videos without a separate sound editing workflow.

High-Volume Marketing Asset Production

Speed-optimized Van 2.6 keeps latency low across large batches, so entire prompt sets render in one sitting. This suits agencies producing dozens of ad variants and localized cuts on tight campaign deadlines.

E-Commerce Product Animation using the Van API

Animate static catalog photos into dynamic scenes with Van 2.5 image-to-video, the most affordable option in the family. Online stores turn flat product shots into scroll-ready motion for listings and ads.

Rapid Creative Iteration and Prompt Testing

Testing a concept before committing render budget? Van 2.6 keeps latency low for fast iteration, so creators preview prompts, refine shots, and explore multiple creative directions before a costly final pass.

Cinematic Storyboarding and Previsualization

High-resolution cinematic sequences come straight from a text prompt through Van 2.6, giving filmmakers fluid motion and consistent framing. Directors and previs artists visualize scenes early, mapping shots and pacing before a physical shoot.

Multi-Platform Short-Form Video with the Van API

With 720p and 1080p output in both landscape and portrait, Van 2.5 tailors each clip to its channel. Creators spin one concept into widescreen and vertical cuts for YouTube, TikTok, and Reels.

Van API Versus Today's Leading Video Generation Models

Line up the Van API against the top video engines on Atlas Cloud and see how it matches their resolution, duration, and native audio at a lower per-second price.

Model	Provider	Max Resolution	Max Duration	Native Audio	Price (per second)
Van-2.6 API (Text to Video)	Atlas Cloud	1080p	15s	√	$0.068 / sec
Van-2.6 API (Image to Video)	Atlas Cloud	1080p	15s	√	$0.068 / sec
Van-2.5 API (Text to Video)	Atlas Cloud	1080p	10s	√	$0.068 / sec
Van-2.5 API (Image to Video)	Atlas Cloud	1080p	10s	-	$0.054 / sec
Wan-2.7 API (Text to Video)	Alibaba	1080p	15s	√	$0.10 / sec
Seedance 2.0 API (Text to Video)	ByteDance	1080p	15s	√	$0.112 / sec
Kling v3.0 Pro API (Text to Video)	Kuaishou	1080p	15s	√	$0.112 / sec
Veo 3.1 API (Text to Video)	Google	4K	8s	√	$0.20 / sec
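To turn the per-second rates in the table into a per-clip or per-batch budget, a small calculation like the one below may help. It assumes a flat per-second rate with no resolution surcharge, which is a simplification, since output settings can affect the final charge; the short model labels are illustrative keys, not API model identifiers.

```python
from decimal import Decimal

# Per-second prices copied from the comparison table above.
PRICE_PER_SECOND = {
    "Van-2.6 T2V": Decimal("0.068"),
    "Van-2.6 I2V": Decimal("0.068"),
    "Van-2.5 T2V": Decimal("0.068"),
    "Van-2.5 I2V": Decimal("0.054"),
    "Wan-2.7 T2V": Decimal("0.10"),
    "Seedance 2.0 T2V": Decimal("0.112"),
    "Kling v3.0 Pro T2V": Decimal("0.112"),
    "Veo 3.1 T2V": Decimal("0.20"),
}


def clip_cost(model: str, seconds: int, clips: int = 1) -> Decimal:
    """Estimated cost in USD, assuming a flat per-second rate."""
    return PRICE_PER_SECOND[model] * seconds * clips


# Estimated cost of one 10-second clip on each text-to-video model.
for name in PRICE_PER_SECOND:
    if name.endswith("T2V"):
        print(f"{name}: ${clip_cost(name, 10)}")
```

`Decimal` is used instead of floats so that small per-second rates multiply into exact dollar amounts across large batches.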

How to Use Van on Atlas Cloud

Get started in minutes — follow these simple steps to integrate and deploy models through Atlas Cloud's platform.

Create an Atlas Cloud Account

Sign up at atlascloud.ai and complete verification. New users receive free credits to explore the platform and test models.

Why Use Van on Atlas Cloud

Combining the advanced Van models with Atlas Cloud's GPU-accelerated platform provides unmatched performance, scalability, and developer experience.

Performance & flexibility

Low Latency:
GPU-optimized inference for fast generation.

Unified API:
Run Van, GPT, Gemini, and DeepSeek with one integration.

Transparent Pricing:
Predictable pay-as-you-go billing with serverless options.

Enterprise & Scale

Developer Experience:
SDKs, analytics, fine-tuning tools, and templates.

Reliability:
99.99% uptime, RBAC, and compliance-ready logging.

Security & Compliance:
SOC 2 Type II, HIPAA alignment, data sovereignty in the US.

Van API FAQ: What Developers Ask Before Building

What is the Van API?

The Van API is Atlas Cloud's endpoint for the Van family of AI video models, built on the Wan 2.5 and 2.6 frameworks. It converts text prompts and static images into high-resolution cinematic video, using 3D VAE and Flow Matching to keep motion fluid and content consistent. One OpenAI-compatible key gives you Day-0 access to every model in the series with pay-as-you-go pricing.

What can I create with Van?

Van covers both text-to-video and image-to-video generation, so you can direct a scene from a written prompt or animate an existing image into motion. Creators use it for cinematic storytelling, marketing clips, character animation, and rapid creative prototyping. Because inference is speed-optimized, it suits high-frequency workflows that would be slow or costly on heavier models.

Which models are in the Van family?

Four models sit under the Van family: Van-2.6 and Van-2.5, each offered in text-to-video and image-to-video modes. The 2.6 models prioritize lower latency while retaining strong visual fidelity, and the 2.5 models add cost-effective options for preview workflows and large-scale asset production. You reach all of them through the same endpoint without switching keys or SDKs.

How do I get started with the Van API?

Sign up on Atlas Cloud, generate one API key, and point your existing OpenAI-compatible client at the Van endpoint. Send a prompt or a source image along with your chosen model name, and the API returns a generated video. There is no separate onboarding per model, so you can start building today across the whole Van family.

What resolutions does Van 2.5 support?

Van 2.5 generates video at 720p and 1080p with stable motion and prompt-faithful visuals. Higher resolutions and longer clips consume more compute, so your output settings influence generation cost. Check the playground parameter schema for each model to confirm the exact resolution and duration options it exposes.

Does Van generate audio?

Yes. Van 2.5 produces native audio synced to the visuals in a single pass, so a single prompt can return a finished clip with sound rather than a silent draft. That removes the manual step of aligning a separate soundtrack after generation.

How much does the Van API cost?

The Van API uses transparent pay-as-you-go pricing with no subscription. Van-2.5 image-to-video starts at $0.054 per second, while Van-2.5 text-to-video, Van-2.6 text-to-video, and Van-2.6 image-to-video are priced at $0.068 per second. You pay only for what you generate, and output settings such as resolution and length affect the final charge.

How does Van relate to Wan?

Van is Atlas Cloud's optimized flagship series built on the Wan 2.5 and 2.6 architectures. Through proprietary compute distillation it targets faster inference and lower operational cost while preserving the base model's motion and logic. In practice you get the Wan foundation delivered through one Van API with consistent pricing and Day-0 access.

What is the difference between Van 2.6 and Van 2.5?

Van 2.6 is the newer generation, tuned for lower latency while holding strong visual fidelity, which suits iteration, batch generation, and prompt testing. Van 2.5 remains a strong choice when cost matters most, with image-to-video priced at the lowest point in the family. Both are reachable through the same endpoint, so you can switch models by name as project needs change.

Explore More Families

Seedance 2.0

The Seedance 2.0 API gives you production access to ByteDance's multimodal video model — quad-modal inputs (text, image, video, audio) and an industry-leading "Universal Reference" system that locks composition, camera movement, and character actions across shots. Integrate director-level control with one API call, a flat $0.09/s, instant key, and no waitlist — backed by enterprise-grade uptime and compliance. Seedance 2.0 Native 4K is now live!

Grok Imagine

The Grok Imagine API gives developers xAI's image, video, and audio generation in one suite. It produces up to 2K images with multilingual text rendering, plus video up to 15 seconds with native, synchronized audio and reference-based editing. On Atlas Cloud one key runs every Grok Imagine mode, so you move between image, video, and audio without separate setups, from $0.02 per image and $0.05 per second.

Gemini Omni Flash

The Gemini Omni API brings Google DeepMind's multimodal video generation and editing model, introduced at Google I/O 2026, to your stack. Gemini Omni fuses Gemini's reasoning engine with generative media, accepting any mix of text, images, video, and audio to produce consistent, knowledge-grounded output. Refine results through natural conversation, swapping objects, rewriting scenes, and shifting styles while physics, characters, and continuity stay intact. Atlas Cloud serves the full Gemini Omni Flash lineup, text-to-video, image-to-video with up to 7 reference images, and reference-to-video, through one unified API with transparent per-second pricing from $0.112 and no subscription. Start building today.

GPT Image 2

The GPT Image 2 API gives developers access to OpenAI's latest image model, the successor to GPT Image 1.5. It generates and edits images with accurate text rendering across Latin and CJK scripts, plus strong composition for posters, mockups, and infographics. On Atlas Cloud you reach it through one unified API alongside 300+ models, with free credits, 99.99% uptime, and no OpenAI organization verification required.

Google

Google's most powerful creative models are all available on Atlas Cloud. Veo 3.1 delivers cinematic video generation, Nano Banana 2 powers high-fidelity image creation, and Gemini brings multimodal intelligence to every workflow. Access the full Google model suite through one API key with Day-0 availability and pay-as-you-go pricing.

Seedance 2.0 Mini

The Seedance 2.0 Mini API is the lightest, lowest-cost tier of ByteDance's Seedance video line, built for teams where throughput and unit cost matter more than maximum polish. Use it for batch generation, rapid prototyping, and draft passes, all through one OpenAI-compatible key on Atlas Cloud.

ByteDance

From cinematic video generation to high-fidelity image creation, ByteDance's most powerful models are live on Atlas Cloud. Run Seedance and Seedream at scale with the lowest inference pricing and zero infrastructure overhead.

Alibaba

Atlas Cloud brings together Alibaba's full model lineup under one API: Qwen for language and image tasks, Wan for video generation up to 1080p. Access every model pay-as-you-go with no subscriptions. The Alibaba API is available via a single base URL using your existing OpenAI-compatible client.

OpenAI

Atlas Cloud gives you access to the full OpenAI API lineup, from GPT Image 2 for image generation to Sora 2 for video. Every model is available pay-as-you-go with no monthly commitment. Plug in with a single base URL swap using the OpenAI-compatible API.

xAI

Build complete image and video pipelines using the xAI API on Atlas Cloud. Generate at 2K, edit with reference images, and animate images into audio-synced clips.

Kwaivgi

The Kwaivgi API at 15% off standard rates. Day-0 access to every new Kling release, pay-as-you-go, no seat limits. One account covers the full Kling lineup.

Seedream 5.0 Pro

Seedream 5.0 Pro API gives developers ByteDance's controllable image editing model on Atlas Cloud. It places edits precisely with anchors and coordinates, separates images into editable layers, fuses multiple references, and matches exact colors and materials, with multilingual text at 2K and 3K. On Atlas Cloud you reach it through one key!

One API for All Media AI.
