Wan2.6 Video Models

Wan 2.6 is Alibaba’s state-of-the-art multimodal video generation model, capable of producing high-fidelity, audio-synchronized videos from text or images. It creates videos of up to 15 seconds while preserving narrative flow and visual integrity, which makes it well suited to YouTube Shorts, Instagram Reels, Facebook clips, and TikTok videos.

Explore the Leading Wan2.6 Video Models

Wan-2.5 Video Extend

Extend your videos, including audio, with Alibaba's Wan 2.5 video extender model.

$0.2000/s video

Wan-2.5 Text-to-video

A speed-optimized text-to-video option that prioritizes lower latency while retaining strong visual fidelity. Ideal for iteration, batch generation, and prompt testing.

$0.0400/s video

Wan-2.5 Image-to-video

Bring static images to life with dynamic motion, lighting consistency, and synchronized audio. This variant smoothly animates reference visuals into short video sequences.

$0.0400/s video
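
The per-second prices listed above can be turned into a rough budget estimate. The sketch below assumes billing is strictly linear in seconds of output video, with no minimum charge, rounding, or volume discount; the page does not confirm any of this. The model keys and the `estimate_cost` helper are hypothetical names chosen for illustration, not Atlas Cloud identifiers.

```python
from decimal import Decimal

# Rates copied from the listing above, in USD per second of output video.
# The dictionary keys are hypothetical labels, not official model IDs.
RATES_USD_PER_SECOND = {
    "wan-2.5-video-extend": Decimal("0.2000"),
    "wan-2.5-text-to-video": Decimal("0.0400"),
    "wan-2.5-image-to-video": Decimal("0.0400"),
}


def estimate_cost(model: str, seconds: int, clips: int = 1) -> Decimal:
    """Estimate the cost of generating `clips` videos of `seconds` each.

    Assumption: cost = rate * seconds * clips, with no minimum charge.
    """
    if seconds <= 0 or clips <= 0:
        raise ValueError("seconds and clips must be positive")
    return RATES_USD_PER_SECOND[model] * seconds * clips


if __name__ == "__main__":
    # One 15-second text-to-video clip.
    print(estimate_cost("wan-2.5-text-to-video", 15))
    # A batch of twenty 5-second clips for prompt testing.
    print(estimate_cost("wan-2.5-text-to-video", 5, clips=20))
    # Extending a video by 10 seconds.
    print(estimate_cost("wan-2.5-video-extend", 10))
```

`Decimal` is used instead of `float` so that the four-decimal prices multiply without binary rounding error. Check the pricing page for actual billing rules before relying on these numbers.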

What Makes Wan2.6 Video Models Stand Out

Native A/V Sync

Generates perfectly aligned visuals and sound without extra editing.

Unified Multimodal Core

Handles text, image, video, and audio in one seamless model.

Video Customization

Offers more granular control over video style, camera logic, lighting effects, and framing.

Multi-Voice Singing

Wan 2.6 generates singing with multiple voices, making it a strong tool for music creation.

Flexible Prompt Inputs

Understands Chinese, English, and other languages natively.

Cinematic Control

Direct camera motion, pacing, and composition right from your prompt.

What You Can Do with Wan2.6 Video Models

Generate cinematic clips up to 15 seconds long at 480p, 720p, or 1080p resolution.

Sync visuals and audio in one pass: lip-sync, dialogue, sound effects, and background music come together automatically.

Replace, add, or remove objects and subjects, adjust scenes, or modify the music video style.

Localize with multilingual prompts: Chinese, English, and more are supported natively.

Preview ideas rapidly with a fast-iteration model variant to test concepts before a full render.

Why Use Wan2.6 Video Models on Atlas Cloud

Combining the advanced Wan2.6 Video Models with Atlas Cloud's GPU-accelerated platform provides strong performance, scalability, and developer experience.

Reality in motion. Powered by Atlas Cloud, Wan 2.6 brings life, light, and emotion into every frame with natural precision.

Performance & flexibility

Low Latency:
GPU-optimized inference for real-time reasoning.

Unified API:
Run Wan2.6 Video Models, GPT, Gemini, and DeepSeek with one integration.

Transparent Pricing:
Predictable per-token billing with serverless options.

Enterprise & Scale

Developer Experience:
SDKs, analytics, fine-tuning tools, and templates.

Reliability:
99.99% uptime, RBAC, and compliance-ready logging.

Security & Compliance:
SOC 2 Type II, HIPAA alignment, and data sovereignty in the US.

Explore More Families

Wan2.6 Video Models

Flux.2 Image Models

The Flux.2 Series is a comprehensive family of AI image generation models. Across the lineup, Flux supports text-to-image, image-to-image, reconstruction, contextual reasoning, and high-speed creative workflows.

Nano Banana Image Models

Nano Banana is a fast, lightweight image generation model for playful, vibrant visuals. Optimized for speed and accessibility, it creates high-quality images with smooth shapes, bold colors, and clear compositions—perfect for mascots, stickers, icons, social posts, and fun branding.

Image and Video Tools

Open, advanced large-scale image generative models that power high-fidelity creation and editing with modular APIs, reproducible training, built-in safety guardrails, and elastic, production-grade inference at scale.

LTX-2 Video Models

LTX-2 is a complete AI creative engine. Built for real production workflows, it delivers synchronized audio and video generation, 4K video at 48 fps, multiple performance modes, and radical efficiency, all with the openness and accessibility of running on consumer-grade GPUs.

Qwen Image Models

Qwen-Image is Alibaba’s open image generation model family. Built on advanced diffusion and Mixture-of-Experts design, it delivers cinematic quality, controllable styles, and efficient scaling, empowering developers and enterprises to create high-fidelity media with ease.

OpenAI Model Families

Explore OpenAI’s language and video models on Atlas Cloud: ChatGPT for advanced reasoning and interaction, and Sora-2 for physics-aware video generation.

Hailuo Video Models

MiniMax Hailuo video models deliver text-to-video and image-to-video at native 1080p (Pro) and 768p (Standard), with strong instruction following and realistic, physics-aware motion.

Wan2.5 Video Models

Wan 2.5 is Alibaba’s state-of-the-art multimodal video generation model, capable of producing high-fidelity, audio-synchronized videos from text or images. It delivers realistic motion, natural lighting, and strong prompt alignment across 480p to 1080p outputs—ideal for creative and production-grade workflows.

Sora-2 Video Models

The Sora-2 family from OpenAI is a next-generation line of video and audio generation models, enabling both text-to-video and image-to-video outputs with synchronized dialogue, sound effects, improved physical realism, and fine-grained control.

Kling Video Models

Kling is Kuaishou’s cutting-edge generative video engine that transforms text or images into cinematic, high-fidelity clips. It offers multiple quality tiers for flexible creation, from fast drafts to studio-grade output.

Veo3 Video Models

Veo is Google’s generative video model family, designed to produce cinematic-quality clips with natural motion, creative styles, and integrated audio. With options from fast, iterative variants to high-fidelity production outputs, Veo enables seamless text-to-video and image-to-video creation.

