alibaba/wan-2.6/video-to-video

影片轉影片

Wan 2.6 Video-to-Video API by Alibaba

alibaba/wan-2.6/video-to-video

Video-to-video

A speed-optimized video-to-video option that prioritizes lower latency while retaining strong visual fidelity. Ideal for iteration, batch generation, and prompt testing.

輸入

正在載入參數設定...

輸出

閒置

生成的影片將在這裡顯示

設定參數後點擊執行開始生成

每次執行將花費 $0.07。$10 可執行約 142 次。

你可以繼續：

Seedance 2.0 Kling v3 Vidu Wan2.7

參數

程式碼範例
import requests
import time

# Step 1: Start video generation
generate_url = "https://api.atlascloud.ai/api/v1/model/generateVideo"
headers = {
    "Content-Type": "application/json",
    "Authorization": "Bearer $ATLASCLOUD_API_KEY"
}
data = {
    "model": "alibaba/wan-2.6/video-to-video",
    "prompt": "A beautiful sunset over the ocean with gentle waves",
    "width": 512,
    "height": 512,
    "duration": 3,
    "fps": 24,
}

generate_response = requests.post(generate_url, headers=headers, json=data)
generate_result = generate_response.json()
prediction_id = generate_result["data"]["id"]

# Step 2: Poll for result
poll_url = f"https://api.atlascloud.ai/api/v1/model/prediction/{prediction_id}"

def check_status():
    while True:
        response = requests.get(poll_url, headers={"Authorization": "Bearer $ATLASCLOUD_API_KEY"})
        result = response.json()

        if result["data"]["status"] in ["completed", "succeeded"]:
            print("Generated video:", result["data"]["outputs"][0])
            return result["data"]["outputs"][0]
        elif result["data"]["status"] == "failed":
            raise Exception(result["data"]["error"] or "Generation failed")
        else:
            # Still processing, wait 2 seconds
            time.sleep(2)

video_url = check_status()

安裝

安裝所需的相依套件。

pip install requests

驗證

所有 API 請求都需要透過 API Key 進行認證。您可以在 Atlas Cloud 控制台取得 API Key。

export ATLASCLOUD_API_KEY="your-api-key-here"

HTTP 標頭

import os

API_KEY = os.environ.get("ATLASCLOUD_API_KEY")
headers = {
    "Content-Type": "application/json",
    "Authorization": f"Bearer {API_KEY}"
}

保護好您的 API Key

切勿在客戶端程式碼或公開儲存庫中暴露您的 API Key。請使用環境變數或後端代理。

提交請求

import requests

url = "https://api.atlascloud.ai/api/v1/model/generateVideo"
headers = {
    "Content-Type": "application/json",
    "Authorization": "Bearer $ATLASCLOUD_API_KEY"
}
data = {
    "model": "your-model",
    "prompt": "A beautiful landscape"
}

response = requests.post(url, headers=headers, json=data)
print(response.json())

提交請求

提交一個非同步生成請求。API 會傳回一個 prediction ID，您可以用它來檢查狀態與取得結果。

POST/api/v1/model/generateVideo

請求主體

import requests

url = "https://api.atlascloud.ai/api/v1/model/generateVideo"
headers = {
    "Content-Type": "application/json",
    "Authorization": "Bearer $ATLASCLOUD_API_KEY"
}

data = {
    "model": "alibaba/wan-2.6/video-to-video",
    "prompt": "A beautiful sunset over the ocean with gentle waves"
}

response = requests.post(url, headers=headers, json=data)
result = response.json()

print(f"Prediction ID: {result['data']['id']}")
print(f"Status: {result['data']['status']}")

回應

{
  "code": 200,
  "data": {
    "id": "pred_abc123",
    "status": "processing",
    "model": "model-name",
    "created_at": "2025-01-01T00:00:00Z"
  }
}

檢查狀態

輪詢 prediction 端點以檢查請求的當前狀態。

GET/api/v1/model/prediction/{prediction_id}

輪詢範例

import requests
import time

prediction_id = "pred_abc123"
url = f"https://api.atlascloud.ai/api/v1/model/prediction/{prediction_id}"
headers = { "Authorization": "Bearer $ATLASCLOUD_API_KEY" }

while True:
    response = requests.get(url, headers=headers)
    result = response.json()
    status = result["data"]["status"]
    print(f"Status: {status}")

    if status in ["completed", "succeeded"]:
        output_url = result["data"]["outputs"][0]
        print(f"Output URL: {output_url}")
        break
    elif status == "failed":
        print(f"Error: {result['data'].get('error', 'Unknown')}")
        break

    time.sleep(3)

狀態值

processing請求仍在處理中。

completed生成完成，輸出可用。

succeeded生成成功，輸出可用。

failed生成失敗，請檢查 error 欄位。

完成回應

{
  "data": {
    "id": "pred_abc123",
    "status": "completed",
    "outputs": [
      "https://storage.atlascloud.ai/outputs/result.mp4"
    ],
    "metrics": {
      "predict_time": 45.2
    },
    "created_at": "2025-01-01T00:00:00Z",
    "completed_at": "2025-01-01T00:00:10Z"
  }
}

上傳檔案

將檔案上傳到 Atlas Cloud 儲存空間，取得可在 API 請求中使用的 URL。使用 multipart/form-data 上傳。

POST/api/v1/model/uploadMedia

上傳範例

import requests

url = "https://api.atlascloud.ai/api/v1/model/uploadMedia"
headers = { "Authorization": "Bearer $ATLASCLOUD_API_KEY" }

with open("image.png", "rb") as f:
    files = {"file": ("image.png", f, "image/png")}
    response = requests.post(url, headers=headers, files=files)

result = response.json()
download_url = result["data"]["download_url"]
print(f"File URL: {download_url}")

回應

{
  "data": {
    "download_url": "https://storage.atlascloud.ai/uploads/abc123/image.png",
    "file_name": "image.png",
    "content_type": "image/png",
    "size": 1024000
  }
}

Input Schema

以下參數在請求主體中被接受。

總計: 0必填: 0選填: 0

暫無可用參數。

範例請求主體

{
  "model": "alibaba/wan-2.6/video-to-video"
}

Output Schema

API 傳回包含生成輸出 URL 的 prediction 回應。

idstringrequired

Unique identifier for the prediction.

statusstringrequired

Current status of the prediction.

processingcompletedsucceededfailed

modelstringrequired

The model used for generation.

outputsarray[string]

Array of output URLs. Available when status is "completed".

errorstring

Error message if status is "failed".

metricsobject

Performance metrics.

predict_timenumber

Time taken for video generation in seconds.

created_atstringrequired

ISO 8601 timestamp when the prediction was created.

Format: date-time

completed_atstring

ISO 8601 timestamp when the prediction was completed.

Format: date-time

範例回應

{
  "id": "pred_abc123",
  "status": "completed",
  "model": "model-name",
  "outputs": [
    "https://storage.atlascloud.ai/outputs/result.mp4"
  ],
  "metrics": {
    "predict_time": 45.2
  },
  "created_at": "2025-01-01T00:00:00Z",
  "completed_at": "2025-01-01T00:00:10Z"
}

Atlas Cloud Skills

Atlas Cloud Skills 將 300+ AI 模型直接整合到您的 AI 程式碼助手中。一條命令安裝，即可用自然語言生成圖片、影片，以及與 LLM 對話。

支援的客戶端

Claude Code

OpenAI Codex

Gemini CLI

Cursor

Windsurf

VS Code

Trae

GitHub Copilot

Cline

Roo Code

Amp

Goose

Replit

40+ 支援的客戶端

安裝

npx skills add AtlasCloudAI/atlas-cloud-skills

設定 API Key

從 Atlas Cloud 控制台取得 API Key，並將其設定為環境變數。

export ATLASCLOUD_API_KEY="your-api-key-here"

功能

安裝完成後，您可以在 AI 助手中使用自然語言存取所有 Atlas Cloud 模型。

圖片生成使用 Nano Banana 2、Z-Image 等模型生成圖片。

影片創作使用 Kling、Vidu、Veo 等從文字或圖片創建影片。

LLM 對話與 Qwen、DeepSeek 及其他大型語言模型對話。

媒體上傳上傳本機檔案用於圖片編輯和圖生影片工作流程。

MCP Server

Atlas Cloud MCP Server 透過 Model Context Protocol 將您的 IDE 與 300+ AI 模型連接。支援任何相容 MCP 的客戶端。

支援的客戶端

Cursor

VS Code

Windsurf

Claude Code

OpenAI Codex

Gemini CLI

Cline

Roo Code

100+ 支援的客戶端

安裝

npx -y atlascloud-mcp

設定

將以下設定新增到您的 IDE 的 MCP 設定檔中。

{
  "mcpServers": {
    "atlascloud": {
      "command": "npx",
      "args": [
        "-y",
        "atlascloud-mcp"
      ],
      "env": {
        "ATLASCLOUD_API_KEY": "your-api-key-here"
      }
    }
  }
}

可用工具

atlas_generate_image根據文字提示生成圖片。

atlas_generate_video從文字或圖片創建影片。

atlas_chat與大型語言模型對話。

atlas_list_models瀏覽 300+ 可用 AI 模型。

atlas_quick_generate一步式內容創建，自動選擇最佳模型。

atlas_upload_media上傳本機檔案用於 API 工作流程。

了解更多

github.com/AtlasCloudAI/mcp-server

API Schema

Schema 不可用

暫無可用範例

載入中...

🎬多鏡頭影片生成

Wan 2.6專業多鏡頭 AI 影片創作

阿里巴巴在 AI 影片生成領域的最新突破。創作最長 15 秒的 1080p 影片，支援多鏡頭敘事、參考影片驅動的角色一致性以及原生音畫同步。首個真正理解分鏡邏輯的影片生成模型，讓電影級敘事成為可能。

革命性突破

Wan 2.6 成為 AI 影片生成遊戲規則改變者的關鍵

多鏡頭敘事

首個理解分鏡邏輯的模型。自動生成連續鏡頭與連貫轉場，在場景變換中維持角色外觀與環境一致性——在單次 15 秒生成中實現完整故事弧線。

參考影片轉換 (R2V)

上傳 2-30 秒的參考影片，擷取並保留角色外觀、動作模式與聲音特徵。以前所未有的精準度在多支影片中創造一致的角色表演。

精準文字渲染

業界領先的文字渲染能力，適用於產品包裝、招牌與品牌內容。在影片畫面中生成清晰可讀的文字——對行銷與商業應用至關重要。

核心能力

延長至 15 秒時長

每支影片可生成長達 15 秒，具備完整「三幕式」結構（開端 → 衝突 → 解決）

專業 1080p 畫質

原生 1080p 輸出，24fps 電影級畫質，視覺穩定性大幅提升

原生音畫同步

對白與唇形同步，背景音樂契合節奏，音效完美觸發

角色一致性

跨鏡頭與多支影片維持角色外觀、服裝與身份一致

電影級鏡頭控制

專業攝影機運動，包含平移、縮放、追蹤鏡頭與推軌運動

彈性長寬比

16:9（YouTube）、9:16（限時動態）、1:1（方形）——針對平台優化，無需後製裁切

Wan 2.6 vs Wan 2.5：重大改進

看看最新版本的新功能

影片時長

最長 15 秒

Wan 2.5：最長 10 秒

多鏡頭能力

理解分鏡邏輯

Wan 2.5：單鏡頭或雜亂變形

參考影片支援

R2V 模式完整保留

Wan 2.5：僅圖片參考

角色一致性

跨鏡頭表現優異

Wan 2.5：角色漂移問題

動態穩定性

減少抖動與瑕疵

Wan 2.5：偶爾畫面偏移

提示理解

複雜多角色場景

Wan 2.5：基本場景生成

三種專業生成模式

為您的創作流程選擇合適模式

文字轉影片 (T2V)

最受歡迎

從文字提示生成完整影片，具備增強的多鏡頭分割與改進的提示處理。完美適用於故事敘述與創意探索。

從單一提示自動分鏡
多角色互動理解
鏡頭運動與情感提示
環境細節保留

圖片轉影片 (I2V)

增強版

將靜態圖片轉換為動態影片，動作連貫性大幅提升。適合產品展示、照片動畫與視覺敘事。

產品精準文字渲染
跨畫面風格一致
從靜態圖片自然動態化
敘事驅動的視覺優化

參考影片轉換 (R2V)

全新

上傳參考影片（2-30秒）保留角色外觀、動作模式與聲音。為角色驅動內容提供最強一致性保證。

完整角色身份保留
聲音特徵擷取
動作模式複製
多角色共演場景

完美適用於

行銷與廣告

具文字渲染的產品演示、角色一致的品牌行銷與宣傳影片

內容創作

YouTube 影片、社群媒體短片、多鏡頭故事敘述與影片編輯工作流程

電子商務

精準文字的產品展示、教學影片與客戶評價重現

教育與培訓

教學內容、課程教材與多場景教育敘事

娛樂

短片、角色驅動故事、電影級序列與創意實驗

前期視覺化

電影概念開發、分鏡創作與製作場景規劃

Wan 2.6 T2V、I2V 與 R2V API 整合

完整的文字轉影片、圖片轉影片與參考影片轉換 API 套件

文字轉影片 API (T2V API)

我們的 Wan 2.6 T2V API 將文字提示轉換為具自動場景分割的多鏡頭電影級影片。生成專業 1080p 影片，長達 15 秒，具備原生音畫同步。

從單一提示進行多鏡頭敘事

15 秒時長，具三幕式結構

增強複雜場景提示理解

彈性長寬比：16:9、9:16、1:1

圖片轉影片 API (I2V API)

我們的 Wan 2.6 I2V API 以精準動作控制與文字渲染賦予靜態圖片生命。完美適用於產品影片、照片動畫與品牌內容創作。

產品與招牌精準文字渲染

動畫畫面風格一致

改進連貫性的自然動態

敘事優化的視覺輸出

參考影片轉換 API (R2V API)

我們的 Wan 2.6 R2V API 從參考影片中保留角色身份。上傳 2-30 秒片段以擷取外觀、聲音與動作模式，實現一致的角色生成。

角色外觀與身份保留

聲音特徵擷取與複製

動作模式分析與重現

多角色場景支援

💡

完整 API 套件

所有三種 Wan 2.6 API 模式（T2V API、I2V API、R2V API）均支援 RESTful 架構與完整文件。使用 Python、Node.js 等 SDK 快速開始。每個端點均包含原生音畫同步與完整商業使用權。

如何開始使用 Wan 2.6

透過兩個簡單路徑，幾分鐘內開始創作專業影片

API 整合

適用於開發應用程式的開發者

註冊與登入

建立您的 Atlas Cloud 帳號或登入以存取控制台

新增付款方式

在帳單區域綁定您的信用卡為帳戶儲值

生成 API 金鑰

前往控制台 → API 金鑰並建立您的驗證金鑰

開始建置

使用 T2V、I2V 或 R2V API 端點將 Wan 2.6 整合到您的應用程式

Playground 體驗

適用於快速測試與實驗

註冊與登入

建立您的 Atlas Cloud 帳號或登入以存取平台

新增付款方式

在帳單區域綁定您的信用卡即可開始

使用 Playground

前往 Wan 2.6 playground，選擇 T2V/I2V/R2V 模式，立即生成影片

💡

專業提示：先在 Playground 測試不同生成模式，了解哪種最適合您的使用情境，再整合相應的 API 以擴展到生產規模。:

常見問題

Wan 2.6 的多鏡頭能力有何獨特之處？

Wan 2.6 是首個真正理解分鏡邏輯的模型。不同於 Wan 2.5 會產生雜亂的「變形」效果，Wan 2.6 可自動將單一提示分割為多個清晰鏡頭，具連貫轉場，並在場景變換中維持角色一致性。

參考影片轉換 (R2V) 如何運作？

上傳 2-30 秒的參考影片，Wan 2.6 會擷取角色的外觀、動作模式與聲音特徵。您可以生成具相同角色的新影片，身份保持一致——適合創作角色驅動的系列內容。

支援哪些影片格式與時長？

Wan 2.6 生成 1080p 影片，24fps，時長從 5 到 15 秒。支援的長寬比包括 16:9（YouTube）、9:16（Instagram Reels/TikTok）與 1:1（方形格式），針對各平台優化，無需後製裁切。

Wan 2.6 能在影片中渲染文字嗎？

可以！Wan 2.6 具備業界領先的文字渲染能力，適用於產品包裝、招牌與品牌內容。模型可在影片畫面中生成清晰可讀的文字——這是 Seedance 與多數競爭者缺乏的關鍵功能。

T2V、I2V 與 R2V 模式有何差異？

T2V（文字轉影片）從文字提示生成，具多鏡頭能力。I2V（圖片轉影片）以精準文字渲染將靜態圖片動畫化。R2V（參考影片轉換）使用影片參考在多次生成中保留角色身份。根據您的輸入類型與一致性需求選擇。

我擁有生成影片的商業權利嗎？

是的！每個 Wan 2.6 創作都附帶完整商業使用權。影片可直接用於行銷活動、客戶交付項目、品牌內容與商業應用，無需額外授權。

為何在 Atlas Cloud 上使用 Wan 2.6？

利用企業級基礎設施實現專業影片生成工作流程

專用基礎設施

在專為高需求 AI 影片工作負載優化的基礎設施上部署 Wan 2.6 的多鏡頭生成與 R2V 能力。1080p 15 秒生成的最佳效能。

統一所有模型的 API

透過單一統一 API 存取 Wan 2.6（T2V、I2V、R2V）與 300 多個 AI 模型（LLM、圖片、影片、音訊）。單次整合滿足所有生成式 AI 需求，驗證一致。

具競爭力的定價

與 AWS 相比節省高達 70%，透明的按使用量付費定價。無隱藏費用、無承諾——從原型到生產規模擴展，無需超支。

SOC I & II 認證安全性

您的參考影片與生成內容受 SOC I & II 認證與 HIPAA 合規保護。企業級安全性，具加密傳輸與儲存。

99.9% 正常運行 SLA

企業級可靠性，保證 99.9% 正常運行。您的 Wan 2.6 多鏡頭影片生成隨時可用於生產活動與關鍵內容工作流程。

輕鬆整合

使用 REST API 與多語言 SDK（Python、Node.js、Go）幾分鐘內完成整合。透過統一端點結構在 T2V、I2V 與 R2V 模式間無縫切換。

99.9%

正常運行時間

70%

較 AWS 降低成本

300+

生成式 AI 模型

24/7

專業支援

技術規格

Architecture

具多模態理解的進階 Transformer

Resolution

1080p（Full HD）

Frame Rate

24 FPS

Duration

5-15 秒（依模式而定）

Aspect Ratios

16:9、9:16、1:1

Generation Modes

T2V、I2V、R2V

Audio

原生同步，具唇形同步

Commercial Rights

包含完整商業使用權

體驗專業多鏡頭影片生成

加入全球內容創作者、行銷人員與電影製作人的行列，透過 Wan 2.6 突破性的多鏡頭敘事與角色一致性能力，革新影片製作。

Alibaba WAN 2.6 Video-to-Video Model

Alibaba WAN 2.6 is an advanced Video-to-Video model provided by Alibaba Cloud's DashScope platform. This model generates high-quality 480p/720p/1080p videos from text prompts.

What makes it stand out?

More affordable: Wan 2.6 is more streamlined and cost-effective - reducing creator expenses and offering more options.
One-pass A/V sync: Wan 2.6 creates a fully synchronized video (audio/voiceover + lip-sync) from a single, well-structured prompt - no separate recording or manual alignment required.
Multilingual friendly: Wan 2.6 reliably processes like Chinese prompts for A/V-synced videos.
Longer duration & more video size options: Wan 2.6 delivers up to 10 seconds and 6 aspect/size options, enabling more storytelling room and publishing flexibility.
Multi-shot storytelling: Generates cohesive multi-shot narratives, keeping key details consistent across shots and offering auto shot-split for simple prompts.
Video reference generation: Uses a reference video's appearance and voice to guide new videos; supports human or arbitrary subjects, single or dual performers.
15s long videos: Produces videos up to 15 seconds, expanding temporal capacity for richer storytelling.

Designed For

Marketing teams: Fast, polished demos/tutorials—low cost, consistent style.
Global enterprises: Multilingual, lip-synced videos with subtitles for efficient localization.
Storytellers & YouTubers: Immersive narratives while maintaining cadence and quality—driving growth.
Corporate training teams: HD videos over docs—clearer key points, better communication.

Pricing

The table below lists prices for easy comparsion.

Output Resolution	Duration (5s)	Duration (10s)
480p	$0.2	$0.4
720p	$0.4	$0.8
1080p	$0.6	$1.2

Billing Rules

Minimum charge: 5 seconds
Per-second rate = (price per 5 seconds) ÷ 5
Billed duration = video length in seconds (rounded up), with a 5-second minimum
Total cost = billed duration × per-second rate (by output resolution)

How to Use

Write your prompt.
Upload an audio file (optional) for voice/music.
Choose the video size (resolution/aspect).
Select the video duration (e.g., 5s / 10s).
Submit and wait for processing.
Preview and download the result.

探索類似模型

NEW

HOT

文生影片

Van-2.6 Text-to-video

A speed-optimized text-to-video option that prioritizes lower latency while retaining strong visual fidelity. Ideal for iteration, batch generation, and prompt testing.

Van-2.6 Image-to-video

A speed-optimized image-to-video option that prioritizes lower latency while retaining strong visual fidelity. Ideal for iteration, batch generation, and prompt testing.

HappyHorse-1.1 Reference-to-video

Generates videos from one to nine reference images and a text prompt, supporting 720P or 1080P output, flexible aspect ratios, and durations from 3 to 15 seconds.

HappyHorse-1.1 Image-to-video

Animates a first-frame image into video with optional prompt guidance, 720P or 1080P output, and durations from 3 to 15 seconds.

HappyHorse-1.1 Text-to-video

Generates videos from text prompts with HappyHorse 1.1, supporting 720P or 1080P output, flexible aspect ratios, and durations from 3 to 15 seconds.

HappyHorse-1.0 Image-to-video

Animates a first-frame image into video with optional prompt guidance, 720P or 1080P output, and durations from 3 to 15 seconds.

HappyHorse-1.0 Text-to-video

Generates videos from text prompts with HappyHorse 1.0, supporting 720P or 1080P output, flexible aspect ratios, and durations from 3 to 15 seconds.

HappyHorse-1.0 Video-edit

Edits an input video with text instructions and optional reference images, supporting 720P or 1080P output.

HappyHorse-1.0 Reference-to-video

Generates videos from one to nine reference images and a text prompt, supporting 720P or 1080P output, flexible aspect ratios, and durations from 3 to 15 seconds.

From

$0.14/秒