alibaba/wan-2.5/video-extend

Text-to-Video

Wan 2.5 Video Extend API by Alibaba

Video-extend

Extend your videos, with audio, using Alibaba's Wan 2.5 video-extend model.

Each run costs $0.052. With $10, you can run the model about 192 times.
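
The run count above is the budget divided by the per-run price, rounded down. A minimal sketch of that arithmetic follows; the function name runs_for_budget is illustrative and not part of any Atlas Cloud SDK.

```python
from decimal import Decimal, ROUND_FLOOR


def runs_for_budget(budget_usd: str, price_per_run_usd: str) -> int:
    """Return how many whole runs fit into a budget (hypothetical helper)."""
    budget = Decimal(budget_usd)
    price = Decimal(price_per_run_usd)
    if price <= 0:
        raise ValueError("price per run must be positive")
    # Round down: a partially funded run cannot be started.
    return int((budget / price).to_integral_value(rounding=ROUND_FLOOR))


print(runs_for_budget("10", "0.052"))  # 192
```

Decimal is used instead of float so that prices such as 0.052 are represented exactly.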

Code example
import os
import time

import requests

# Read the API key from the environment. A "$ATLASCLOUD_API_KEY" placeholder
# inside a Python string is sent literally; it is not expanded by the shell.
API_KEY = os.environ["ATLASCLOUD_API_KEY"]
headers = {
    "Content-Type": "application/json",
    "Authorization": f"Bearer {API_KEY}",
}

# Step 1: Start video generation
generate_url = "https://api.atlascloud.ai/api/v1/model/generateVideo"
data = {
    "model": "alibaba/wan-2.5/video-extend",
    "prompt": "A beautiful sunset over the ocean with gentle waves",
    "width": 512,
    "height": 512,
    "duration": 3,
    "fps": 24,
}

generate_response = requests.post(generate_url, headers=headers, json=data)
generate_response.raise_for_status()  # Stop early on HTTP errors
generate_result = generate_response.json()
prediction_id = generate_result["data"]["id"]

# Step 2: Poll for result
poll_url = f"https://api.atlascloud.ai/api/v1/model/prediction/{prediction_id}"

def check_status():
    while True:
        response = requests.get(poll_url, headers=headers)
        response.raise_for_status()
        result = response.json()
        status = result["data"]["status"]

        if status in ["completed", "succeeded"]:
            print("Generated video:", result["data"]["outputs"][0])
            return result["data"]["outputs"][0]
        elif status == "failed":
            # The error field may be missing, so fall back to a generic message
            raise Exception(result["data"].get("error") or "Generation failed")
        # Still processing: wait 2 seconds before polling again
        time.sleep(2)

video_url = check_status()

Installation

Install the required package for your programming language.

pip install requests

Authentication

All API requests require authentication with an API key. You can get an API key from the Atlas Cloud dashboard.

export ATLASCLOUD_API_KEY="your-api-key-here"

HTTP Headers

import os

API_KEY = os.environ.get("ATLASCLOUD_API_KEY")
headers = {
    "Content-Type": "application/json",
    "Authorization": f"Bearer {API_KEY}"
}

Keep your API key secure

Never expose your API key in client-side code or public repositories. Use environment variables or a server-side proxy instead.
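
One way to follow this advice is to fail fast when the key is missing, rather than sending a request with an empty or placeholder token. The sketch below assumes the ATLASCLOUD_API_KEY variable shown above; build_headers is a hypothetical helper, not an Atlas Cloud function.

```python
import os


def build_headers(env=None):
    """Build request headers from the environment (hypothetical helper).

    `env` defaults to os.environ; passing a dict makes the function easy to test.
    """
    env = os.environ if env is None else env
    api_key = (env.get("ATLASCLOUD_API_KEY") or "").strip()
    if not api_key:
        # Refuse to continue instead of sending "Bearer " with no key.
        raise RuntimeError("ATLASCLOUD_API_KEY is not set")
    return {
        "Content-Type": "application/json",
        "Authorization": f"Bearer {api_key}",
    }
```

The key never appears in source code, so the file can be committed without leaking credentials.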

Make a request

import os

import requests

url = "https://api.atlascloud.ai/api/v1/model/generateVideo"
headers = {
    "Content-Type": "application/json",
    # Read the key from the environment; "$ATLASCLOUD_API_KEY" is not expanded inside a Python string
    "Authorization": f"Bearer {os.environ['ATLASCLOUD_API_KEY']}"
}
data = {
    "model": "your-model",
    "prompt": "A beautiful landscape"
}

response = requests.post(url, headers=headers, json=data)
print(response.json())

Submit a request

Submit an asynchronous generation request. The API returns a prediction ID that you can use to check the status and retrieve the result.

POST /api/v1/model/generateVideo

Request body

import os

import requests

url = "https://api.atlascloud.ai/api/v1/model/generateVideo"
headers = {
    "Content-Type": "application/json",
    # Read the key from the environment; "$ATLASCLOUD_API_KEY" is not expanded inside a Python string
    "Authorization": f"Bearer {os.environ['ATLASCLOUD_API_KEY']}"
}

data = {
    "model": "alibaba/wan-2.5/video-extend",
    "input": {
        "prompt": "A beautiful sunset over the ocean with gentle waves"
    }
}

response = requests.post(url, headers=headers, json=data)
result = response.json()

print(f"Prediction ID: {result['id']}")
print(f"Status: {result['status']}")

Response

{
  "id": "pred_abc123",
  "status": "processing",
  "model": "model-name",
  "created_at": "2025-01-01T00:00:00Z"
}
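
The examples on this page show the prediction both as a top-level object (as in the response above) and wrapped in a "data" field (as in the polling responses). The sketch below accepts either shape; extract_prediction is a hypothetical helper, and which shape the live API actually returns is not confirmed here.

```python
def extract_prediction(result: dict) -> dict:
    """Return the prediction object from a response body (hypothetical helper).

    Accepts both {"id": ...} and {"data": {"id": ...}} shapes.
    """
    body = result.get("data", result)
    if not isinstance(body, dict) or "id" not in body:
        raise ValueError("response does not contain a prediction id")
    return body


flat = {"id": "pred_abc123", "status": "processing"}
wrapped = {"data": {"id": "pred_abc123", "status": "processing"}}
print(extract_prediction(flat)["id"])     # pred_abc123
print(extract_prediction(wrapped)["id"])  # pred_abc123
```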

Check status

Poll the prediction endpoint to check the current status of the request.

GET /api/v1/model/prediction/{prediction_id}

Polling example

import os
import time

import requests

prediction_id = "pred_abc123"
url = f"https://api.atlascloud.ai/api/v1/model/prediction/{prediction_id}"
# Read the key from the environment; "$ATLASCLOUD_API_KEY" is not expanded inside a Python string
headers = { "Authorization": f"Bearer {os.environ['ATLASCLOUD_API_KEY']}" }

while True:
    response = requests.get(url, headers=headers)
    result = response.json()
    status = result["data"]["status"]
    print(f"Status: {status}")

    if status in ["completed", "succeeded"]:
        output_url = result["data"]["outputs"][0]
        print(f"Output URL: {output_url}")
        break
    elif status == "failed":
        print(f"Error: {result['data'].get('error', 'Unknown')}")
        break

    # Still processing: wait 3 seconds before polling again
    time.sleep(3)

Status values

processing: The request is still being processed.

completed: Generation has finished. The output is ready.

succeeded: Generation succeeded. The output is ready.

failed: Generation failed. Check the error field.
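
The four status values map naturally onto a polling loop with a timeout, so that a stuck request does not poll forever. This is a minimal sketch: poll_until_done and its parameters are illustrative names, and fetch stands for any callable that returns the prediction's "data" object (for example, a wrapper around the GET request above).

```python
import time

SUCCESS_STATUSES = {"completed", "succeeded"}


def poll_until_done(fetch, interval=3.0, timeout=300.0,
                    sleep=time.sleep, clock=time.monotonic):
    """Call fetch() until the prediction finishes, fails, or times out."""
    deadline = clock() + timeout
    while True:
        data = fetch()
        status = data.get("status")
        if status in SUCCESS_STATUSES:
            return data
        if status == "failed":
            raise RuntimeError(data.get("error") or "Generation failed")
        # Any other status (such as "processing") means: keep waiting.
        if clock() >= deadline:
            raise TimeoutError(f"prediction still '{status}' after {timeout}s")
        sleep(interval)


# Usage with canned responses instead of real HTTP calls:
responses = iter([
    {"status": "processing"},
    {"status": "completed", "outputs": ["https://example.invalid/result.mp4"]},
])
done = poll_until_done(lambda: next(responses), sleep=lambda s: None)
print(done["outputs"][0])
```

Injecting sleep and clock keeps the loop testable without real delays.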

Completed response

{
  "data": {
    "id": "pred_abc123",
    "status": "completed",
    "outputs": [
      "https://storage.atlascloud.ai/outputs/result.mp4"
    ],
    "metrics": {
      "predict_time": 45.2
    },
    "created_at": "2025-01-01T00:00:00Z",
    "completed_at": "2025-01-01T00:00:10Z"
  }
}

Upload a file

Upload a file to Atlas Cloud storage and receive a URL that you can use in your API requests. Uploads use multipart/form-data.

POST /api/v1/model/uploadMedia

Upload example

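import os

import requests

url = "https://api.atlascloud.ai/api/v1/model/uploadMedia"
# Read the key from the environment; "$ATLASCLOUD_API_KEY" is not expanded inside a Python string
headers = { "Authorization": f"Bearer {os.environ['ATLASCLOUD_API_KEY']}" }

# requests builds the multipart/form-data body from the files mapping
with open("image.png", "rb") as f:
    files = {"file": ("image.png", f, "image/png")}
    response = requests.post(url, headers=headers, files=files)

result = response.json()
download_url = result["data"]["download_url"]
print(f"File URL: {download_url}")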

Response

{
  "data": {
    "download_url": "https://storage.atlascloud.ai/uploads/abc123/image.png",
    "file_name": "image.png",
    "content_type": "image/png",
    "size": 1024000
  }
}
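
The upload example hard-codes image/png. For other files, such as a video clip to extend, the content type can be derived from the file name. The sketch below uses the standard-library mimetypes module; build_upload_files is a hypothetical helper, and the "file" field name is taken from the example above.

```python
import io
import mimetypes
import os


def build_upload_files(path, file_obj):
    """Build the `files` mapping for requests.post (hypothetical helper)."""
    name = os.path.basename(path)
    content_type, _ = mimetypes.guess_type(name)
    # Fall back to a generic binary type when the extension is unknown.
    return {"file": (name, file_obj, content_type or "application/octet-stream")}


files = build_upload_files("clips/intro.mp4", io.BytesIO(b"placeholder bytes"))
print(files["file"][0], files["file"][2])
```

The returned mapping can be passed as the files argument in the upload example in place of the hard-coded tuple.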

Input Schema

The following parameters are accepted in the request body.

Total: 0 | Required: 0 | Optional: 0

No parameters are available.

Request body example

{
  "model": "alibaba/wan-2.5/video-extend"
}

Output Schema

The API returns a prediction response with the generated output URLs.

id (string, required)

Unique identifier for the prediction.

status (string, required)

Current status of the prediction.

Allowed values: processing, completed, succeeded, failed

model (string, required)

The model used for generation.

outputs (array[string])

Array of output URLs. Available when status is "completed".

error (string)

Error message if status is "failed".

metrics (object)

Performance metrics.

metrics.predict_time (number)

Time taken for video generation in seconds.

created_at (string, required)

ISO 8601 timestamp when the prediction was created.

Format: date-time

completed_at (string)

ISO 8601 timestamp when the prediction was completed.

Format: date-time

Response example

{
  "id": "pred_abc123",
  "status": "completed",
  "model": "model-name",
  "outputs": [
    "https://storage.atlascloud.ai/outputs/result.mp4"
  ],
  "metrics": {
    "predict_time": 45.2
  },
  "created_at": "2025-01-01T00:00:00Z",
  "completed_at": "2025-01-01T00:00:10Z"
}
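
The output schema can be mirrored in a small typed container, which makes optional fields and timestamp parsing explicit. This is a sketch under the field names listed above; the Prediction class and parse_timestamp function are illustrative and not part of an official SDK.

```python
from dataclasses import dataclass, field
from datetime import datetime
from typing import List, Optional


def parse_timestamp(value: Optional[str]) -> Optional[datetime]:
    """Parse an ISO 8601 timestamp; a trailing 'Z' is mapped to UTC."""
    if not value:
        return None
    return datetime.fromisoformat(value.replace("Z", "+00:00"))


@dataclass
class Prediction:
    id: str
    status: str
    model: Optional[str] = None
    outputs: List[str] = field(default_factory=list)
    error: Optional[str] = None
    predict_time: Optional[float] = None
    created_at: Optional[datetime] = None
    completed_at: Optional[datetime] = None

    @classmethod
    def from_dict(cls, body: dict) -> "Prediction":
        metrics = body.get("metrics") or {}
        return cls(
            id=body["id"],
            status=body["status"],
            model=body.get("model"),
            outputs=list(body.get("outputs") or []),
            error=body.get("error"),
            predict_time=metrics.get("predict_time"),
            created_at=parse_timestamp(body.get("created_at")),
            completed_at=parse_timestamp(body.get("completed_at")),
        )


example = {
    "id": "pred_abc123",
    "status": "completed",
    "model": "model-name",
    "outputs": ["https://storage.atlascloud.ai/outputs/result.mp4"],
    "metrics": {"predict_time": 45.2},
    "created_at": "2025-01-01T00:00:00Z",
    "completed_at": "2025-01-01T00:00:10Z",
}
prediction = Prediction.from_dict(example)
print(prediction.outputs[0])
```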

Atlas Cloud Skills

Atlas Cloud Skills integrates more than 300 AI models directly into your AI coding assistant. Install it with one command, then use natural language to generate images and videos and to chat with LLMs.

Supported applications

Claude Code

OpenAI Codex

Gemini CLI

Cursor

Windsurf

VS Code

Trae

GitHub Copilot

Cline

Roo Code

Amp

Goose

Replit

40+ supported applications

Installation

npx skills add AtlasCloudAI/atlas-cloud-skills

Set up the API key

Get an API key from the Atlas Cloud dashboard and set it as an environment variable.

export ATLASCLOUD_API_KEY="your-api-key-here"

Capabilities

Once installed, you can use natural language in your AI assistant to access all Atlas Cloud models.

Image generation: Generate images with models such as Nano Banana 2, Z-Image, and more.

Video generation: Create videos from text or images with Kling, Vidu, Veo, and others.

LLM chat: Chat with Qwen, DeepSeek, and other large language models.

Media upload: Upload local files for image editing and image-to-video workflows.

Learn more

github.com/AtlasCloudAI/atlas-cloud-skills

MCP Server

Atlas Cloud MCP Server connects your IDE to more than 300 AI models through the Model Context Protocol. It works with any MCP-compatible application.

Supported applications

Cursor

VS Code

Windsurf

Claude Code

OpenAI Codex

Gemini CLI

Cline

Roo Code

100+ supported applications

Installation

npx -y atlascloud-mcp

Configuration

Add the following configuration to your IDE's MCP settings file.

{
  "mcpServers": {
    "atlascloud": {
      "command": "npx",
      "args": [
        "-y",
        "atlascloud-mcp"
      ],
      "env": {
        "ATLASCLOUD_API_KEY": "your-api-key-here"
      }
    }
  }
}
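
If the settings file already lists other MCP servers, the Atlas Cloud entry has to be merged in rather than pasted over them. The sketch below shows that merge on plain dictionaries; add_atlascloud_server is a hypothetical helper, and the location and name of the settings file depend on the IDE.

```python
import copy
import json


def add_atlascloud_server(settings: dict, api_key: str) -> dict:
    """Return a copy of `settings` with the atlascloud MCP entry added."""
    merged = copy.deepcopy(settings)  # Leave the caller's dict untouched
    servers = merged.setdefault("mcpServers", {})
    servers["atlascloud"] = {
        "command": "npx",
        "args": ["-y", "atlascloud-mcp"],
        "env": {"ATLASCLOUD_API_KEY": api_key},
    }
    return merged


# "other-server" is a made-up entry standing in for an existing configuration.
existing = {"mcpServers": {"other-server": {"command": "node", "args": ["server.js"]}}}
print(json.dumps(add_atlascloud_server(existing, "your-api-key-here"), indent=2))
```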

Available tools

atlas_generate_image: Generate images from text descriptions.

atlas_generate_video: Create videos from text or images.

atlas_chat: Chat with large language models.

atlas_list_models: Browse more than 300 available AI models.

atlas_quick_generate: Generate content in one step with automatic selection of the best model.

atlas_upload_media: Upload local files for API workflows.

Learn more

github.com/AtlasCloudAI/mcp-server

Wan 2.5 - The Smart Video Creator's Choice

All-in-One Synchronized Audio/Video Generation

Wan 2.5 is an AI video generation model that produces synchronized audio-visual content in a single step. No separate recording or manual lip-sync alignment is needed: provide a clear, structured prompt and it generates a complete video with audio or voiceover and lip-sync.

Why Choose Wan 2.5?

More Cost-Effective

Even after Google's recent price cut, Veo 3 remains fairly expensive overall. Wan 2.5 is lightweight and cost-efficient, giving creators more options while significantly reducing production costs.

One-Step Generation, End-to-End Sync

With Wan 2.5, there is no need for separate recording or manual lip-sync alignment. Provide a clear, structured prompt to generate a complete video with audio or voiceover and lip-sync in one pass, which is faster and simpler.

Multilingual Friendly

When prompted in Chinese, Wan 2.5 reliably generates synchronized audio-visual video. By contrast, Veo 3 often returns "unknown language" for Chinese prompts.

Accurate Character Reproduction

Wan 2.5 excels at restoring character traits. It accurately reproduces appearance, expressions, and movement style, so generated characters are more recognizable and have more personality, which strengthens storytelling and immersion.

Artistic Style Rendering

Supports Studio Ghibli-style rendering, with hand-painted watercolor textures and animation effects. The result is a warm, dreamlike visual experience with more artistic appeal and narrative depth.

Who Can Benefit?

Marketing Teams

Whether for product launches, promotional campaigns, or brand marketing, Wan 2.5 helps you quickly produce high-quality videos, making creation easy and efficient.

Product demos and tutorials without complex coordination
Social media marketing with multilingual subtitles and lip-sync
AI-generated content that lets the team focus on strategy and creativity

Bottom line: Creation has never been this simple, fast, and smart. Wan 2.5 is the secret weapon for your marketing!

Global Enterprises

Provides an ideal content localization solution for multinational companies, making creation easier and more efficient.

Multilingual video support with prompt recognition
One-click subtitles and lip-synced voiceover
Fast content localization for global markets

Bottom line: Cross-border content creation has never been this simple, fast, and smart.

Content Creators / YouTubers

Creators can use Wan 2.5 to improve video production efficiency while maintaining high output quality.

Immersive storytelling with accurate character actions and expressions
More efficient publishing with less editing and post-production time
Diverse content, from short videos to animated story segments

Corporate Training Teams

Wan 2.5 makes corporate training more effective and engaging.

Professional videos that replace dull text documents
Quick creation of operational demos and training guides
Consistent style and standardized output for global rollout

Creative Freelancers / Small Studios

Wan 2.5 unleashes creativity without expensive equipment or actors; the AI generates everything efficiently.

Experiment with diverse works, from short films to social media content
Go from idea to finished piece with one-click generation
High-quality content without expensive equipment or professional actors

Bottom line: Wan 2.5 makes creation easier, freer, and more fun with every attempt!

Educational Institutions / Online Course Creators

Turn ideas into reality without high costs: Wan 2.5 makes producing quality content easy and affordable.

Experiment with different styles, from short films to promotional videos
Higher production efficiency from idea to finished product
Quality content without expensive equipment or professional staff

Bottom line: Wan 2.5 makes creation easy, efficient, and free, and every attempt impresses!

Core Features

One-Step Audio/Video Generation

Generates complete videos with synchronized audio, voiceover, and lip-sync in a single process

Dual-Character Sync

Supports generating two characters at once, with actions, expressions, and lip-sync for natural interaction

Professional Quality

High-quality video output with realistic character expressions and accurate lip-sync

Multilingual Support

Strong support for Chinese prompts and reliable multilingual content generation

Cost-Effective

Significantly lower cost than competitors while maintaining professional quality

Character Trait Restoration

Accurately reproduces a character's appearance, expressions, and movement style with high fidelity and distinct personality

Artistic Style Rendering

Supports a range of artistic styles, including Studio Ghibli-style hand-painted watercolor textures

Immersive Scenes

Ideal for dialogue scenes, interviews, or two-person short films with natural audio-visual consistency

Digital Human Sync

Study Room Scholar

Middle-aged man reading with perfect lip-sync in a warm study environment

Lip-sync with audio | Environmental sounds | Character emotion

Prompt

A middle-aged man sitting at a wooden desk in a cozy study room, surrounded by bookshelves and a warm lamp glow. He opens an old book and reads aloud with a calm, deep voice: 'History teaches us more than just facts… it shows us who we are.' The room has subtle background sounds: pages turning, the faint ticking of a clock, and distant rain against the window.

Dual Character Scene

Park Sunset Romance

Couple interaction with synchronized dual character actions and expressions

Dual character sync | Natural interaction | Ambient soundscape

Prompt

A young couple sitting on a park bench during sunset. The woman leans her head on the man's shoulder. He whispers softly: 'No matter where we go, I'll always be here with you.' The sound includes the rustling of leaves, distant laughter of children playing, and the gentle hum of cicadas in the evening air.

Character Restoration

Ballet Performance Art

Precise character trait restoration with artistic movement and expression

Character trait restoration | Movement precision | Artistic lighting

Prompt

A graceful ballerina with her hair in a messy bun, performing a powerful and emotional contemporary ballet routine. She is in a minimalist, dark art studio. Abstract patterns of light and shadow, projected from a hidden source, dance across her body and the surrounding walls, constantly shifting with her movements. The camera focuses on the tension in her muscles and the expressive gestures of her hands. A single, dramatic slow-motion shot captures her mid-air leap, with the light patterns swirling around her like a galaxy. Moody, artistic, high contrast.

Artistic Style Rendering

Ghibli Forest Magic

Studio Ghibli-inspired animation with hand-painted watercolor texture

Ghibli art style | Hand-painted texture | Magical atmosphere

Prompt

Studio Ghibli-inspired anime style. A young girl with a straw hat lies peacefully in a sun-dappled magical forest, surrounded by friendly, glowing forest spirits (Kodama). A gentle breeze rustles the leaves of the giant, ancient trees. The air is filled with sparkling dust motes, illuminated by shafts of sunlight. The art style is soft, with a hand-painted watercolor texture. The scene feels serene, magical, and heartwarming.

Perfect For

🎬 Video Production
📢 Marketing Content
🎓 Educational Videos
📱 Social Media
🌐 Multilingual Content
💼 Corporate Training
🎭 Entertainment
💃 Performing Arts
🎨 Animation & Anime
📚 Storytelling
👥 Two-Character Videos
🎙️ Interviews
📺 Broadcast Media

Technical Specifications

Model Type: Synchronized audio-visual generation

Key Features: A/V sync, character restoration, artistic rendering, multilingual

Language Support: Chinese, English, and more

Output Quality: Professional HD video with audio

Generation Speed: Fast, single-step generation

API Integration: RESTful API with full documentation

Experience Wan 2.5 - Your Video Creation Revolution

Join thousands of creators and businesses that are changing how they produce video content with synchronized audio-visual generation.

🎬 One-Step A/V Sync

🌍 Multilingual Support

⚡ Cost-Effective

Wan 2.5: A next-generation AI video generation model developed by Alibaba Wanxiang.

Model Card Overview

Field	Description
Model Name	Wan 2.5
Developed By	Alibaba Group
Release Date	September 24, 2025
Model Type	Generative AI, Video Foundation Model
Related Links	Official Website: https://wan.video/, Hugging Face: https://huggingface.co/Wan-AI, Technical Paper (Wan Series): https://arxiv.org/abs/2503.20314

Introduction

Wan 2.5 is a state-of-the-art, open-source video foundation model developed by Alibaba's Wan AI team. It is designed to generate high-quality, cinematic videos complete with synchronized audio directly from text or image prompts. The model represents a significant advancement in the field of generative AI, aiming to lower the barrier for creative video production. Its core contribution lies in its ability to produce coherent, dynamic, and narratively consistent video clips with a high degree of realism and integrated audio-visual elements, such as lip-sync and sound effects, in a single, streamlined process.

Key Features & Innovations

Wan 2.5 introduces several key features that distinguish it from previous models and competitors:

Unified Audio-Visual Synthesis: Unlike many models that require separate steps for video and audio generation, Wan 2.5 creates video with natively synchronized audio, including voice, sound effects, and lip-sync, in one step.
High-Fidelity, High-Resolution Output: The model is capable of generating videos in multiple resolutions, including 480p, 720p, and full 1080p HD, with significant improvements in visual quality and frame-to-frame stability over its predecessors.
Extended Video Duration: Wan 2.5 can generate video clips up to 10 seconds in length, offering more creative flexibility for storytelling compared to other models in its class.
Advanced Cinematic Control: The model demonstrates a sophisticated understanding of cinematic language, allowing for precise control over camera movement, shot composition, and character consistency within scenes.
Open-Source Commitment: Following the precedent set by earlier versions, the Wan series of models, including Wan 2.5, are open-sourced to encourage research, development, and innovation within the broader AI community.

Model Architecture & Technical Details

Wan 2.5 is built upon the Diffusion Transformer (DiT) paradigm, which has become a mainstream approach for high-quality generative tasks. The technical report for the Wan model series outlines a suite of innovations that contribute to its performance.

The architecture includes a novel Variational Autoencoder (VAE) designed for high-efficiency video compression, enabling the model to handle high-resolution video data effectively. The Wan series is available in multiple sizes to balance performance and computational requirements, such as the 1.3B and 14B parameter models detailed for Wan 2.1 in the series technical report. The model was trained on a massive, curated dataset comprising billions of images and videos, which enhances its ability to generalize across a wide range of motions, semantics, and aesthetic styles.

Intended Use & Applications

Wan 2.5 is designed for a wide array of applications in creative and commercial fields. Its intended uses include:

Content Creation: Generating short-form videos for social media, marketing campaigns, and digital advertising.
Storytelling and Filmmaking: Creating cinematic scenes, character animations, and narrative sequences for short films and conceptual art.
Prototyping: Rapidly visualizing scripts and storyboards for film, television, and game development.
Personalized Media: Enabling users to create unique, personalized video content from their own ideas and images.

Performance

Wan 2.5 has demonstrated significant performance improvements over previous versions and holds a competitive position against other leading video generation models. Independent reviews and benchmarks provide insight into its capabilities.

Benchmark Scores

A review conducted by Curious Refuge Labs™ evaluated the model's visual generation capabilities across several metrics.

Metric	Score (out of 10)
Prompt Adherence	7.0
Temporal Consistency	6.6
Visual Fidelity	6.5
Motion Quality	5.9
Style & Cinematic Realism	5.7
Overall Score	6.3

These scores indicate strong prompt understanding and a notable improvement in visual quality from Wan 2.2, although it still shows limitations in complex motion and realism compared to top-tier commercial models.
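
The overall score in the table is consistent with an unweighted mean of the five metrics, rounded to one decimal place. The review's actual weighting is not stated on this page, so the check below is only an assumption that happens to reproduce the published figure.

```python
scores = {
    "Prompt Adherence": 7.0,
    "Temporal Consistency": 6.6,
    "Visual Fidelity": 6.5,
    "Motion Quality": 5.9,
    "Style & Cinematic Realism": 5.7,
}


def overall_score(metric_scores: dict) -> float:
    """Unweighted mean of the metric scores, rounded to one decimal."""
    return round(sum(metric_scores.values()) / len(metric_scores), 1)


print(overall_score(scores))  # 6.3
```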

Explore Similar Models

Text-to-Video

Van-2.5 Text-to-video

Convert prompts into cinematic video clips with synchronized sound. Van 2.5 generates 720p/1080p outputs with stable motion, native audio sync, and prompt-faithful visual storytelling.

Van-2.5 Image-to-video

Get animated visuals from your images faster without major quality sacrifice. Perfect for preview workflows, previews at scale, or mass production of animated assets.

HappyHorse-1.0 Image-to-video

Animates a first-frame image into video with optional prompt guidance, 720P or 1080P output, and durations from 3 to 15 seconds.

HappyHorse-1.0 Text-to-video

Generates videos from text prompts with HappyHorse 1.0, supporting 720P or 1080P output, flexible aspect ratios, and durations from 3 to 15 seconds.

HappyHorse-1.0 Video-edit

Edits an input video with text instructions and optional reference images, supporting 720P or 1080P output.

HappyHorse-1.0 Reference-to-video

Generates videos from one to nine reference images and a text prompt, supporting 720P or 1080P output, flexible aspect ratios, and durations from 3 to 15 seconds.

Wan-2.7 Image-to-video

Animates images into videos with first-frame, first-and-last-frame, video continuation, and audio-driven modes.

Wan-2.7 Text-to-video

Generates videos from text prompts with multi-shot narrative, audio generation, and sound-image synchronization.

Wan-2.7 Video-edit

Edits videos using text instructions, reference images, and style transfer with multi-modal input support.

Wan-2.7 Reference-to-video

Generates character-driven videos from reference images and videos, with multi-subject and voice-cloning support.

Wan-2.2 Image-to-video

Open and Advanced Large-Scale Video Generative Models.

Wan-2.2 Image-to-video Lora

Open and Advanced Large-Scale Video Generative Models.

From $0.04/second

Image-to-Video

Wan-2.2-spicy Image-to-video

Open and Advanced Large-Scale Video Generative Models.

From $0.03/second

Image-to-Video

Wan-2.2-spicy Image-to-video Lora

Open and Advanced Large-Scale Video Generative Models.

Wan-2.6 Image-to-video Flash

Wan2.6 image to video flash, faster and more cost-effective generation. Intelligent shot scheduling enables multi‑camera storytelling, supports stable multi‑speaker dialogue with more natural and realistic vocal timbres.

Wan-2.6 Video-to-video

A speed-optimized video-to-video option that prioritizes lower latency while retaining strong visual fidelity. Ideal for iteration, batch generation, and prompt testing.

From $0.07/second (was $0.1/second, -30%)

One API for all multimodal AI.
