alibaba/wan-2.6/image-to-video-flash

画像から動画

Wan 2.6 Image-to-Video Flash API by Alibaba

alibaba/wan-2.6/image-to-video-flash

Image-to-video-flash

Wan2.6 image to video flash, faster and more cost-effective generation. Intelligent shot scheduling enables multi‑camera storytelling, supports stable multi‑speaker dialogue with more natural and realistic vocal timbres.

入力

プロンプト *

ネガティブプロンプト

画像 *

ファイルをドラッグ＆ドロップするか、クリックしてアップロード

MAX:1

オーディオ

ファイルをドラッグ＆ドロップするか、クリックしてアップロード

MAX:1

解像度 *

長さ

プロンプト拡張

ショットタイプ

オーディオ生成

シード

出力

待機中

生成された動画がここに表示されます

設定を構成して「実行」をクリックして開始

各実行には$0.018かかります。$10で約555回実行できます。

次にできること：

Seedance 2.0 Kling v3 Vidu Wan2.7

パラメータ

コード例
import requests
import time

# Step 1: Start video generation
generate_url = "https://api.atlascloud.ai/api/v1/model/generateVideo"
headers = {
    "Content-Type": "application/json",
    "Authorization": "Bearer $ATLASCLOUD_API_KEY"
}
data = {
    "model": "alibaba/wan-2.6/image-to-video-flash",  # Required. model name
    "audio": "example_value",  # Audio URL to guide generation (optional)
    "duration": 5,  # The duration of the generated media in seconds. (min: 5, max: 15)
    "enable_prompt_expansion": True,  # If set to true, the prompt optimizer will be enabled
    "image": "example_value",  # Required. The image for generating the output
    "negative_prompt": "example_value",  # Negative prompt for the generation
    "prompt": "A beautiful sunset over the ocean with gentle waves",  # Required. The prompt for generating the output
    "resolution": "720p",  # Required. The resolution of the generated video. options: 720p | 1080p
    "seed": -1,  # The random seed to use for the generation
    "shot_type": "multi",  # Generate video in multi camera angles, only works when set enable_prompt_expansion to true. options: multi | single
    "generate_audio": True,  # Whether to automatically add audio to the generated video
}

generate_response = requests.post(generate_url, headers=headers, json=data)
generate_result = generate_response.json()
prediction_id = generate_result["data"]["id"]

# Step 2: Poll for result
poll_url = f"https://api.atlascloud.ai/api/v1/model/prediction/{prediction_id}"

def check_status():
    while True:
        response = requests.get(poll_url, headers={"Authorization": "Bearer $ATLASCLOUD_API_KEY"})
        result = response.json()

        if result["data"]["status"] in ["completed", "succeeded"]:
            print("Generated video:", result["data"]["outputs"][0])
            return result["data"]["outputs"][0]
        elif result["data"]["status"] == "failed":
            raise Exception(result["data"]["error"] or "Generation failed")
        else:
            # Still processing, wait 2 seconds
            time.sleep(2)

video_url = check_status()

インストール

お使いの言語に必要なパッケージをインストールしてください。

pip install requests

認証

すべての API リクエストには API キーによる認証が必要です。API キーは Atlas Cloud ダッシュボードから取得できます。

export ATLASCLOUD_API_KEY="your-api-key-here"

HTTP ヘッダー

import os

API_KEY = os.environ.get("ATLASCLOUD_API_KEY")
headers = {
    "Content-Type": "application/json",
    "Authorization": f"Bearer {API_KEY}"
}

API キーを安全に保管してください

API キーをクライアントサイドのコードや公開リポジトリに公開しないでください。代わりに環境変数またはバックエンドプロキシを使用してください。

リクエストを送信

import requests

url = "https://api.atlascloud.ai/api/v1/model/generateVideo"
headers = {
    "Content-Type": "application/json",
    "Authorization": "Bearer $ATLASCLOUD_API_KEY"
}
data = {
    "model": "your-model",
    "prompt": "A beautiful landscape"
}

response = requests.post(url, headers=headers, json=data)
print(response.json())

リクエストを送信

非同期生成リクエストを送信します。API は予測 ID を返し、それを使用してステータスの確認や結果の取得ができます。

POST/api/v1/model/generateVideo

リクエストボディ

import requests

url = "https://api.atlascloud.ai/api/v1/model/generateVideo"
headers = {
    "Content-Type": "application/json",
    "Authorization": "Bearer $ATLASCLOUD_API_KEY"
}

data = {
    "model": "alibaba/wan-2.6/image-to-video-flash",
    "prompt": "A beautiful sunset over the ocean with gentle waves"
}

response = requests.post(url, headers=headers, json=data)
result = response.json()

print(f"Prediction ID: {result['data']['id']}")
print(f"Status: {result['data']['status']}")

レスポンス

{
  "code": 200,
  "data": {
    "id": "pred_abc123",
    "status": "processing",
    "model": "model-name",
    "created_at": "2025-01-01T00:00:00Z"
  }
}

ステータスを確認

予測エンドポイントをポーリングして、リクエストの現在のステータスを確認します。

GET/api/v1/model/prediction/{prediction_id}

ポーリング例

import requests
import time

prediction_id = "pred_abc123"
url = f"https://api.atlascloud.ai/api/v1/model/prediction/{prediction_id}"
headers = { "Authorization": "Bearer $ATLASCLOUD_API_KEY" }

while True:
    response = requests.get(url, headers=headers)
    result = response.json()
    status = result["data"]["status"]
    print(f"Status: {status}")

    if status in ["completed", "succeeded"]:
        output_url = result["data"]["outputs"][0]
        print(f"Output URL: {output_url}")
        break
    elif status == "failed":
        print(f"Error: {result['data'].get('error', 'Unknown')}")
        break

    time.sleep(3)

ステータス値

processingリクエストはまだ処理中です。

completed生成が完了しました。出力が利用可能です。

succeeded生成が成功しました。出力が利用可能です。

failed生成に失敗しました。エラーフィールドを確認してください。

完了レスポンス

{
  "data": {
    "id": "pred_abc123",
    "status": "completed",
    "outputs": [
      "https://storage.atlascloud.ai/outputs/result.mp4"
    ],
    "metrics": {
      "predict_time": 45.2
    },
    "created_at": "2025-01-01T00:00:00Z",
    "completed_at": "2025-01-01T00:00:10Z"
  }
}

ファイルをアップロード

Atlas Cloud ストレージにファイルをアップロードし、API リクエストで使用できる URL を取得します。multipart/form-data を使用してアップロードします。

POST/api/v1/model/uploadMedia

アップロード例

import requests

url = "https://api.atlascloud.ai/api/v1/model/uploadMedia"
headers = { "Authorization": "Bearer $ATLASCLOUD_API_KEY" }

with open("image.png", "rb") as f:
    files = {"file": ("image.png", f, "image/png")}
    response = requests.post(url, headers=headers, files=files)

result = response.json()
download_url = result["data"]["download_url"]
print(f"File URL: {download_url}")

レスポンス

{
  "data": {
    "download_url": "https://storage.atlascloud.ai/uploads/abc123/image.png",
    "file_name": "image.png",
    "content_type": "image/png",
    "size": 1024000
  }
}

入力 Schema

以下のパラメータがリクエストボディで使用できます。

合計: 11必須: 4任意: 7

modelstringrequired

model name

Default: "alibaba/wan-2.6/image-to-video-flash"

audiostring

Audio URL to guide generation (optional).

durationinteger

The duration of the generated media in seconds.

Default: 5Min: 5Max: 15

enable_prompt_expansionboolean

If set to true, the prompt optimizer will be enabled.

Default: true

imagestringrequired

The image for generating the output. URL or base64 encoded image data.

negative_promptstring

Negative prompt for the generation.

promptstringrequired

The prompt for generating the output.

resolutionstringrequired

The resolution of the generated video.

Default: "720p"

720p1080p

seedinteger

The random seed to use for the generation. -1 means a random seed will be used.

Default: -1

shot_typestring

Generate video in multi camera angles, only works when set enable_prompt_expansion to true.

Default: "multi"

multisingle

generate_audioboolean

Whether to automatically add audio to the generated video.

Default: true

リクエストボディの例

{
  "model": "alibaba/wan-2.6/image-to-video-flash",
  "duration": 5,
  "enable_prompt_expansion": true,
  "image": "example_image",
  "prompt": "A beautiful landscape",
  "resolution": "720p",
  "seed": -1,
  "shot_type": "multi",
  "generate_audio": true
}

出力 Schema

API は生成された出力 URL を含む予測レスポンスを返します。

created_atstring

ISO timestamp of when the request was created (e.g., “2023-04-01T12:34:56.789Z”).

idstring

Unique identifier for the prediction, the ID of the prediction to get.

modelstring

Model ID used for the prediction.

outputsarray

Array of URLs to the generated content (empty when status is not completed).

statusstring

Status of the task: created, processing, completed, or failed.

レスポンス例

{
  "id": "pred_abc123",
  "status": "completed",
  "model": "model-name",
  "outputs": [
    "https://storage.atlascloud.ai/outputs/result.mp4"
  ],
  "metrics": {
    "predict_time": 45.2
  },
  "created_at": "2025-01-01T00:00:00Z",
  "completed_at": "2025-01-01T00:00:10Z"
}

Atlas Cloud Skills

Atlas Cloud Skills は 400 以上の AI モデルを AI コーディングアシスタントに直接統合します。ワンコマンドでインストールし、自然言語で画像・動画生成や LLM との対話が可能です。

対応クライアント

Claude Code

OpenAI Codex

Gemini CLI

Cursor

Windsurf

VS Code

Trae

GitHub Copilot

Cline

Roo Code

Amp

Goose

Replit

40+ 対応クライアント

インストール

npx skills add AtlasCloudAI/atlas-cloud-skills

API キーの設定

Atlas Cloud ダッシュボードから API キーを取得し、環境変数として設定してください。

export ATLASCLOUD_API_KEY="your-api-key-here"

機能

インストール後、AI アシスタントで自然言語を使用してすべての Atlas Cloud モデルにアクセスできます。

画像生成Nano Banana 2、Z-Image などのモデルで画像を生成します。

動画作成Kling、Vidu、Veo などでテキストや画像から動画を作成します。

LLM チャットQwen、DeepSeek などの大規模言語モデルと対話します。

メディアアップロード画像編集や画像から動画へのワークフロー用にローカルファイルをアップロードします。

詳細を見る

github.com/AtlasCloudAI/atlas-cloud-skills

MCP Server

Atlas Cloud MCP Server は Model Context Protocol を通じて IDE と 400 以上の AI モデルを接続します。MCP 対応のあらゆるクライアントで動作します。

対応クライアント

Cursor

VS Code

Windsurf

Claude Code

OpenAI Codex

Gemini CLI

Cline

Roo Code

100+ 対応クライアント

インストール

npx -y atlascloud-mcp

設定

以下の設定を IDE の MCP 設定ファイルに追加してください。

{
  "mcpServers": {
    "atlascloud": {
      "command": "npx",
      "args": [
        "-y",
        "atlascloud-mcp"
      ],
      "env": {
        "ATLASCLOUD_API_KEY": "your-api-key-here"
      }
    }
  }
}

利用可能なツール

atlas_generate_imageテキストプロンプトから画像を生成します。

atlas_generate_videoテキストや画像から動画を作成します。

atlas_chat大規模言語モデルと対話します。

atlas_list_models400 以上の利用可能な AI モデルを閲覧します。

atlas_quick_generate最適なモデルを自動選択し、ワンステップでコンテンツを作成。

atlas_upload_mediaAPI ワークフロー用にローカルファイルをアップロードします。

詳細を見る

github.com/AtlasCloudAI/mcp-server

APIスキーマ

{
  "info": {
    "title": "AtlasCloud API",
    "version": "1.0.0",
    "description": "The AtlasCloud API."
  },
  "openapi": "3.0.0",
  "paths": {
    "/api/v1/model/generateVideo": {
      "post": {
        "requestBody": {
          "content": {
            "application/json": {
              "schema": {
                "$ref": "#/components/schemas/Input"
              }
            }
          },
          "required": true
        },
        "responses": {
          "200": {
            "content": {
              "application/json": {
                "schema": {
                  "$ref": "#/components/schemas/PredictionResponse"
                }
              }
            },
            "description": "The request status."
          }
        }
      },
      "x-api-name": "model_run"
    },
    "/api/v1/model/prediction/{request_id}": {
      "get": {
        "parameters": [
          {
            "in": "path",
            "name": "request_id",
            "required": true,
            "schema": {
              "description": "Request ID",
              "type": "string"
            }
          }
        ],
        "responses": {
          "200": {
            "content": {
              "application/json": {
                "schema": {
                  "$ref": "#/components/schemas/PredictionResponse"
                }
              }
            },
            "description": "Result of the request."
          }
        }
      },
      "x-api-name": "model_result"
    }
  },
  "components": {
    "schemas": {
      "Input": {
        "properties": {
          "model": {
            "type": "string",
            "description": "model name",
            "default": "alibaba/wan-2.6/image-to-video-flash"
          },
          "audio": {
            "description": "Audio URL to guide generation (optional).",
            "type": "string"
          },
          "duration": {
            "maximum": 15,
            "minimum": 5,
            "default": 5,
            "description": "The duration of the generated media in seconds.",
            "type": "integer"
          },
          "enable_prompt_expansion": {
            "default": true,
            "description": "If set to true, the prompt optimizer will be enabled.",
            "type": "boolean"
          },
          "image": {
            "description": "The image for generating the output. URL or base64 encoded image data.",
            "type": "string"
          },
          "negative_prompt": {
            "description": "Negative prompt for the generation.",
            "type": "string"
          },
          "prompt": {
            "description": "The prompt for generating the output.",
            "type": "string"
          },
          "resolution": {
            "default": "720p",
            "description": "The resolution of the generated video.",
            "enum": [
              "720p",
              "1080p"
            ],
            "type": "string",
            "x-ui-component": "select"
          },
          "seed": {
            "default": -1,
            "description": "The random seed to use for the generation. -1 means a random seed will be used.",
            "type": "integer"
          },
          "shot_type": {
            "default": "multi",
            "description": "Generate video in multi camera angles, only works when set enable_prompt_expansion to true.",
            "enum": [
              "multi",
              "single"
            ],
            "type": "string"
          },
          "generate_audio": {
            "default": true,
            "description": "Whether to automatically add audio to the generated video.",
            "type": "boolean"
          }
        },
        "required": [
          "model",
          "image",
          "prompt",
          "resolution"
        ],
        "type": "object",
        "allOf": [
          {
            "if": {
              "properties": {
                "shot_type": {
                  "const": "multi"
                }
              },
              "required": [
                "shot_type"
              ]
            },
            "then": {
              "properties": {
                "enable_prompt_expansion": {
                  "enum": [
                    true
                  ]
                }
              },
              "required": [
                "enable_prompt_expansion"
              ]
            }
          }
        ],
        "x-order-properties": [
          "model",
          "image",
          "audio",
          "prompt",
          "negative_prompt",
          "resolution",
          "duration",
          "enable_prompt_expansion",
          "shot_type",
          "generate_audio",
          "seed"
        ]
      },
      "PredictionResponse": {
        "properties": {
          "created_at": {
            "description": "ISO timestamp of when the request was created (e.g., “2023-04-01T12:34:56.789Z”).",
            "format": "date-time",
            "type": "string"
          },
          "has_nsfw_contents": {
            "description": "Array of boolean values indicating NSFW detection for each output.",
            "items": {
              "type": "boolean"
            },
            "type": "array"
          },
          "id": {
            "description": "Unique identifier for the prediction, the ID of the prediction to get.",
            "type": "string"
          },
          "model": {
            "description": "Model ID used for the prediction.",
            "type": "string"
          },
          "outputs": {
            "description": "Array of URLs to the generated content (empty when status is not completed).",
            "items": {
              "type": "object"
            },
            "type": "array"
          },
          "status": {
            "description": "Status of the task: created, processing, completed, or failed.",
            "type": "string"
          },
          "urls": {
            "description": "Object containing related API endpoints.",
            "type": "object"
          }
        },
        "type": "object"
      }
    }
  },
  "servers": [
    {
      "url": "https://api.atlascloud.ai"
    }
  ]
}

LLMフレンドリーなプロンプトテンプレート

# alibaba/wan-2.6/image-to-video-flash

> Wan2.6 image to video flash, faster and more cost-effective generation. Intelligent shot scheduling enables multi‑camera storytelling, supports stable multi‑speaker dialogue with more natural and realistic vocal timbres.


## Overview

- **Submit endpoint (POST)**: `https://api.atlascloud.ai/api/v1/model/generateVideo` — start an async generation; returns a `prediction_id`
- **Poll endpoint (GET)**: `https://api.atlascloud.ai/api/v1/model/prediction/{prediction_id}` — poll this until the prediction finishes
- **Model ID**: `alibaba/wan-2.6/image-to-video-flash`


## API Information

This model can be used via our HTTP API or more conveniently via our client libraries.
See the input and output schema below, as well as the usage examples.


### Input Schema

The API accepts the following input parameters:

- **`model`** (`string`, _required_):
  model name
  - Default: `"alibaba/wan-2.6/image-to-video-flash"`

- **`image`** (`string`, _required_):
  The image for generating the output. URL or base64 encoded image data.

- **`audio`** (`string`, _optional_):
  Audio URL to guide generation (optional).

- **`prompt`** (`string`, _required_):
  The prompt for generating the output.

- **`negative_prompt`** (`string`, _optional_):
  Negative prompt for the generation.

- **`resolution`** (`string`, _required_):
  The resolution of the generated video.
  - Default: `"720p"`
  - Options: "720p", "1080p"

- **`duration`** (`integer`, _optional_):
  The duration of the generated media in seconds.
  - Default: `5`
  - Min: 5
  - Max: 15

- **`enable_prompt_expansion`** (`boolean`, _optional_):
  If set to true, the prompt optimizer will be enabled.
  - Default: `true`

- **`shot_type`** (`string`, _optional_):
  Generate video in multi camera angles, only works when set enable_prompt_expansion to true.
  - Default: `"multi"`
  - Options: "multi", "single"

- **`generate_audio`** (`boolean`, _optional_):
  Whether to automatically add audio to the generated video.
  - Default: `true`

- **`seed`** (`integer`, _optional_):
  The random seed to use for the generation. -1 means a random seed will be used.
  - Default: `-1`



**Required Parameters Example**:

```json
{
  "model": "alibaba/wan-2.6/image-to-video-flash",
  "image": "",
  "prompt": "",
  "resolution": "720p"
}
```


**Full Example**:

```json
{
  "model": "alibaba/wan-2.6/image-to-video-flash",
  "image": "",
  "audio": "",
  "prompt": "",
  "negative_prompt": "",
  "resolution": "720p",
  "duration": 5,
  "enable_prompt_expansion": true,
  "shot_type": "multi",
  "generate_audio": true,
  "seed": -1
}
```


### Output Schema

The API returns the following output format:


- **`created_at`** (`string`, _optional_):
  ISO timestamp of when the request was created (e.g., “2023-04-01T12:34:56.789Z”).

- **`has_nsfw_contents`** (`array[boolean]`, _optional_):
  Array of boolean values indicating NSFW detection for each output.

- **`id`** (`string`, _optional_):
  Unique identifier for the prediction, the ID of the prediction to get.

- **`model`** (`string`, _optional_):
  Model ID used for the prediction.

- **`outputs`** (`array[object]`, _optional_):
  Array of URLs to the generated content (empty when status is not completed).

- **`status`** (`string`, _optional_):
  Status of the task: created, processing, completed, or failed.

- **`urls`** (`object`, _optional_):
  Object containing related API endpoints.



**Example Response**:

```json
{
  "created_at": "",
  "has_nsfw_contents": [],
  "id": "",
  "model": "",
  "outputs": [],
  "status": "",
  "urls": {}
}
```


## Usage Examples

### cURL

```bash
# Step 1: Start generation (async)
curl -X POST "https://api.atlascloud.ai/api/v1/model/generateVideo" \
  -H "Authorization: Bearer $ATLASCLOUD_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{
  "model": "alibaba/wan-2.6/image-to-video-flash",
  "image": "",
  "audio": "",
  "prompt": "",
  "negative_prompt": "",
  "resolution": "720p",
  "duration": 5,
  "enable_prompt_expansion": true,
  "shot_type": "multi",
  "generate_audio": true,
  "seed": -1
}'

# Response will contain: {"code": 200, "data": {"id": "prediction_id", "status": "processing"}}

# Step 2: Poll for result (replace {prediction_id} with the id returned above)
curl -X GET "https://api.atlascloud.ai/api/v1/model/prediction/{prediction_id}" \
  -H "Authorization: Bearer $ATLASCLOUD_API_KEY"

# Keep polling until status is "completed", "succeeded" or "failed"
# When completed, outputs will contain the generated content URL(s)
```

## Additional Resources

### Documentation

- [Model Playground](https://www.atlascloud.ai/models/alibaba/wan-2.6/image-to-video-flash)

A scene of urban fantasy art. A dynamic graffiti art character. A teenager, painted with spray paint, comes to life from a concrete wall. He raps at breakneck speed in an English rap while striking a classic, energetic rapper pose. The scene is set at night under a quaint urban railway bridge. Light comes from a lone streetlamp, creating a cinematic atmosphere, full of high energy and stunning detail. The audio of the video consists entirely of his rap, with no other dialogue or background noise.

読み込み中...

🎬マルチショット動画生成

Wan 2.6プロフェッショナルマルチショットAI動画制作

Alibabaの最新AI動画生成技術の飛躍的進化。マルチショットストーリーテリング、リファレンス駆動のキャラクター一貫性、ネイティブオーディオビジュアル同期を備えた最大15秒の1080p動画を作成。ストーリーボードロジックを真に理解した初のシネマティックナラティブモデル。

革命的なブレークスルー

Wan 2.6がAI動画生成のゲームチェンジャーである理由

マルチショットストーリーテリング

ストーリーボードロジックを理解する初のモデル。シーン変更を通じてキャラクターの外観と環境の一貫性を維持しながら、一貫したトランジションを持つ連続ショットを自動生成—単一の15秒生成で完全なストーリーアークを実現。

リファレンス動画変換(R2V)

2〜30秒のリファレンス動画をアップロードして、キャラクターの外観、動きパターン、音声特性を抽出・保存。複数の動画にわたって前例のない精度で一貫したキャラクターパフォーマンスを作成。

正確なテキストレンダリング

製品パッケージ、看板、ブランドコンテンツ向けの業界最先端のテキストレンダリング機能。動画フレーム内に明瞭で読みやすいテキストを生成—マーケティングと商用アプリケーションに不可欠。

コア機能

15秒の長時間生成

完全な「三幕構成」（設定→展開→解決）を持つ最大15秒の動画を生成

プロフェッショナル1080p品質

シネマティック品質と強化された視覚安定性を備えた24fpsのネイティブ1080p出力

ネイティブオーディオ同期

セリフと口の動きが一致し、テンポに合わせた背景音楽、効果音も完璧なタイミングで再生

キャラクター一貫性

ショットと複数の動画を通じてキャラクターの外観、衣装、アイデンティティを維持

シネマティックカメラコントロール

パン、ズーム、トラッキングショット、ドリー移動を含むプロフェッショナルカメラムーブメント

柔軟なアスペクト比

16:9（YouTube）、9:16（リール）、1:1（スクエア）—ポストプロダクションクロッピング不要のプラットフォーム最適化

Wan 2.6 vs Wan 2.5:主要な改善点

最新リリースの新機能をご覧ください

動画時間

最大15秒

Wan 2.5:最大10秒

マルチショット機能

ストーリーボードロジックを理解

Wan 2.5:単一ショットまたは乱雑なモーフィング

リファレンス動画サポート

R2V モードで完全保持

Wan 2.5:画像リファレンスのみ

キャラクター一貫性

ショット間で優れた性能

Wan 2.5:キャラクターのドリフト問題

モーション安定性

ジッターとアーティファクトを削減

Wan 2.5:時折フレームドリフト

プロンプト理解

複雑なマルチキャラクターシーン

Wan 2.5:基本的なシーン生成

3つの専門生成モード

クリエイティブワークフローに適したモードを選択

テキストto動画(T2V)

最も人気

強化されたマルチショットセグメンテーションと改善されたプロンプト処理を備えたテキストプロンプトから完全な動画を生成。ストーリーテリングとクリエイティブ探求に最適。

単一プロンプトからの自動ショットセグメンテーション
マルチキャラクターインタラクション理解
カメラムーブメントと感情的手がかり
環境ディテール保存

画像to動画(I2V)

強化版

モーションコヒーレンスを改善して静止画像をモーション動画に変換。製品ショーケース、写真アニメーション、ビジュアルストーリーテリングに最適。

製品向けの正確なテキストレンダリング
フレーム間のスタイル一貫性
静止画像からの自然なモーション
ナラティブ駆動のビジュアル最適化

リファレンス動画変換(R2V)

新機能

リファレンス動画（2〜30秒）をアップロードして、キャラクターの外観、動きパターン、音声を保存。キャラクター駆動コンテンツの最強の一貫性保証。

完全なキャラクターアイデンティティ保存
音声特性抽出
動きパターンの複製
マルチキャラクター共演シーン

最適な用途

マーケティング&広告

テキストレンダリング付き製品デモ、キャラクター一貫性のあるブランドキャンペーン、プロモーション動画

コンテンツ制作

YouTube動画、ソーシャルメディアリール、マルチショットストーリーテリング、動画編集ワークフロー

eコマース

正確なテキスト付きの製品ショーケース、チュートリアル動画、お客様の声の再現

教育&トレーニング

教育コンテンツ、コース教材、マルチシーン教育ナラティブ

エンターテインメント

短編映画、キャラクター駆動ストーリー、シネマティックシーケンス、クリエイティブ実験

プリビジュアライゼーション

映画コンセプト開発、ストーリーボード作成、制作のシーンプランニング

Wan 2.6 T2V、I2V、R2V API統合

テキストto動画、画像to動画、リファレンス動画変換の完全APIスイート

テキストto動画API(T2V API)

当社のWan 2.6 T2V APIは、テキストプロンプトを自動シーンセグメンテーション付きのマルチショットシネマティック動画に変換。ネイティブオーディオ同期を備えた最大15秒のプロフェッショナル1080p動画を生成。

単一プロンプトからのマルチショットストーリーテリング

三幕構成による 15 秒の長さ

複雑なシーンの強化されたプロンプト理解

柔軟なアスペクト比:16:9、9:16、1:1

画像to動画API(I2V API)

当社のWan 2.6 I2V APIは、正確なモーションコントロールとテキストレンダリングで静止画像に命を吹き込みます。製品動画、写真アニメーション、ブランドコンテンツ制作に最適。

製品と看板の正確なテキストレンダリング

アニメーションフレーム間のスタイル一貫性

改善されたコヒーレンスを持つ自然なモーション

ナラティブ最適化されたビジュアル出力

リファレンス動画変換API(R2V API)

当社のWan 2.6 R2V APIは、リファレンス動画からキャラクターアイデンティティを保存。外観、音声、動きパターンを抽出して一貫したキャラクター生成を実現する2〜30秒のクリップをアップロード。

キャラクター外観とアイデンティティの保存

音声特性の抽出と複製

動きパターンの分析と再現

マルチキャラクターシーンのサポート

💡

完全なAPIスイート

Wan 2.6 の 3 つの API モード(T2V API、I2V API、R2V API)はすべて RESTful アーキテクチャに対応し、充実したドキュメントを備えています。Python や Node.js などの SDK ですぐに開始可能。各エンドポイントにはネイティブな音声・映像同期と完全な商用利用権が含まれます。

Wan 2.6の始め方

2つのシンプルなパスで数分でプロフェッショナル動画作成を開始

API統合

アプリケーションを構築する開発者向け

サインアップ&ログイン

Atlas Cloudアカウントを作成するか、ログインしてコンソールにアクセス

支払い方法の追加

請求セクションでクレジットカードを紐付けてアカウントに入金

APIキーの生成

コンソール→APIキーに移動して認証キーを作成

構築開始

T2V、I2V、またはR2V APIエンドポイントを使用してWan 2.6をアプリケーションに統合

Playground体験

クイックテストと実験向け

サインアップ&ログイン

Atlas Cloudアカウントを作成するか、ログインしてプラットフォームにアクセス

支払い方法の追加

請求セクションでクレジットカードを紐付けて開始

Playgroundを使用

Wan 2.6 playgroundに移動し、T2V/I2V/R2Vモードを選択して即座に動画を生成

💡

プロのヒント: まずは Playground でさまざまな生成モードを試し、ユースケースに最適なものを見極めてから、本番運用に向けて対応する API を統合しましょう。

よくある質問

Wan 2.6のマルチショット機能の独自性は何ですか?

Wan 2.6は、ストーリーボードロジックを真に理解する初のモデルです。乱雑な「モーフィング」効果を生み出したWan 2.5とは異なり、Wan 2.6は単一のプロンプトを一貫したトランジションを持つ複数の明確なショットに自動的にセグメント化し、シーン変更を通じてキャラクターの一貫性を維持できます。

リファレンス動画変換(R2V)はどのように機能しますか?

2〜30秒のリファレンス動画をアップロードすると、Wan 2.6はキャラクターの外観、動きパターン、音声特性を抽出します。その後、同じキャラクターをフィーチャーした新しい動画を一貫したアイデンティティで生成できます—キャラクター駆動のコンテンツシリーズの作成に最適です。

サポートされている動画形式と時間は?

Wan 2.6は、5〜15秒の時間で24fpsの1080p動画を生成します。サポートされているアスペクト比には、16:9(YouTube)、9:16(Instagram Reels/TikTok)、1:1(スクエアフォーマット)が含まれ、各プラットフォーム向けに最適化されており、ポストプロダクションクロッピングは不要です。

Wan 2.6は動画内でテキストをレンダリングできますか?

はい!Wan 2.6は、製品パッケージ、看板、ブランドコンテンツ向けの業界最先端のテキストレンダリングを備えています。モデルは動画フレーム内に明瞭で読みやすいテキストを生成できます—これはSeedanceとほとんどの競合他社が欠いている重要な機能です。

T2V、I2V、R2Vモードの違いは何ですか?

T2V(テキストto動画)は、マルチショット機能を備えたテキストプロンプトから生成します。I2V(画像to動画)は、正確なテキストレンダリングで静止画像をアニメーション化します。R2V(リファレンス動画変換)は、動画リファレンスを使用して生成間でキャラクターアイデンティティを保存します。入力タイプと一貫性のニーズに基づいて選択してください。

生成された動画の商用権はありますか?

はい!すべてのWan 2.6作成には完全な商用利用権が付属します。動画は、追加のライセンス要件なしに、マーケティングキャンペーン、クライアント成果物、ブランドコンテンツ、商用アプリケーション向けに本番レディです。

Atlas CloudでWan 2.6を使用する理由

プロフェッショナル動画生成ワークフロー向けのエンタープライズグレードインフラストラクチャを活用

専用インフラストラクチャ

要求の厳しいAI動画ワークロード向けに特別に最適化されたインフラストラクチャにWan 2.6のマルチショット生成とR2V機能を展開。1080p 15秒生成の最大パフォーマンス。

すべてのモデル向け統一API

1 つの統一 API を通じて、Wan 2.6(T2V、I2V、R2V)と 400 以上の AI モデル(LLM、画像、動画、音声)にアクセス。一貫した認証で、あらゆる生成 AI のニーズを単一の統合で実現します。

競争力のある価格

従量課金制の透明な料金体系で、AWS と比べて最大 70% のコスト削減。隠れた費用や契約の縛りはなく、プロトタイプから本番環境まで無理なくスケールできます。

SOC 2認定セキュリティ

SOC 2認定とHIPAAコンプライアンスでリファレンス動画と生成コンテンツを保護。暗号化された伝送とストレージを備えたエンタープライズグレードのセキュリティ。

99.9%稼働時間SLA

保証された99.9%稼働時間を備えたエンタープライズグレードの信頼性。Wan 2.6マルチショット動画生成は、本番キャンペーンと重要なコンテンツワークフローで常に利用可能。

簡単な統合

REST APIと多言語SDK(Python、Node.js、Go)で数分で完全統合。統一されたエンドポイント構造でT2V、I2V、R2Vモード間をシームレスに切り替え。

99.9%

稼働時間

70%

AWS比低コスト

400+

生成AIモデル

24/7

プロサポート

技術仕様

Architecture

マルチモーダル理解を備えた高度なTransformer

Resolution

1080p(フルHD)

Frame Rate

24 FPS

Duration

5〜15 秒(モードにより異なる)

Aspect Ratios

16:9、9:16、1:1

Generation Modes

T2V、I2V、R2V

Audio

リップシンク付きネイティブ同期

Commercial Rights

完全な商用利用が含まれます

プロフェッショナルマルチショット動画生成を体験

Wan 2.6の画期的なマルチショットストーリーテリングとキャラクター一貫性機能で動画制作を革新している世界中のコンテンツクリエーター、マーケター、映画製作者に参加してください。

Alibaba WAN 2.6 Image-to-Video Flash

Alibaba WAN 2.6 Image-to-Video Flash is an advanced image-to-video model on Alibaba Cloud’s DashScope. It generates high-quality videos from images and supports output resolutions of 720p and 1080p.

What makes it stand out?

More affordable: Wan 2.6 is more streamlined and cost-effective - reducing creator expenses and offering more options.
One-pass A/V sync: Wan 2.6 creates a fully synchronized video (audio/voiceover + lip-sync) from a single, well-structured prompt - no separate recording or manual alignment required.
Multilingual friendly: Wan 2.6 reliably processes like Chinese prompts for A/V-synced videos.
Longer duration & more video size options: Wan 2.6 delivers up to 15 seconds and 6 aspect/size options, enabling more storytelling room and publishing flexibility.
Multi-shot storytelling: Generates cohesive multi-shot narratives, keeping key details consistent across shots and offering auto shot-split for simple prompts.
Video reference generation: Uses a reference video's appearance and voice to guide new videos; supports human or arbitrary subjects, single or dual performers.
15s long videos: Produces videos up to 15 seconds, expanding temporal capacity for richer storytelling.
Flexible Duration Support: Supports generating videos of any duration from 2 to 15 seconds.

Designed For

Marketing teams: Fast, polished demos/tutorials—low cost, consistent style.
Global enterprises: Multilingual, lip-synced videos with subtitles for efficient localization.
Storytellers & YouTubers: Immersive narratives while maintaining cadence and quality—driving growth.
Corporate training teams: HD videos over docs—clearer key points, better communication.

Pricing

The table below lists prices for easy comparsion.

Output Resolution	Duration (5s)	Duration (10s)
720p	$0.25	$0.5
1080p	$0.375	$0.75

Silent Video Generation: Supports generating silent videos at 50% of the standard price (e.g., $0.125 for 720p/5s).

Billing Rules

Minimum charge: 2 seconds
Per-second rate = (price per 5 seconds) ÷ 5
Total cost = billed duration × per-second rate (by output resolution)

類似モデルを探索

NEW

リファレンスから動画

HappyHorse-1.1 Reference-to-video

Generates videos from one to nine reference images and a text prompt, supporting 720P or 1080P output, flexible aspect ratios, and durations from 3 to 15 seconds.

HappyHorse-1.1 Text-to-video

Generates videos from text prompts with HappyHorse 1.1, supporting 720P or 1080P output, flexible aspect ratios, and durations from 3 to 15 seconds.

HappyHorse-1.1 Image-to-video

Animates a first-frame image into video with optional prompt guidance, 720P or 1080P output, and durations from 3 to 15 seconds.

HappyHorse-1.0 Image-to-video

Animates a first-frame image into video with optional prompt guidance, 720P or 1080P output, and durations from 3 to 15 seconds.

HappyHorse-1.0 Text-to-video

Generates videos from text prompts with HappyHorse 1.0, supporting 720P or 1080P output, flexible aspect ratios, and durations from 3 to 15 seconds.

HappyHorse-1.0 Video-edit

Edits an input video with text instructions and optional reference images, supporting 720P or 1080P output.

HappyHorse-1.0 Reference-to-video

Generates videos from one to nine reference images and a text prompt, supporting 720P or 1080P output, flexible aspect ratios, and durations from 3 to 15 seconds.

Wan-2.7 Image-to-video

Animates images into videos with first-frame, first-and-last-frame, video continuation, and audio-driven modes.

Wan-2.7 Text-to-video

Generates videos from text prompts with multi-shot narrative, audio generation, and sound-image synchronization.

Wan-2.7 Video-edit

Edits videos using text instructions, reference images, and style transfer with multi-modal input support.

Wan-2.7 Reference-to-video

Generates character-driven videos from reference images and videos, with multi-subject and voice-cloning support.

Wan-2.2 Image-to-video Lora

Open and Advanced Large-Scale Video Generative Models.

Wan-2.2 Image-to-video

Open and Advanced Large-Scale Video Generative Models.

From

$0.03/秒

画像から動画

Wan-2.2-spicy Image-to-video

Open and Advanced Large-Scale Video Generative Models.

From

$0.03/秒

画像から動画

Wan-2.2-spicy Image-to-video Lora

Open and Advanced Large-Scale Video Generative Models.

Wan-2.6 Video-to-video

A speed-optimized video-to-video option that prioritizes lower latency while retaining strong visual fidelity. Ideal for iteration, batch generation, and prompt testing.

From$0.1/秒

$0.07/秒

-30%

ひとつのAPIで、あらゆるメディアAIを。

すべてのモデルを探索