alibaba/wan-2.5/text-to-image

텍스트를 이미지로

Wan 2.5 Text-to-Image API by Alibaba

alibaba/wan-2.5/text-to-image

Text-to-image

Generate AI images with Alibaba WAN 2.5 text-to-image model.

입력

프롬프트 *

네거티브 프롬프트

크기

프롬프트 확장

시드

출력

대기

생성된 이미지가 여기에 표시됩니다

설정을 구성하고 실행을 클릭하여 시작하세요

요청당 $0.021가 소요됩니다. $10로 이 모델을 약 476번 실행할 수 있습니다.

다음으로 할 수 있는 작업:

이미지를 비디오로 이미지를 이미지로

파라미터

코드 예시
import requests
import time

# Step 1: Start image generation
generate_url = "https://api.atlascloud.ai/api/v1/model/generateImage"
headers = {
    "Content-Type": "application/json",
    "Authorization": "Bearer $ATLASCLOUD_API_KEY"
}
data = {
    "model": "alibaba/wan-2.5/text-to-image",  # Required. model name
    "enable_prompt_expansion": False,  # If set to true, the prompt optimizer will be enabled
    "negative_prompt": "example_value",  # Negative prompt for the generation
    "prompt": "A beautiful landscape with mountains and lake",  # Required. The prompt for generating the image
    "seed": -1,  # The random seed to use for the generation
    "size": "1280*1280",  # The size of the generated image in pixels (width*height)
}

generate_response = requests.post(generate_url, headers=headers, json=data)
generate_result = generate_response.json()
prediction_id = generate_result["data"]["id"]

# Step 2: Poll for result
poll_url = f"https://api.atlascloud.ai/api/v1/model/prediction/{prediction_id}"

def check_status():
    while True:
        response = requests.get(poll_url, headers={"Authorization": "Bearer $ATLASCLOUD_API_KEY"})
        result = response.json()

        if result["data"]["status"] == "completed":
            print("Generated image:", result["data"]["outputs"][0])
            return result["data"]["outputs"][0]
        elif result["data"]["status"] == "failed":
            raise Exception(result["data"]["error"] or "Generation failed")
        else:
            # Still processing, wait 2 seconds
            time.sleep(2)

image_url = check_status()

설치

사용하는 언어에 필요한 패키지를 설치하세요.

pip install requests

인증

모든 API 요청에는 API 키를 통한 인증이 필요합니다. Atlas Cloud 대시보드에서 API 키를 받을 수 있습니다.

export ATLASCLOUD_API_KEY="your-api-key-here"

HTTP 헤더

import os

API_KEY = os.environ.get("ATLASCLOUD_API_KEY")
headers = {
    "Content-Type": "application/json",
    "Authorization": f"Bearer {API_KEY}"
}

API 키를 안전하게 보관하세요

클라이언트 측 코드나 공개 저장소에 API 키를 노출하지 마세요. 대신 환경 변수 또는 백엔드 프록시를 사용하세요.

요청 제출

import requests

url = "https://api.atlascloud.ai/api/v1/model/generateImage"
headers = {
    "Content-Type": "application/json",
    "Authorization": "Bearer $ATLASCLOUD_API_KEY"
}
data = {
    "model": "your-model",
    "prompt": "A beautiful landscape"
}

response = requests.post(url, headers=headers, json=data)
print(response.json())

요청 제출

비동기 생성 요청을 제출합니다. API는 상태 확인 및 결과 조회에 사용할 수 있는 예측 ID를 반환합니다.

POST/api/v1/model/generateImage

요청 본문

import requests

url = "https://api.atlascloud.ai/api/v1/model/generateImage"
headers = {
    "Content-Type": "application/json",
    "Authorization": "Bearer $ATLASCLOUD_API_KEY"
}

data = {
    "model": "alibaba/wan-2.5/text-to-image",
    "prompt": "A beautiful landscape with mountains and lake"
}

response = requests.post(url, headers=headers, json=data)
result = response.json()

print(f"Prediction ID: {result['data']['id']}")
print(f"Status: {result['data']['status']}")

응답

{
  "code": 200,
  "data": {
    "id": "pred_abc123",
    "status": "processing",
    "model": "model-name",
    "created_at": "2025-01-01T00:00:00Z"
  }
}

상태 확인

예측 엔드포인트를 폴링하여 요청의 현재 상태를 확인합니다.

GET/api/v1/model/prediction/{prediction_id}

폴링 예시

import requests
import time

prediction_id = "pred_abc123"
url = f"https://api.atlascloud.ai/api/v1/model/prediction/{prediction_id}"
headers = { "Authorization": "Bearer $ATLASCLOUD_API_KEY" }

while True:
    response = requests.get(url, headers=headers)
    result = response.json()
    status = result["data"]["status"]
    print(f"Status: {status}")

    if status in ["completed", "succeeded"]:
        output_url = result["data"]["outputs"][0]
        print(f"Output URL: {output_url}")
        break
    elif status == "failed":
        print(f"Error: {result['data'].get('error', 'Unknown')}")
        break

    time.sleep(3)

상태 값

processing요청이 아직 처리 중입니다.

completed생성이 완료되었습니다. 출력을 사용할 수 있습니다.

succeeded생성이 성공했습니다. 출력을 사용할 수 있습니다.

failed생성에 실패했습니다. 오류 필드를 확인하세요.

완료 응답

{
  "data": {
    "id": "pred_abc123",
    "status": "completed",
    "outputs": [
      "https://storage.atlascloud.ai/outputs/result.png"
    ],
    "metrics": {
      "predict_time": 8.3
    },
    "created_at": "2025-01-01T00:00:00Z",
    "completed_at": "2025-01-01T00:00:10Z"
  }
}

파일 업로드

Atlas Cloud 스토리지에 파일을 업로드하고 API 요청에 사용할 수 있는 URL을 받습니다. multipart/form-data를 사용하여 업로드합니다.

POST/api/v1/model/uploadMedia

업로드 예시

import requests

url = "https://api.atlascloud.ai/api/v1/model/uploadMedia"
headers = { "Authorization": "Bearer $ATLASCLOUD_API_KEY" }

with open("image.png", "rb") as f:
    files = {"file": ("image.png", f, "image/png")}
    response = requests.post(url, headers=headers, files=files)

result = response.json()
download_url = result["data"]["download_url"]
print(f"File URL: {download_url}")

응답

{
  "data": {
    "download_url": "https://storage.atlascloud.ai/uploads/abc123/image.png",
    "file_name": "image.png",
    "content_type": "image/png",
    "size": 1024000
  }
}

입력 Schema

다음 파라미터를 요청 본문에서 사용할 수 있습니다.

전체: 6필수: 2선택: 4

modelstringrequired

model name

Default: "alibaba/wan-2.5/text-to-image"

enable_prompt_expansionboolean

If set to true, the prompt optimizer will be enabled.

Default: false

negative_promptstring

Negative prompt for the generation.

promptstringrequired

The prompt for generating the image.

seedinteger

The random seed to use for the generation. -1 means a random seed will be used.

Default: -1

sizestring

The size of the generated image in pixels (width*height).

Default: "1280*1280"

576*1344720*1280720*1680768*1024800*1200936*2184960*1280960*14401024*7681024*10241080*19201168*17521200*8001200*16001224*16321280*7201280*9601280*12801344*5761440*9601440*14401600*12001632*12241680*7201752*11681920*10802184*936

요청 본문 예시

{
  "model": "alibaba/wan-2.5/text-to-image",
  "enable_prompt_expansion": false,
  "prompt": "A beautiful landscape",
  "seed": -1,
  "size": "1280*1280"
}

출력 Schema

API는 생성된 출력 URL이 포함된 예측 응답을 반환합니다.

created_atstring

ISO timestamp of when the request was created (e.g., “2023-04-01T12:34:56.789Z”).

idstring

Unique identifier for the prediction, the ID of the prediction to get.

modelstring

Model ID used for the prediction.

outputsarray

Array of URLs to the generated content (empty when status is not completed).

statusstring

Status of the task: created, processing, completed, or failed.

응답 예시

{
  "id": "pred_abc123",
  "status": "completed",
  "model": "model-name",
  "outputs": [
    "https://storage.atlascloud.ai/outputs/result.png"
  ],
  "metrics": {
    "predict_time": 8.3
  },
  "created_at": "2025-01-01T00:00:00Z",
  "completed_at": "2025-01-01T00:00:10Z"
}

Atlas Cloud Skills

Atlas Cloud Skills는 400개 이상의 AI 모델을 AI 코딩 어시스턴트에 직접 통합합니다. 한 번의 명령으로 설치하고 자연어로 이미지, 동영상 생성 및 LLM과 대화할 수 있습니다.

지원 클라이언트

Claude Code

OpenAI Codex

Gemini CLI

Cursor

Windsurf

VS Code

Trae

GitHub Copilot

Cline

Roo Code

Amp

Goose

Replit

40+ 지원 클라이언트

설치

npx skills add AtlasCloudAI/atlas-cloud-skills

API 키 설정

Atlas Cloud 대시보드에서 API 키를 받아 환경 변수로 설정하세요.

export ATLASCLOUD_API_KEY="your-api-key-here"

기능

설치 후 AI 어시스턴트에서 자연어를 사용하여 모든 Atlas Cloud 모델에 접근할 수 있습니다.

이미지 생성Nano Banana 2, Z-Image 등의 모델로 이미지를 생성합니다.

동영상 제작Kling, Vidu, Veo 등으로 텍스트나 이미지에서 동영상을 만듭니다.

LLM 채팅Qwen, DeepSeek 등 대규모 언어 모델과 대화합니다.

미디어 업로드이미지 편집 및 이미지-동영상 변환 워크플로우를 위해 로컬 파일을 업로드합니다.

더 알아보기

github.com/AtlasCloudAI/atlas-cloud-skills

MCP Server

Atlas Cloud MCP Server는 Model Context Protocol을 통해 IDE와 400개 이상의 AI 모델을 연결합니다. MCP 호환 클라이언트에서 사용할 수 있습니다.

지원 클라이언트

Cursor

VS Code

Windsurf

Claude Code

OpenAI Codex

Gemini CLI

Cline

Roo Code

100+ 지원 클라이언트

설치

npx -y atlascloud-mcp

설정

다음 설정을 IDE의 MCP 설정 파일에 추가하세요.

{
  "mcpServers": {
    "atlascloud": {
      "command": "npx",
      "args": [
        "-y",
        "atlascloud-mcp"
      ],
      "env": {
        "ATLASCLOUD_API_KEY": "your-api-key-here"
      }
    }
  }
}

사용 가능한 도구

atlas_generate_image텍스트 프롬프트로 이미지를 생성합니다.

atlas_generate_video텍스트나 이미지로 동영상을 만듭니다.

atlas_chat대규모 언어 모델과 대화합니다.

atlas_list_models400개 이상의 사용 가능한 AI 모델을 탐색합니다.

atlas_quick_generate최적 모델을 자동 선택하여 한 번에 콘텐츠를 생성합니다.

atlas_upload_mediaAPI 워크플로우를 위해 로컬 파일을 업로드합니다.

더 알아보기

github.com/AtlasCloudAI/mcp-server

API 스키마

{
  "info": {
    "title": "AtlasCloud API",
    "version": "1.0.0",
    "description": "The AtlasCloud API."
  },
  "openapi": "3.0.0",
  "paths": {
    "/api/v1/model/generateImage": {
      "post": {
        "requestBody": {
          "content": {
            "application/json": {
              "schema": {
                "$ref": "#/components/schemas/Input"
              }
            }
          },
          "required": true
        },
        "responses": {
          "200": {
            "content": {
              "application/json": {
                "schema": {
                  "$ref": "#/components/schemas/PredictionResponse"
                }
              }
            },
            "description": "The request status."
          }
        }
      },
      "x-api-name": "model_run"
    },
    "/api/v1/model/result/{request_id}": {
      "get": {
        "parameters": [
          {
            "in": "path",
            "name": "request_id",
            "required": true,
            "schema": {
              "description": "Request ID",
              "type": "string"
            }
          }
        ],
        "responses": {
          "200": {
            "content": {
              "application/json": {
                "schema": {
                  "$ref": "#/components/schemas/PredictionResponse"
                }
              }
            },
            "description": "Result of the request."
          }
        }
      },
      "x-api-name": "model_result"
    }
  },
  "components": {
    "schemas": {
      "Input": {
        "properties": {
          "model": {
            "type": "string",
            "description": "model name",
            "default": "alibaba/wan-2.5/text-to-image"
          },
          "enable_prompt_expansion": {
            "default": false,
            "description": "If set to true, the prompt optimizer will be enabled.",
            "type": "boolean"
          },
          "negative_prompt": {
            "description": "Negative prompt for the generation.",
            "type": "string"
          },
          "prompt": {
            "description": "The prompt for generating the image.",
            "type": "string"
          },
          "seed": {
            "default": -1,
            "description": "The random seed to use for the generation. -1 means a random seed will be used.",
            "type": "integer"
          },
          "size": {
            "default": "1280*1280",
            "description": "The size of the generated image in pixels (width*height).",
            "enum": [
              "576*1344",
              "720*1280",
              "720*1680",
              "768*1024",
              "800*1200",
              "936*2184",
              "960*1280",
              "960*1440",
              "1024*768",
              "1024*1024",
              "1080*1920",
              "1168*1752",
              "1200*800",
              "1200*1600",
              "1224*1632",
              "1280*720",
              "1280*960",
              "1280*1280",
              "1344*576",
              "1440*960",
              "1440*1440",
              "1600*1200",
              "1632*1224",
              "1680*720",
              "1752*1168",
              "1920*1080",
              "2184*936"
            ],
            "type": "string"
          }
        },
        "required": [
          "model",
          "prompt"
        ],
        "type": "object",
        "x-order-properties": [
          "model",
          "prompt",
          "size",
          "enable_prompt_expansion",
          "negative_prompt",
          "seed"
        ]
      },
      "PredictionResponse": {
        "properties": {
          "created_at": {
            "description": "ISO timestamp of when the request was created (e.g., “2023-04-01T12:34:56.789Z”).",
            "format": "date-time",
            "type": "string"
          },
          "has_nsfw_contents": {
            "description": "Array of boolean values indicating NSFW detection for each output.",
            "items": {
              "type": "boolean"
            },
            "type": "array"
          },
          "id": {
            "description": "Unique identifier for the prediction, the ID of the prediction to get.",
            "type": "string"
          },
          "model": {
            "description": "Model ID used for the prediction.",
            "type": "string"
          },
          "outputs": {
            "description": "Array of URLs to the generated content (empty when status is not completed).",
            "items": {
              "type": "object"
            },
            "type": "array"
          },
          "status": {
            "description": "Status of the task: created, processing, completed, or failed.",
            "type": "string"
          },
          "urls": {
            "description": "Object containing related API endpoints.",
            "type": "object"
          }
        },
        "type": "object"
      }
    }
  },
  "servers": [
    {
      "url": "https://api.atlascloud.ai"
    }
  ]
}

LLM 친화적 프롬프트 템플릿

# alibaba/wan-2.5/text-to-image

> Generate AI images with Alibaba WAN 2.5 text-to-image model.


## Overview

- **Submit endpoint (POST)**: `https://api.atlascloud.ai/api/v1/model/generateImage` — start an async generation; returns a `prediction_id`
- **Poll endpoint (GET)**: `https://api.atlascloud.ai/api/v1/model/prediction/{prediction_id}` — poll this until the prediction finishes
- **Model ID**: `alibaba/wan-2.5/text-to-image`


## API Information

This model can be used via our HTTP API or more conveniently via our client libraries.
See the input and output schema below, as well as the usage examples.


### Input Schema

The API accepts the following input parameters:

- **`model`** (`string`, _required_):
  model name
  - Default: `"alibaba/wan-2.5/text-to-image"`

- **`prompt`** (`string`, _required_):
  The prompt for generating the image.

- **`size`** (`string`, _optional_):
  The size of the generated image in pixels (width*height).
  - Default: `"1280*1280"`
  - Options: "576*1344", "720*1280", "720*1680", "768*1024", "800*1200", "936*2184", "960*1280", "960*1440", "1024*768", "1024*1024", "1080*1920", "1168*1752", "1200*800", "1200*1600", "1224*1632", "1280*720", "1280*960", "1280*1280", "1344*576", "1440*960", "1440*1440", "1600*1200", "1632*1224", "1680*720", "1752*1168", "1920*1080", "2184*936"

- **`enable_prompt_expansion`** (`boolean`, _optional_):
  If set to true, the prompt optimizer will be enabled.
  - Default: `false`

- **`negative_prompt`** (`string`, _optional_):
  Negative prompt for the generation.

- **`seed`** (`integer`, _optional_):
  The random seed to use for the generation. -1 means a random seed will be used.
  - Default: `-1`



**Required Parameters Example**:

```json
{
  "model": "alibaba/wan-2.5/text-to-image",
  "prompt": ""
}
```


**Full Example**:

```json
{
  "model": "alibaba/wan-2.5/text-to-image",
  "prompt": "",
  "size": "1280*1280",
  "enable_prompt_expansion": false,
  "negative_prompt": "",
  "seed": -1
}
```


### Output Schema

The API returns the following output format:


- **`created_at`** (`string`, _optional_):
  ISO timestamp of when the request was created (e.g., “2023-04-01T12:34:56.789Z”).

- **`has_nsfw_contents`** (`array[boolean]`, _optional_):
  Array of boolean values indicating NSFW detection for each output.

- **`id`** (`string`, _optional_):
  Unique identifier for the prediction, the ID of the prediction to get.

- **`model`** (`string`, _optional_):
  Model ID used for the prediction.

- **`outputs`** (`array[object]`, _optional_):
  Array of URLs to the generated content (empty when status is not completed).

- **`status`** (`string`, _optional_):
  Status of the task: created, processing, completed, or failed.

- **`urls`** (`object`, _optional_):
  Object containing related API endpoints.



**Example Response**:

```json
{
  "created_at": "",
  "has_nsfw_contents": [],
  "id": "",
  "model": "",
  "outputs": [],
  "status": "",
  "urls": {}
}
```


## Usage Examples

### cURL

```bash
# Step 1: Start generation (async)
curl -X POST "https://api.atlascloud.ai/api/v1/model/generateImage" \
  -H "Authorization: Bearer $ATLASCLOUD_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{
  "model": "alibaba/wan-2.5/text-to-image",
  "prompt": "",
  "size": "1280*1280",
  "enable_prompt_expansion": false,
  "negative_prompt": "",
  "seed": -1
}'

# Response will contain: {"code": 200, "data": {"id": "prediction_id", "status": "processing"}}

# Step 2: Poll for result (replace {prediction_id} with the id returned above)
curl -X GET "https://api.atlascloud.ai/api/v1/model/prediction/{prediction_id}" \
  -H "Authorization: Bearer $ATLASCLOUD_API_KEY"

# Keep polling until status is "completed", "succeeded" or "failed"
# When completed, outputs will contain the generated content URL(s)
```

## Additional Resources

### Documentation

- [Model Playground](https://www.atlascloud.ai/models/alibaba/wan-2.5/text-to-image)

사용 가능한 예제 없음

로드 중...

Wan 2.5를 선택해야 하는 이유

더 합리적인 가격

Google의 최근 가격 인하에도 불구하고 Veo 3는 전반적으로 여전히 고가입니다. Wan 2.5는 경량화되어 있고 비용 효율적이며, 창작자에게 더 많은 선택지를 제공하면서 제작 비용을 대폭 절감합니다.

원스텝 생성, 엔드투엔드 동기화

Wan 2.5를 사용하면 별도의 음성 녹음이나 수동 입술 동기화 작업이 필요하지 않습니다. 명확하고 구조화된 프롬프트만 제공하면 오디오/보이스오버와 립싱크가 포함된 완성도 높은 영상을 한 번에 생성할 수 있습니다. 더 빠르고 간편합니다.

다국어 친화적

중국어 프롬프트를 입력해도 Wan 2.5는 음성·영상이 동기화된 영상을 안정적으로 생성합니다. 반면 Veo 3는 중국어 프롬프트에 대해 "알 수 없는 언어"를 표시하는 경우가 많습니다.

정밀한 캐릭터 재현

Wan 2.5는 캐릭터 특성 복원에 탁월하여 캐릭터의 외모, 표정, 동작 스타일을 정확하게 구현합니다. 생성된 영상 속 캐릭터의 개성과 인지도를 높여 스토리텔링과 몰입감을 한층 강화합니다.

예술적 스타일 렌더링

Studio Ghibli 스타일 렌더링을 지원하여 손으로 그린 듯한 수채화 질감과 애니메이션 효과를 구현합니다. 따뜻하고 몽환적인 시각적 경험을 선사하며 예술적 감성과 스토리텔링의 깊이를 더합니다.

누가 활용할 수 있나요?

마케팅 팀

제품 출시, 프로모션 캠페인, 브랜드 마케팅 등 어떤 경우에도 Wan 2.5는 고품질 영상을 빠르게 생성하여 창작을 쉽고 효율적으로 만들어 줍니다.

복잡한 조율 없이 제품 데모와 튜토리얼 제작 가능
다국어 자막과 립싱크를 활용한 소셜 미디어 마케팅
AI 생성 콘텐츠로 팀이 전략과 창의성에 집중할 수 있음

Bottom line: 한 마디로: 창작이 이토록 간단하고 빠르고 스마트해진 적이 없었습니다. Wan 2.5는 마케팅의 비밀 무기입니다!

글로벌 기업

다국적 기업에 이상적인 콘텐츠 현지화 솔루션을 제공하여 창작을 더 쉽고 효율적으로 만들어 줍니다.

프롬프트 인식 기반 다국어 영상 지원
립싱크 자막 및 보이스오버 원클릭 생성
글로벌 시장을 위한 신속한 콘텐츠 현지화

Bottom line: 한 마디로: 크로스보더 콘텐츠 제작이 이토록 간단하고 빠르고 스마트해진 적이 없었습니다.

스토리 창작자 / YouTuber

창작자는 Wan 2.5를 활용하여 고품질 결과물을 유지하면서 영상 제작 효율을 높일 수 있습니다.

정밀한 캐릭터 동작과 표정으로 몰입감 있는 스토리텔링 구현
편집 및 후반 제작 시간 단축으로 더 빠른 업로드 가능
숏폼 영상부터 애니메이션 스토리 세그먼트까지 다양한 콘텐츠 제작

기업 교육 팀

Wan 2.5는 기업 교육을 더 효율적이고 흥미롭게 만들어 줍니다.

지루한 텍스트 문서를 전문적인 영상으로 대체
운영 데모 및 교육 튜토리얼 빠르게 제작
일관된 스타일과 표준화된 결과물로 글로벌 배포 용이

프리랜서 크리에이터 / 소규모 스튜디오

Wan 2.5를 사용하면 값비싼 장비나 배우 없이도 창의력을 자유롭게 발휘할 수 있습니다. AI가 모든 것을 효율적으로 생성합니다.

단편 영화부터 소셜 미디어 콘텐츠까지 다양한 작품 도전 가능
영감에서 완성까지 "원클릭 생성"으로 구현
값비싼 장비나 전문 배우 없이도 고품질 콘텐츠 제작

Bottom line: 한 마디로: Wan 2.5는 창작을 더 쉽고, 더 자유롭고, 더 짜릿하게 만들어 줍니다. 매번 새로운 시도가 놀라울 것입니다!

교육 기관 / 온라인 강의 크리에이터

높은 비용 없이 창의력을 현실로 구현하세요. Wan 2.5는 고품질 콘텐츠 제작을 쉽고 경제적으로 만들어 줍니다.

단편 영화부터 홍보 영상까지 다양한 스타일 실험 가능
기획에서 완성물까지 더 높은 제작 효율성
값비싼 장비나 전문 인력 없이도 고품질 콘텐츠 제작

Bottom line: 한 마디로: Wan 2.5는 창작을 쉽고, 효율적이고, 자유롭게 만들어 줍니다. 매번 시도할 때마다 눈부신 결과를 경험하세요!

핵심 기능

원스텝 A/V 생성

단일 프로세스에서 동기화된 오디오, 보이스오버, 립싱크가 포함된 완성 영상 생성

2인 캐릭터 동기화

두 캐릭터의 동작, 표정, 립싱크를 동시에 생성하여 자연스러운 상호작용 구현 지원

전문가 수준의 품질

사실적인 캐릭터 표정과 정밀한 립싱크를 갖춘 고품질 영상 출력

다국어 지원

중국어 프롬프트에 대한 우수한 지원 및 다국어 콘텐츠 안정적 생성

뛰어난 비용 효율

전문가 수준의 품질을 유지하면서 경쟁사 대비 비용을 대폭 절감

캐릭터 특성 복원

높은 충실도와 개성으로 캐릭터의 외모, 표정, 동작 스타일을 정밀하게 재현

예술적 스타일 렌더링

Studio Ghibli 스타일의 손으로 그린 듯한 수채화 질감을 포함한 다양한 예술 스타일 지원

몰입감 있는 장면

자연스러운 음성·영상 일치로 대화 장면, 인터뷰, 2인 단편 영화에 최적화

Digital Human Sync

Study Room Scholar

Middle-aged man reading with perfect lip-sync in a warm study environment

Lip-sync with audioEnvironmental soundsCharacter emotion

Prompt

A middle-aged man sitting at a wooden desk in a cozy study room, surrounded by bookshelves and a warm lamp glow. He opens an old book and reads aloud with a calm, deep voice: 'History teaches us more than just facts… it shows us who we are.' The room has subtle background sounds: pages turning, the faint ticking of a clock, and distant rain against the window.

Dual Character Scene

Park Sunset Romance

Couple interaction with synchronized dual character actions and expressions

Dual character syncNatural interactionAmbient soundscape

Prompt

A young couple sitting on a park bench during sunset. The woman leans her head on the man's shoulder. He whispers softly: 'No matter where we go, I'll always be here with you.' The sound includes the rustling of leaves, distant laughter of children playing, and the gentle hum of cicadas in the evening air.

Character Restoration

Ballet Performance Art

Precise character trait restoration with artistic movement and expression

Character trait restorationMovement precisionArtistic lighting

Prompt

A graceful ballerina with her hair in a messy bun, performing a powerful and emotional contemporary ballet routine. She is in a minimalist, dark art studio. Abstract patterns of light and shadow, projected from a hidden source, dance across her body and the surrounding walls, constantly shifting with her movements. The camera focuses on the tension in her muscles and the expressive gestures of her hands. A single, dramatic slow-motion shot captures her mid-air leap, with the light patterns swirling around her like a galaxy. Moody, artistic, high contrast.

Artistic Style Rendering

Ghibli Forest Magic

Studio Ghibli-inspired animation with hand-painted watercolor texture

Ghibli art styleHand-painted textureMagical atmosphere

Prompt

Studio Ghibli-inspired anime style. A young girl with a straw hat lies peacefully in a sun-dappled magical forest, surrounded by friendly, glowing forest spirits (Kodama). A gentle breeze rustles the leaves of the giant, ancient trees. The air is filled with sparkling dust motes, illuminated by shafts of sunlight. The art style is soft, with a hand-painted watercolor texture. The scene feels serene, magical, and heartwarming.

완벽한 활용

🎬

영상 제작

📢

마케팅 콘텐츠

🎓

교육 영상

📱

소셜 미디어

🌐

다국어 콘텐츠

💼

기업 교육

🎭

엔터테인먼트

💃

퍼포먼스 아트

🎨

애니메이션 & 애니

📚

스토리텔링

👥

2인 캐릭터 영상

🎙️

인터뷰

📺

방송 미디어

기술 사양

모델 유형:음성·영상 동기화 생성

주요 기능:A/V 동기화, 캐릭터 복원, 예술적 렌더링, 다국어

언어 지원:중국어, 영어 외 다수

출력 품질:오디오가 포함된 전문 HD 영상

생성 속도:빠른 원스텝 생성

API 통합:포괄적인 문서를 갖춘 RESTful API

Wan 2.5 경험하기 - 당신의 영상 창작 혁명

수천 명의 크리에이터와 기업과 함께 음성·영상 동기화 생성 기술로 영상 콘텐츠 제작을 혁신하세요.

🎬원스텝 A/V 동기화

🌍다국어 지원

⚡뛰어난 비용 효율

Alibaba WAN 2.5 Text-to-Image Model

Alibaba WAN 2.5 is a high-quality text-to-image model provided by Alibaba Cloud's DashScope platform.

유사한 모델 탐색

Wan-2.7 Pro Image-to-image

Edits and recomposes images with Wan 2.7 image pro using text instructions and multi-image references for higher quality outputs.

Wan-2.7 Pro Text-to-image

Generates images from text prompts with Wan 2.7 image pro, supporting higher fidelity outputs and 4K-ready workflows.

Wan-2.7 Image-to-image

Edits and recomposes images with Wan 2.7 image using text instructions, multi-image references, and optional interaction boxes.

Wan-2.7 Text-to-image

Generates images from text prompts with Wan 2.7 image, supporting fast iteration and strong prompt fidelity for illustration and photorealistic outputs.

Qwen Image 2.0 Pro Text-to-image

Qwen Image 2.0 Pro is a professional-grade text-to-image model with superior quality and advanced prompt understanding. Up to 2k. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

Qwen Image 2.0 Pro Edit

Qwen Image 2.0 Pro Edit is a professional-grade image editing model with superior quality and advanced instruction understanding. Up to 2k. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

Qwen Image 2.0 Edit

Qwen Image 2.0 Edit is an advanced image-editing model with improved quality and better understanding of instructions. Up to 2k. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

Qwen Image 2.0 Text-to-image

Qwen Image 2.0 is an advanced text-to-image model with enhanced image quality and improved prompt understanding. Up to 2k. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

Qwen-Image Edit Plus 20251215

Supports multiple image inputs and outputs, allowing for precise modification of text within images, addition, deletion, or movement of objects, alteration of subject actions, transfer of image styles, and enhancement of image details.

From$0.03/이미지

$0.021/이미지

-30%