alibaba/wan-2.6/text-to-image

टेक्स्ट-से-इमेज

Wan 2.6 Text-to-Image API by Alibaba

alibaba/wan-2.6/text-to-image

Text-to-image

Generates images based on text, supports various artistic styles and realistic photographic effects, and meets diverse creative needs.

इनपुट

प्रॉम्प्ट *

नकारात्मक प्रॉम्प्ट

आकार

प्रॉम्प्ट विस्तार

सीड

आउटपुट

निष्क्रिय

जेनरेट की गई छवियां यहां दिखाई देंगी

अपनी सेटिंग्स कॉन्फ़िगर करें और शुरू करने के लिए चलाएं पर क्लिक करें

आपके अनुरोध की लागत $0.021 प्रति रन होगी। $10 के साथ आप इस मॉडल को लगभग 476 बार चला सकते हैं।

आगे आप यह कर सकते हैं:

इमेज-से-वीडियो इमेज-से-इमेज

पैरामीटर

कोड उदाहरण
import requests
import time

# Step 1: Start image generation
generate_url = "https://api.atlascloud.ai/api/v1/model/generateImage"
headers = {
    "Content-Type": "application/json",
    "Authorization": "Bearer $ATLASCLOUD_API_KEY"
}
data = {
    "model": "alibaba/wan-2.6/text-to-image",  # Required. model name
    "enable_prompt_expansion": False,  # If set to true, the prompt optimizer will be enabled
    "enable_base64_output": False,  # If enabled, the output will be encoded into a BASE64 string instead of a URL
    "enable_sync_mode": False,  # If set to true, the function will wait for the result to be generated and uploaded before returning the response
    "negative_prompt": "example_value",  # Negative prompt for the generation
    "prompt": "A beautiful landscape with mountains and lake",  # Required. The prompt for generating the image
    "seed": -1,  # The random seed to use for the generation
    "size": "1280*1280",  # The size of the generated image in pixels (width*height)
}

generate_response = requests.post(generate_url, headers=headers, json=data)
generate_result = generate_response.json()
prediction_id = generate_result["data"]["id"]

# Step 2: Poll for result
poll_url = f"https://api.atlascloud.ai/api/v1/model/prediction/{prediction_id}"

def check_status():
    while True:
        response = requests.get(poll_url, headers={"Authorization": "Bearer $ATLASCLOUD_API_KEY"})
        result = response.json()

        if result["data"]["status"] == "completed":
            print("Generated image:", result["data"]["outputs"][0])
            return result["data"]["outputs"][0]
        elif result["data"]["status"] == "failed":
            raise Exception(result["data"]["error"] or "Generation failed")
        else:
            # Still processing, wait 2 seconds
            time.sleep(2)

image_url = check_status()

इंस्टॉल करें

अपनी प्रोग्रामिंग भाषा के लिए आवश्यक पैकेज इंस्टॉल करें।

pip install requests

प्रमाणीकरण

सभी API अनुरोधों के लिए API कुंजी के माध्यम से प्रमाणीकरण आवश्यक है। आप अपनी API कुंजी Atlas Cloud डैशबोर्ड से प्राप्त कर सकते हैं।

export ATLASCLOUD_API_KEY="your-api-key-here"

HTTP हेडर

import os

API_KEY = os.environ.get("ATLASCLOUD_API_KEY")
headers = {
    "Content-Type": "application/json",
    "Authorization": f"Bearer {API_KEY}"
}

अपनी API कुंजी सुरक्षित रखें

क्लाइंट-साइड कोड या सार्वजनिक रिपॉज़िटरी में अपनी API कुंजी कभी उजागर न करें। इसके बजाय एनवायरनमेंट वेरिएबल या बैकएंड प्रॉक्सी का उपयोग करें।

अनुरोध सबमिट करें

import requests

url = "https://api.atlascloud.ai/api/v1/model/generateImage"
headers = {
    "Content-Type": "application/json",
    "Authorization": "Bearer $ATLASCLOUD_API_KEY"
}
data = {
    "model": "your-model",
    "prompt": "A beautiful landscape"
}

response = requests.post(url, headers=headers, json=data)
print(response.json())

अनुरोध सबमिट करें

एक असिंक्रोनस जनरेशन अनुरोध सबमिट करें। API एक प्रेडिक्शन ID लौटाता है जिसका उपयोग आप स्थिति जाँचने और परिणाम प्राप्त करने के लिए कर सकते हैं।

POST/api/v1/model/generateImage

अनुरोध बॉडी

import requests

url = "https://api.atlascloud.ai/api/v1/model/generateImage"
headers = {
    "Content-Type": "application/json",
    "Authorization": "Bearer $ATLASCLOUD_API_KEY"
}

data = {
    "model": "alibaba/wan-2.6/text-to-image",
    "prompt": "A beautiful landscape with mountains and lake"
}

response = requests.post(url, headers=headers, json=data)
result = response.json()

print(f"Prediction ID: {result['data']['id']}")
print(f"Status: {result['data']['status']}")

प्रतिक्रिया

{
  "code": 200,
  "data": {
    "id": "pred_abc123",
    "status": "processing",
    "model": "model-name",
    "created_at": "2025-01-01T00:00:00Z"
  }
}

स्थिति जाँचें

अपने अनुरोध की वर्तमान स्थिति जाँचने के लिए प्रेडिक्शन एंडपॉइंट को पोल करें।

GET/api/v1/model/prediction/{prediction_id}

पोलिंग उदाहरण

import requests
import time

prediction_id = "pred_abc123"
url = f"https://api.atlascloud.ai/api/v1/model/prediction/{prediction_id}"
headers = { "Authorization": "Bearer $ATLASCLOUD_API_KEY" }

while True:
    response = requests.get(url, headers=headers)
    result = response.json()
    status = result["data"]["status"]
    print(f"Status: {status}")

    if status in ["completed", "succeeded"]:
        output_url = result["data"]["outputs"][0]
        print(f"Output URL: {output_url}")
        break
    elif status == "failed":
        print(f"Error: {result['data'].get('error', 'Unknown')}")
        break

    time.sleep(3)

स्थिति मान

processingअनुरोध अभी भी प्रोसेस हो रहा है।

completedजनरेशन पूर्ण हो गया है। आउटपुट उपलब्ध हैं।

succeededजनरेशन सफल रहा। आउटपुट उपलब्ध हैं।

failedजनरेशन विफल हो गया। एरर फ़ील्ड जाँचें।

पूर्ण प्रतिक्रिया

{
  "data": {
    "id": "pred_abc123",
    "status": "completed",
    "outputs": [
      "https://storage.atlascloud.ai/outputs/result.png"
    ],
    "metrics": {
      "predict_time": 8.3
    },
    "created_at": "2025-01-01T00:00:00Z",
    "completed_at": "2025-01-01T00:00:10Z"
  }
}

फ़ाइलें अपलोड करें

Atlas Cloud स्टोरेज पर फ़ाइलें अपलोड करें और एक URL प्राप्त करें जिसका उपयोग आप अपने API अनुरोधों में कर सकते हैं। अपलोड के लिए multipart/form-data का उपयोग करें।

POST/api/v1/model/uploadMedia

अपलोड उदाहरण

import requests

url = "https://api.atlascloud.ai/api/v1/model/uploadMedia"
headers = { "Authorization": "Bearer $ATLASCLOUD_API_KEY" }

with open("image.png", "rb") as f:
    files = {"file": ("image.png", f, "image/png")}
    response = requests.post(url, headers=headers, files=files)

result = response.json()
download_url = result["data"]["download_url"]
print(f"File URL: {download_url}")

प्रतिक्रिया

{
  "data": {
    "download_url": "https://storage.atlascloud.ai/uploads/abc123/image.png",
    "file_name": "image.png",
    "content_type": "image/png",
    "size": 1024000
  }
}

Input Schema

अनुरोध बॉडी में निम्नलिखित पैरामीटर स्वीकार किए जाते हैं।

कुल: 8आवश्यक: 2वैकल्पिक: 6

modelstringrequired

model name

Default: "alibaba/wan-2.6/text-to-image"

enable_prompt_expansionboolean

If set to true, the prompt optimizer will be enabled.

Default: false

enable_base64_outputboolean

If enabled, the output will be encoded into a BASE64 string instead of a URL. This property is only available through the API.

Default: false

enable_sync_modeboolean

If set to true, the function will wait for the result to be generated and uploaded before returning the response. It allows you to get the result directly in the response. This property is only available through the API.

Default: false

negative_promptstring

Negative prompt for the generation.

promptstringrequired

The prompt for generating the image.

seedinteger

The random seed to use for the generation. -1 means a random seed will be used.

Default: -1

sizestring

The size of the generated image in pixels (width*height).

Default: "1280*1280"

576*1344720*1280720*1680768*1024800*1200936*2184960*1280960*14401024*7681024*10241080*19201168*17521200*8001200*16001224*16321280*7201280*9601280*12801344*5761440*9601440*14401600*12001632*12241680*7201752*11681920*10802184*936

अनुरोध बॉडी का उदाहरण

{
  "model": "alibaba/wan-2.6/text-to-image",
  "enable_prompt_expansion": false,
  "enable_base64_output": false,
  "enable_sync_mode": false,
  "prompt": "A beautiful landscape",
  "seed": -1,
  "size": "1280*1280"
}

Output Schema

API जनरेट किए गए आउटपुट URL के साथ एक प्रेडिक्शन प्रतिक्रिया लौटाता है।

created_atstring

ISO timestamp of when the request was created (e.g., “2023-04-01T12:34:56.789Z”).

idstring

Unique identifier for the prediction, the ID of the prediction to get.

modelstring

Model ID used for the prediction.

outputsarray

Array of URLs to the generated content (empty when status is not completed).

statusstring

Status of the task: created, processing, completed, or failed.

प्रतिक्रिया का उदाहरण

{
  "id": "pred_abc123",
  "status": "completed",
  "model": "model-name",
  "outputs": [
    "https://storage.atlascloud.ai/outputs/result.png"
  ],
  "metrics": {
    "predict_time": 8.3
  },
  "created_at": "2025-01-01T00:00:00Z",
  "completed_at": "2025-01-01T00:00:10Z"
}

Atlas Cloud Skills

Atlas Cloud Skills 400+ AI मॉडल को सीधे आपके AI कोडिंग असिस्टेंट में इंटीग्रेट करता है। इंस्टॉल करने के लिए एक कमांड, फिर इमेज, वीडियो जनरेट करने और LLM के साथ चैट करने के लिए प्राकृतिक भाषा का उपयोग करें।

समर्थित क्लाइंट

Claude Code

OpenAI Codex

Gemini CLI

Cursor

Windsurf

VS Code

Trae

GitHub Copilot

Cline

Roo Code

Amp

Goose

Replit

40+ समर्थित क्लाइंट

इंस्टॉल करें

npx skills add AtlasCloudAI/atlas-cloud-skills

API कुंजी सेटअप करें

Atlas Cloud डैशबोर्ड से अपनी API कुंजी प्राप्त करें और इसे एनवायरनमेंट वेरिएबल के रूप में सेट करें।

export ATLASCLOUD_API_KEY="your-api-key-here"

क्षमताएँ

एक बार इंस्टॉल होने के बाद, आप सभी Atlas Cloud मॉडल तक पहुँचने के लिए अपने AI असिस्टेंट में प्राकृतिक भाषा का उपयोग कर सकते हैं।

इमेज जनरेशनNano Banana 2, Z-Image और अन्य मॉडल के साथ इमेज जनरेट करें।

वीडियो निर्माणKling, Vidu, Veo आदि के साथ टेक्स्ट या इमेज से वीडियो बनाएँ।

LLM चैटQwen, DeepSeek और अन्य बड़े भाषा मॉडल के साथ चैट करें।

मीडिया अपलोडइमेज एडिटिंग और इमेज-टू-वीडियो वर्कफ़्लो के लिए लोकल फ़ाइलें अपलोड करें।

और जानें

github.com/AtlasCloudAI/atlas-cloud-skills

MCP Server

Atlas Cloud MCP Server आपके IDE को Model Context Protocol के माध्यम से 400+ AI मॉडल से जोड़ता है। किसी भी MCP-संगत क्लाइंट के साथ काम करता है।

समर्थित क्लाइंट

Cursor

VS Code

Windsurf

Claude Code

OpenAI Codex

Gemini CLI

Cline

Roo Code

100+ समर्थित क्लाइंट

इंस्टॉल करें

npx -y atlascloud-mcp

कॉन्फ़िगरेशन

अपने IDE की MCP सेटिंग्स फ़ाइल में निम्नलिखित कॉन्फ़िगरेशन जोड़ें।

{
  "mcpServers": {
    "atlascloud": {
      "command": "npx",
      "args": [
        "-y",
        "atlascloud-mcp"
      ],
      "env": {
        "ATLASCLOUD_API_KEY": "your-api-key-here"
      }
    }
  }
}

उपलब्ध टूल

atlas_generate_imageटेक्स्ट प्रॉम्प्ट से इमेज जनरेट करें।

atlas_generate_videoटेक्स्ट या इमेज से वीडियो बनाएँ।

atlas_chatबड़े भाषा मॉडल के साथ चैट करें।

atlas_list_models400+ उपलब्ध AI मॉडल ब्राउज़ करें।

atlas_quick_generateऑटो मॉडल चयन के साथ एक-चरण कंटेंट निर्माण।

atlas_upload_mediaAPI वर्कफ़्लो के लिए लोकल फ़ाइलें अपलोड करें।

और जानें

github.com/AtlasCloudAI/mcp-server

API स्कीमा

{
  "info": {
    "title": "AtlasCloud API",
    "version": "1.0.0",
    "description": "The AtlasCloud API."
  },
  "openapi": "3.0.0",
  "paths": {
    "/api/v1/model/generateImage": {
      "post": {
        "requestBody": {
          "content": {
            "application/json": {
              "schema": {
                "$ref": "#/components/schemas/Input"
              }
            }
          },
          "required": true
        },
        "responses": {
          "200": {
            "content": {
              "application/json": {
                "schema": {
                  "$ref": "#/components/schemas/PredictionResponse"
                }
              }
            },
            "description": "The request status."
          }
        }
      },
      "x-api-name": "model_run"
    },
    "/api/v1/model/prediction/{request_id}": {
      "get": {
        "parameters": [
          {
            "in": "path",
            "name": "request_id",
            "required": true,
            "schema": {
              "description": "Request ID",
              "type": "string"
            }
          }
        ],
        "responses": {
          "200": {
            "content": {
              "application/json": {
                "schema": {
                  "$ref": "#/components/schemas/PredictionResponse"
                }
              }
            },
            "description": "Result of the request."
          }
        }
      },
      "x-api-name": "model_result"
    }
  },
  "components": {
    "schemas": {
      "Input": {
        "properties": {
          "model": {
            "type": "string",
            "description": "model name",
            "default": "alibaba/wan-2.6/text-to-image"
          },
          "enable_prompt_expansion": {
            "default": false,
            "description": "If set to true, the prompt optimizer will be enabled.",
            "type": "boolean"
          },
          "enable_base64_output": {
            "default": false,
            "description": "If enabled, the output will be encoded into a BASE64 string instead of a URL. This property is only available through the API.",
            "disabled": true,
            "type": "boolean"
          },
          "enable_sync_mode": {
            "default": false,
            "description": "If set to true, the function will wait for the result to be generated and uploaded before returning the response. It allows you to get the result directly in the response. This property is only available through the API.",
            "disabled": true,
            "type": "boolean"
          },
          "negative_prompt": {
            "description": "Negative prompt for the generation.",
            "type": "string"
          },
          "prompt": {
            "description": "The prompt for generating the image.",
            "type": "string"
          },
          "seed": {
            "default": -1,
            "description": "The random seed to use for the generation. -1 means a random seed will be used.",
            "type": "integer"
          },
          "size": {
            "default": "1280*1280",
            "description": "The size of the generated image in pixels (width*height).",
            "enum": [
              "576*1344",
              "720*1280",
              "720*1680",
              "768*1024",
              "800*1200",
              "936*2184",
              "960*1280",
              "960*1440",
              "1024*768",
              "1024*1024",
              "1080*1920",
              "1168*1752",
              "1200*800",
              "1200*1600",
              "1224*1632",
              "1280*720",
              "1280*960",
              "1280*1280",
              "1344*576",
              "1440*960",
              "1440*1440",
              "1600*1200",
              "1632*1224",
              "1680*720",
              "1752*1168",
              "1920*1080",
              "2184*936"
            ],
            "type": "string"
          }
        },
        "required": [
          "model",
          "prompt"
        ],
        "type": "object",
        "x-order-properties": [
          "model",
          "prompt",
          "negative_prompt",
          "size",
          "enable_prompt_expansion",
          "enable_sync_mode",
          "enable_base64_output",
          "seed"
        ]
      },
      "PredictionResponse": {
        "properties": {
          "created_at": {
            "description": "ISO timestamp of when the request was created (e.g., “2023-04-01T12:34:56.789Z”).",
            "format": "date-time",
            "type": "string"
          },
          "has_nsfw_contents": {
            "description": "Array of boolean values indicating NSFW detection for each output.",
            "items": {
              "type": "boolean"
            },
            "type": "array"
          },
          "id": {
            "description": "Unique identifier for the prediction, the ID of the prediction to get.",
            "type": "string"
          },
          "model": {
            "description": "Model ID used for the prediction.",
            "type": "string"
          },
          "outputs": {
            "description": "Array of URLs to the generated content (empty when status is not completed).",
            "items": {
              "type": "object"
            },
            "type": "array"
          },
          "status": {
            "description": "Status of the task: created, processing, completed, or failed.",
            "type": "string"
          },
          "urls": {
            "description": "Object containing related API endpoints.",
            "type": "object"
          }
        },
        "type": "object"
      }
    }
  },
  "servers": [
    {
      "url": "https://api.atlascloud.ai"
    }
  ]
}

LLM के लिए प्रॉम्प्ट टेम्पलेट

# alibaba/wan-2.6/text-to-image

> Generates images based on text, supports various artistic styles and realistic photographic effects, and meets diverse creative needs.


## Overview

- **Submit endpoint (POST)**: `https://api.atlascloud.ai/api/v1/model/generateImage` — start an async generation; returns a `prediction_id`
- **Poll endpoint (GET)**: `https://api.atlascloud.ai/api/v1/model/prediction/{prediction_id}` — poll this until the prediction finishes
- **Model ID**: `alibaba/wan-2.6/text-to-image`


## API Information

This model can be used via our HTTP API or more conveniently via our client libraries.
See the input and output schema below, as well as the usage examples.


### Input Schema

The API accepts the following input parameters:

- **`model`** (`string`, _required_):
  model name
  - Default: `"alibaba/wan-2.6/text-to-image"`

- **`prompt`** (`string`, _required_):
  The prompt for generating the image.

- **`negative_prompt`** (`string`, _optional_):
  Negative prompt for the generation.

- **`size`** (`string`, _optional_):
  The size of the generated image in pixels (width*height).
  - Default: `"1280*1280"`
  - Options: "576*1344", "720*1280", "720*1680", "768*1024", "800*1200", "936*2184", "960*1280", "960*1440", "1024*768", "1024*1024", "1080*1920", "1168*1752", "1200*800", "1200*1600", "1224*1632", "1280*720", "1280*960", "1280*1280", "1344*576", "1440*960", "1440*1440", "1600*1200", "1632*1224", "1680*720", "1752*1168", "1920*1080", "2184*936"

- **`enable_prompt_expansion`** (`boolean`, _optional_):
  If set to true, the prompt optimizer will be enabled.
  - Default: `false`

- **`enable_sync_mode`** (`boolean`, _optional_):
  If set to true, the function will wait for the result to be generated and uploaded before returning the response. It allows you to get the result directly in the response. This property is only available through the API.
  - Default: `false`

- **`enable_base64_output`** (`boolean`, _optional_):
  If enabled, the output will be encoded into a BASE64 string instead of a URL. This property is only available through the API.
  - Default: `false`

- **`seed`** (`integer`, _optional_):
  The random seed to use for the generation. -1 means a random seed will be used.
  - Default: `-1`



**Required Parameters Example**:

```json
{
  "model": "alibaba/wan-2.6/text-to-image",
  "prompt": ""
}
```


**Full Example**:

```json
{
  "model": "alibaba/wan-2.6/text-to-image",
  "prompt": "",
  "negative_prompt": "",
  "size": "1280*1280",
  "enable_prompt_expansion": false,
  "enable_sync_mode": false,
  "enable_base64_output": false,
  "seed": -1
}
```


### Output Schema

The API returns the following output format:


- **`created_at`** (`string`, _optional_):
  ISO timestamp of when the request was created (e.g., “2023-04-01T12:34:56.789Z”).

- **`has_nsfw_contents`** (`array[boolean]`, _optional_):
  Array of boolean values indicating NSFW detection for each output.

- **`id`** (`string`, _optional_):
  Unique identifier for the prediction, the ID of the prediction to get.

- **`model`** (`string`, _optional_):
  Model ID used for the prediction.

- **`outputs`** (`array[object]`, _optional_):
  Array of URLs to the generated content (empty when status is not completed).

- **`status`** (`string`, _optional_):
  Status of the task: created, processing, completed, or failed.

- **`urls`** (`object`, _optional_):
  Object containing related API endpoints.



**Example Response**:

```json
{
  "created_at": "",
  "has_nsfw_contents": [],
  "id": "",
  "model": "",
  "outputs": [],
  "status": "",
  "urls": {}
}
```


## Usage Examples

### cURL

```bash
# Step 1: Start generation (async)
curl -X POST "https://api.atlascloud.ai/api/v1/model/generateImage" \
  -H "Authorization: Bearer $ATLASCLOUD_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{
  "model": "alibaba/wan-2.6/text-to-image",
  "prompt": "",
  "negative_prompt": "",
  "size": "1280*1280",
  "enable_prompt_expansion": false,
  "enable_sync_mode": false,
  "enable_base64_output": false,
  "seed": -1
}'

# Response will contain: {"code": 200, "data": {"id": "prediction_id", "status": "processing"}}

# Step 2: Poll for result (replace {prediction_id} with the id returned above)
curl -X GET "https://api.atlascloud.ai/api/v1/model/prediction/{prediction_id}" \
  -H "Authorization: Bearer $ATLASCLOUD_API_KEY"

# Keep polling until status is "completed", "succeeded" or "failed"
# When completed, outputs will contain the generated content URL(s)
```

## Additional Resources

### Documentation

- [Model Playground](https://www.atlascloud.ai/models/alibaba/wan-2.6/text-to-image)

A cinematic portrait of a woman standing under neon signs in a rainy night street, raindrops shimmering, reflective puddles, vivid colors, hyper realistic, 8K detail, photography style.

A serene mountain lake at sunrise, low mist hovering over water, golden light kissing snow-capped peaks, ultra-detailed landscape photography, natural color grading.

A futuristic cyberpunk city alley with holographic billboards, wet ground reflections, dense atmosphere, moody lighting, wide-angle composition, photorealistic rendering.

Close-up product shot of a matte black smartwatch on a marble surface, soft studio lighting, sharp edges, shallow depth of field, commercial photography, clean minimal style.

A young boy flying a red kite in a windy seaside field, golden hour sunlight, soft natural tones, lifestyle photography, joyful mood, realistic textures.

लोड हो रहा है...

🎬मल्टी-शॉट वीडियो जनरेशन

Wan 2.6प्रोफेशनल मल्टी-शॉट AI वीडियो क्रिएशन

AI वीडियो जनरेशन में Alibaba की नवीनतम सफलता। मल्टी-शॉट स्टोरीटेलिंग, रेफरेंस-ड्रिवेन कैरेक्टर कंसिस्टेंसी और नेटिव ऑडियो-विजुअल सिंक्रोनाइज़ेशन के साथ 15 सेकंड तक के 1080p वीडियो बनाएं। सिनेमैटिक नैरेटिव के लिए स्टोरीबोर्ड लॉजिक को सच्चे अर्थ में समझने वाला पहला मॉडल।

क्रांतिकारी सफलताएं

क्या Wan 2.6 को AI वीडियो जनरेशन में गेम-चेंजर बनाता है

मल्टी-शॉट स्टोरीटेलिंग

स्टोरीबोर्ड लॉजिक को समझने वाला पहला मॉडल। स्वचालित रूप से सुसंगत ट्रांज़िशन के साथ सीक्वेंशियल शॉट्स जनरेट करता है, सीन चेंजेज़ में कैरेक्टर अपीयरेंस और एनवायरनमेंट कंसिस्टेंसी बनाए रखता है—एक सिंगल 15-सेकंड जनरेशन में कम्पलीट स्टोरी आर्क्स को सक्षम बनाता है।

रेफरेंस-टू-वीडियो (R2V)

कैरेक्टर अपीयरेंस, मूवमेंट पैटर्न और वॉइस कैरेक्टरिस्टिक्स को एक्सट्रैक्ट और प्रिज़र्व करने के लिए 2-30 सेकंड का रेफरेंस वीडियो अपलोड करें। अभूतपूर्व सटीकता के साथ मल्टिपल वीडियो में कंसिस्टेंट कैरेक्टर परफॉर्मेंस बनाएं।

सटीक टेक्स्ट रेंडरिंग

प्रोडक्ट पैकेजिंग, साइनेज और ब्रांडेड कंटेंट के लिए इंडस्ट्री-लीडिंग टेक्स्ट रेंडरिंग क्षमताएं। वीडियो फ्रेम्स के भीतर क्लियर, रीडेबल टेक्स्ट जनरेट करें—मार्केटिंग और कमर्शियल एप्लिकेशन के लिए आवश्यक।

कोर कैपेबिलिटीज़

एक्सटेंडेड 15-सेकंड अवधि

हर वीडियो में संपूर्ण "थ्री-एक्ट" संरचना के साथ 15 सेकंड तक जनरेट करें (सेटअप → एक्शन → समाधान)

प्रोफेशनल 1080p क्वालिटी

सिनेमैटिक क्वालिटी और एन्हांस्ड विजुअल स्टेबिलिटी के साथ 24fps पर नेटिव 1080p आउटपुट

नेटिव ऑडियो सिंक

संवाद होंठों की हरकत से मेल खाते हैं, बैकग्राउंड म्यूज़िक गति के साथ तालमेल बिठाता है, और साउंड इफेक्ट बिल्कुल सही समय पर ट्रिगर होते हैं

कैरेक्टर कंसिस्टेंसी

शॉट्स और मल्टिपल वीडियो में कैरेक्टर अपीयरेंस, कॉस्ट्यूम्स और आइडेंटिटी को मेंटेन करें

सिनेमैटिक कैमरा कंट्रोल

पैन, ज़ूम, ट्रैकिंग शॉट और डॉली शॉट सहित प्रोफेशनल कैमरा मूवमेंट

फ्लेक्सिबल आस्पेक्ट रेशियो

16:9 (YouTube), 9:16 (Reels), 1:1 (स्क्वायर) - हर प्लेटफॉर्म के लिए ऑप्टिमाइज़्ड, बिना किसी पोस्ट-प्रोडक्शन क्रॉपिंग के

Wan 2.6 vs Wan 2.5: मेजर इम्प्रूवमेंट्स

नवीनतम रिलीज़ में नया क्या है देखें

वीडियो अवधि

15 सेकंड तक

Wan 2.5: अधिकतम 10 सेकंड

मल्टी-शॉट क्षमता

स्टोरीबोर्ड लॉजिक समझता है

Wan 2.5: सिंगल शॉट या मेसी मॉर्फिंग

रेफरेंस वीडियो सपोर्ट

फुल प्रिज़र्वेशन के साथ R2V मोड

Wan 2.5: केवल इमेज रेफरेंस

कैरेक्टर कंसिस्टेंसी

शॉट्स में एक्सीलेंट

Wan 2.5: कैरेक्टर ड्रिफ्ट इश्यूज़

मोशन स्टेबिलिटी

रिड्यूस्ड जिटर और आर्टिफैक्ट्स

Wan 2.5: ओकेज़नल फ्रेम ड्रिफ्ट

प्रॉम्प्ट अंडरस्टैंडिंग

कॉम्प्लेक्स मल्टी-कैरेक्टर सीन्स

Wan 2.5: बेसिक सीन जनरेशन

तीन स्पेशलाइज़्ड जनरेशन मोड्स

अपने क्रिएटिव वर्कफ्लो के लिए सही मोड चुनें

टेक्स्ट-टू-वीडियो (T2V)

सबसे लोकप्रिय

एन्हांस्ड मल्टी-शॉट सेगमेंटेशन और इम्प्रूव्ड प्रॉम्प्ट हैंडलिंग के साथ टेक्स्ट प्रॉम्प्ट्स से कम्पलीट वीडियो जनरेट करें। स्टोरीटेलिंग और क्रिएटिव एक्सप्लोरेशन के लिए परफेक्ट।

सिंगल प्रॉम्प्ट से ऑटोमैटिक शॉट सेगमेंटेशन
मल्टी-कैरेक्टर इंटरैक्शन अंडरस्टैंडिंग
कैमरा मूवमेंट और इमोशनल क्यूज़
एनवायरनमेंटल डिटेल प्रिज़र्वेशन

इमेज-टू-वीडियो (I2V)

एन्हांस्ड

इम्प्रूव्ड मोशन कोहेरेंस के साथ स्टिल इमेजेज़ को मोशन वीडियो में ट्रांसफॉर्म करें। प्रोडक्ट शोकेसेज़, फोटो एनिमेशन और विजुअल स्टोरीटेलिंग के लिए आदर्श।

प्रोडक्ट्स के लिए प्रिसाइज़ टेक्स्ट रेंडरिंग
फ्रेम्स में स्टाइल कंसिस्टेंसी
स्टैटिक इमेजेज़ से नेचुरल मोशन
नैरेटिव-ड्रिवेन विजुअल ऑप्टिमाइज़ेशन

रेफरेंस-टू-वीडियो (R2V)

नया

कैरेक्टर अपीयरेंस, मूवमेंट पैटर्न्स और वॉइस को प्रिज़र्व करने के लिए रेफरेंस वीडियो (2-30s) अपलोड करें। कैरेक्टर-ड्रिवेन कंटेंट के लिए स्ट्रॉन्गेस्ट कंसिस्टेंसी गारंटी।

फुल कैरेक्टर आइडेंटिटी प्रिज़र्वेशन
वॉइस कैरेक्टरिस्टिक्स एक्सट्रैक्शन
मूवमेंट पैटर्न रेप्लिकेशन
मल्टी-कैरेक्टर को-एक्टिंग सीन्स

परफेक्ट फॉर

मार्केटिंग और एडवर्टाइज़िंग

टेक्स्ट रेंडरिंग के साथ प्रोडक्ट डेमोज़, कैरेक्टर कंसिस्टेंसी के साथ ब्रांड कैंपेन और प्रोमोशनल वीडियो

कंटेंट क्रिएशन

YouTube वीडियो, सोशल मीडिया रील्स, मल्टी-शॉट स्टोरीटेलिंग और वीडियो एडिटिंग वर्कफ्लोज़

ई-कॉमर्स

एक्यूरेट टेक्स्ट के साथ प्रोडक्ट शोकेसेज़, ट्यूटोरियल वीडियो और कस्टमर टेस्टिमोनियल रीक्रिएशन

एजुकेशन और ट्रेनिंग

इंस्ट्रक्शनल कंटेंट, कोर्स मटेरियल्स और मल्टी-सीन एजुकेशनल नैरेटिव्स

एंटरटेनमेंट

शॉर्ट फिल्म्स, कैरेक्टर-ड्रिवेन स्टोरीज़, सिनेमैटिक सीक्वेंसेज़ और क्रिएटिव एक्सपेरिमेंट्स

प्री-विजुअलाइज़ेशन

फिल्म कॉन्सेप्ट डेवलपमेंट, स्टोरीबोर्ड क्रिएशन और प्रोडक्शन के लिए सीन प्लानिंग

Wan 2.6 T2V, I2V और R2V API इंटीग्रेशन

टेक्स्ट-टू-वीडियो, इमेज-टू-वीडियो और रेफरेंस-टू-वीडियो जनरेशन के लिए कम्पलीट API सूट

टेक्स्ट-टू-वीडियो API (T2V API)

हमारा Wan 2.6 T2V API ऑटोमैटिक सीन सेगमेंटेशन के साथ टेक्स्ट प्रॉम्प्ट्स को मल्टी-शॉट सिनेमैटिक वीडियो में ट्रांसफॉर्म करता है। नेटिव ऑडियो सिंक के साथ 15 सेकंड तक के प्रोफेशनल 1080p वीडियो जनरेट करें।

सिंगल प्रॉम्प्ट से मल्टी-शॉट स्टोरीटेलिंग

थ्री एक्ट स्ट्रक्चर के साथ 15-सेकंड की अवधि

कॉम्प्लेक्स सीन्स के लिए एन्हांस्ड प्रॉम्प्ट अंडरस्टैंडिंग

फ्लेक्सिबल आस्पेक्ट रेशियो: 16:9, 9:16, 1:1

इमेज-टू-वीडियो API (I2V API)

हमारा Wan 2.6 I2V API प्रिसाइज़ मोशन कंट्रोल और टेक्स्ट रेंडरिंग के साथ स्टिल इमेजेज़ को जीवंत करता है। प्रोडक्ट वीडियो, फोटो एनिमेशन और ब्रांडेड कंटेंट क्रिएशन के लिए परफेक्ट।

प्रोडक्ट्स और साइनेज के लिए प्रिसाइज़ टेक्स्ट रेंडरिंग

एनिमेशन फ्रेम्स में स्टाइल कंसिस्टेंसी

इम्प्रूव्ड कोहेरेंस के साथ नेचुरल मोशन

नैरेटिव-ऑप्टिमाइज़्ड विजुअल आउटपुट

रेफरेंस-टू-वीडियो API (R2V API)

हमारा Wan 2.6 R2V API रेफरेंस वीडियो से कैरेक्टर आइडेंटिटी को प्रिज़र्व करता है। कंसिस्टेंट कैरेक्टर जनरेशन के लिए अपीयरेंस, वॉइस और मूवमेंट पैटर्न्स एक्सट्रैक्ट करने के लिए 2-30 सेकंड की क्लिप्स अपलोड करें।

कैरेक्टर अपीयरेंस और आइडेंटिटी प्रिज़र्वेशन

वॉइस कैरेक्टरिस्टिक्स एक्सट्रैक्शन और रेप्लिकेशन

मूवमेंट पैटर्न एनालिसिस और रिप्रोडक्शन

मल्टी-कैरेक्टर सीन सपोर्ट

💡

कम्पलीट API सूट

सभी तीन Wan 2.6 API मोड्स (T2V API, I2V API, R2V API) कॉम्प्रिहेंसिव डॉक्यूमेंटेशन के साथ RESTful आर्किटेक्चर को सपोर्ट करते हैं। Python, Node.js और अधिक के लिए SDK के साथ शुरुआत करें। प्रत्येक एंडपॉइंट में नेटिव ऑडियो-विजुअल सिंक्रोनाइज़ेशन और फुल कमर्शियल यूसेज राइट्स शामिल हैं।

Wan 2.6 के साथ कैसे शुरुआत करें

दो सरल पथों के साथ मिनटों में प्रोफेशनल वीडियो बनाना शुरू करें

API इंटीग्रेशन

एप्लिकेशन बनाने वाले डेवलपर्स के लिए

साइन अप और लॉगिन करें

कंसोल तक पहुंचने के लिए अपना Atlas Cloud अकाउंट बनाएं या लॉगिन करें

पेमेंट मेथड जोड़ें

अपने अकाउंट को फंड करने के लिए बिलिंग सेक्शन में अपना क्रेडिट कार्ड बाइंड करें

API की जनरेट करें

Console → API Keys पर नेविगेट करें और अपनी ऑथेंटिकेशन की बनाएं

बिल्डिंग शुरू करें

Wan 2.6 को अपने एप्लिकेशन में इंटीग्रेट करने के लिए T2V, I2V या R2V API एंडपॉइंट्स का उपयोग करें

Playground अनुभव

क्विक टेस्टिंग और एक्सपेरिमेंटेशन के लिए

साइन अप और लॉगिन करें

प्लेटफॉर्म तक पहुंचने के लिए अपना Atlas Cloud अकाउंट बनाएं या लॉगिन करें

पेमेंट मेथड जोड़ें

शुरुआत करने के लिए बिलिंग सेक्शन में अपना क्रेडिट कार्ड बाइंड करें

Playground का उपयोग करें

Wan 2.6 playground पर जाएं, T2V/I2V/R2V मोड चुनें और तुरंत वीडियो जनरेट करें

💡

प्रो टिप: पहले Playground में विभिन्न जनरेशन मोड्स टेस्ट करें ताकि यह समझ सकें कि आपके यूज़ केस के लिए कौन सा बेस्ट काम करता है, फिर प्रोडक्शन स्केल के लिए संबंधित API को इंटीग्रेट करें।

अक्सर पूछे जाने वाले प्रश्न

Wan 2.6 की मल्टी-शॉट क्षमता को क्या यूनीक बनाता है?

Wan 2.6 स्टोरीबोर्ड लॉजिक को सच्चे अर्थ में समझने वाला पहला मॉडल है। Wan 2.5 के विपरीत जो मेसी "मॉर्फिंग" इफेक्ट्स बनाता था, Wan 2.6 ऑटोमैटिकली एक सिंगल प्रॉम्प्ट को सुसंगत ट्रांज़िशन के साथ मल्टिपल डिस्टिंक्ट शॉट्स में सेगमेंट कर सकता है, सीन चेंजेज़ में कैरेक्टर कंसिस्टेंसी बनाए रखता है।

रेफरेंस-टू-वीडियो (R2V) कैसे काम करता है?

2-30 सेकंड का रेफरेंस वीडियो अपलोड करें, और Wan 2.6 कैरेक्टर की अपीयरेंस, मूवमेंट पैटर्न्स और वॉइस कैरेक्टरिस्टिक्स एक्सट्रैक्ट करता है। फिर आप कंसिस्टेंट आइडेंटिटी के साथ उसी कैरेक्टर को फीचर करते हुए नए वीडियो जनरेट कर सकते हैं—कैरेक्टर-ड्रिवेन कंटेंट सीरीज़ बनाने के लिए आदर्श।

कौन से वीडियो फॉर्मेट और ड्यूरेशन सपोर्टेड हैं?

Wan 2.6, 5 से 15 सेकंड की अवधि के साथ 24fps पर 1080p वीडियो जनरेट करता है। समर्थित आस्पेक्ट रेशियो में 16:9 (YouTube), 9:16 (Instagram Reels/TikTok) और 1:1 (स्क्वायर फॉर्मेट) शामिल हैं, जो हर प्लेटफॉर्म के लिए ऑप्टिमाइज़्ड हैं और इन्हें किसी पोस्ट-प्रोडक्शन क्रॉपिंग की ज़रूरत नहीं होती।

क्या Wan 2.6 वीडियो में टेक्स्ट रेंडर कर सकता है?

हां! Wan 2.6 में प्रोडक्ट पैकेजिंग, साइनेज और ब्रांडेड कंटेंट के लिए इंडस्ट्री-लीडिंग टेक्स्ट रेंडरिंग है। मॉडल वीडियो फ्रेम्स के भीतर क्लियर, रीडेबल टेक्स्ट जनरेट कर सकता है—एक क्रिटिकल फीचर जो Seedance और ज़्यादातर प्रतियोगियों में नहीं है।

T2V, I2V और R2V मोड्स में क्या अंतर है?

T2V (टेक्स्ट-टू-वीडियो) मल्टी-शॉट क्षमता के साथ टेक्स्ट प्रॉम्प्ट्स से जनरेट करता है। I2V (इमेज-टू-वीडियो) प्रिसाइज़ टेक्स्ट रेंडरिंग के साथ स्टिल इमेजेज़ को एनिमेट करता है। R2V (रेफरेंस-टू-वीडियो) जनरेशन में कैरेक्टर आइडेंटिटी को प्रिज़र्व करने के लिए वीडियो रेफरेंसेज़ का उपयोग करता है। अपने इनपुट टाइप और कंसिस्टेंसी ज़रूरतों के आधार पर चुनें।

क्या मेरे पास जनरेटेड वीडियो पर कमर्शियल राइट्स हैं?

हां! प्रत्येक Wan 2.6 क्रिएशन फुल कमर्शियल यूसेज राइट्स के साथ आता है। वीडियो मार्केटिंग कैंपेन, क्लाइंट डिलीवरेबल्स, ब्रांडेड कंटेंट और कमर्शियल एप्लिकेशन के लिए प्रोडक्शन-रेडी हैं बिना अतिरिक्त लाइसेंसिंग रिक्वायरमेंट्स के।

Atlas Cloud पर Wan 2.6 का उपयोग क्यों करें?

अपने प्रोफेशनल वीडियो जनरेशन वर्कफ्लोज़ के लिए एंटरप्राइज़-ग्रेड इन्फ्रास्ट्रक्चर का लाभ उठाएं

पर्पज़-बिल्ट इन्फ्रास्ट्रक्चर

डिमांडिंग AI वीडियो वर्कलोड्स के लिए विशेष रूप से ऑप्टिमाइज़्ड इन्फ्रास्ट्रक्चर पर Wan 2.6 की मल्टी-शॉट जनरेशन और R2V क्षमताओं को डिप्लॉय करें। 1080p 15-सेकंड जनरेशन के लिए मैक्सिमम परफॉर्मेंस।

सभी मॉडल्स के लिए यूनिफाइड API

एक यूनिफाइड API के माध्यम से 400+ AI मॉडल्स (LLMs, इमेज, वीडियो, ऑडियो) के साथ Wan 2.6 (T2V, I2V, R2V) तक पहुंचें। कंसिस्टेंट ऑथेंटिकेशन के साथ आपकी सभी जेनेरेटिव AI ज़रूरतों के लिए सिंगल इंटीग्रेशन।

कंपीटिटिव प्राइसिंग

पारदर्शी, पे-एज़-यू-गो प्राइसिंग के साथ AWS की तुलना में 70% तक की बचत करें। कोई छिपा शुल्क नहीं, कोई प्रतिबद्धता नहीं—प्रोटोटाइप से प्रोडक्शन तक बिना किसी भारी खर्च के स्केल करें।

SOC 2 सर्टिफाइड सिक्योरिटी

SOC 2 सर्टिफिकेशन और HIPAA कंप्लायंस के साथ आपके रेफरेंस वीडियो और जनरेटेड कंटेंट प्रोटेक्टेड। एन्क्रिप्टेड ट्रांसमिशन और स्टोरेज के साथ एंटरप्राइज़-ग्रेड सिक्योरिटी।

99.9% अपटाइम SLA

गारंटीड 99.9% अपटाइम के साथ एंटरप्राइज़-ग्रेड रिलायबिलिटी। आपकी Wan 2.6 मल्टी-शॉट वीडियो जनरेशन प्रोडक्शन कैंपेन और क्रिटिकल कंटेंट वर्कफ्लोज़ के लिए हमेशा उपलब्ध है।

आसान इंटीग्रेशन

REST API और मल्टी-लैंग्वेज SDK (Python, Node.js, Go) के साथ मिनटों में कम्पलीट इंटीग्रेशन। यूनिफाइड एंडपॉइंट स्ट्रक्चर के साथ T2V, I2V और R2V मोड्स के बीच सीमलेसली स्विच करें।

99.9%

अपटाइम

70%

AWS की तुलना में कम लागत

400+

Gen AI मॉडल्स

24/7

प्रो सपोर्ट

तकनीकी विनिर्देश

Architecture

मल्टी-मोडल अंडरस्टैंडिंग के साथ एडवांस्ड ट्रांसफॉर्मर

Resolution

1080p (फ़ुल HD)

Frame Rate

24 FPS

Duration

5-15 सेकंड (मोड पर निर्भर)

Aspect Ratios

16:9, 9:16, 1:1

Generation Modes

T2V, I2V, R2V

Audio

लिप-सिंक के साथ नेटिव सिंक्रोनाइज़ेशन

Commercial Rights

फुल कमर्शियल यूसेज शामिल

प्रोफेशनल मल्टी-शॉट वीडियो जनरेशन का अनुभव करें

दुनिया भर के कंटेंट क्रिएटर्स, मार्केटर्स और फिल्ममेकर्स में शामिल हों जो Wan 2.6 की ग्राउंडब्रेकिंग मल्टी-शॉट स्टोरीटेलिंग और कैरेक्टर कंसिस्टेंसी क्षमताओं के साथ वीडियो प्रोडक्शन में क्रांति ला रहे हैं।

Alibaba WAN 2.6 Text-to-Image Model

Alibaba WAN 2.6 is a cutting-edge text-to-image model on Alibaba Cloud’s DashScope. It generates high-quality, detailed images directly from text prompts and supports multiple output resolutions. The Tongyi Wanxiang text-to-image model generates images based on text, supporting various artistic styles and realistic photography effects to meet diverse creative needs.

What makes it stand out?

High Fidelity: Wan 2.6 produces crisp, detailed images that capture complex scene descriptions and artistic styles.
Creative Flexibility: From product design mockups to character art, Wan 2.6 supports diverse use cases and genres.
Multiple Styles & Formats: Choose from photo-realistic, anime, sketch, or artistic rendering modes—adaptable to your creative vision.
Customizable Size: Easily adjust width and height with simple sliders. Set the exact dimensions you need.

Designed For

Design teams: Quick iterations on visuals, product concepts, and campaign mockups.
Content creators: Generate unique visuals for blogs, social posts, and digital branding.
Storytellers & artists: Visualize characters, scenes, and worlds from simple text prompts.
Enterprises: Efficiently produce consistent visuals across marketing, training, and documentation.

Pricing

Every image is just cost $0.03!!

Billing Rules

Minimum charge: 1 image.
Total cost = number of images × price per resolution.

How to Use

Write your prompt.
Submit your request.
Preview and download the generated image.

समान मॉडल देखें

Wan-2.7 Pro Image-to-image

Edits and recomposes images with Wan 2.7 image pro using text instructions and multi-image references for higher quality outputs.

Wan-2.7 Pro Text-to-image

Generates images from text prompts with Wan 2.7 image pro, supporting higher fidelity outputs and 4K-ready workflows.

Wan-2.7 Image-to-image

Edits and recomposes images with Wan 2.7 image using text instructions, multi-image references, and optional interaction boxes.

Wan-2.7 Text-to-image

Generates images from text prompts with Wan 2.7 image, supporting fast iteration and strong prompt fidelity for illustration and photorealistic outputs.

Qwen Image 2.0 Pro Text-to-image

Qwen Image 2.0 Pro is a professional-grade text-to-image model with superior quality and advanced prompt understanding. Up to 2k. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

Qwen Image 2.0 Pro Edit

Qwen Image 2.0 Pro Edit is a professional-grade image editing model with superior quality and advanced instruction understanding. Up to 2k. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

Qwen Image 2.0 Edit

Qwen Image 2.0 Edit is an advanced image-editing model with improved quality and better understanding of instructions. Up to 2k. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

Qwen Image 2.0 Text-to-image

Qwen Image 2.0 is an advanced text-to-image model with enhanced image quality and improved prompt understanding. Up to 2k. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

Qwen-Image Edit Plus 20251215

Supports multiple image inputs and outputs, allowing for precise modification of text within images, addition, deletion, or movement of objects, alteration of subject actions, transfer of image styles, and enhancement of image details.

From$0.03/छवि

$0.021/छवि

-30%