z-image/turbo-lora

Text-to-Image

TURBO

Z-Image-Turbo LoRA API by Atlas Cloud

Z-Image Turbo text-to-image generation with user-supplied LoRA weights, custom sizes, and reproducible output.

Input

Prompt *

Loras

Max: 3

Size

Width

Height

Default 1024 × 1024 px. Range: 256 - 1536

Seed

Num inference steps

Guidance scale

Output format

Civitai links require your API token appended: download_url&token=YOUR_TOKEN. Get it from Civitai Account Settings → API Keys.
Hugging Face links must point to a specific file (e.g. .safetensors), not a repository directory.
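
The two link rules above can be checked before a request is sent. The following is a minimal sketch, not part of the Atlas Cloud API: `lora_entry`, `WEIGHT_EXTENSIONS`, and the example URLs are hypothetical, and the check that a Hugging Face link ends in `.safetensors` is an assumption based on the "e.g." in the note.

```python
from typing import Optional
from urllib.parse import parse_qsl, urlencode, urlparse, urlunparse

# Assumption: only .safetensors is checked here; extend if other weight files are accepted.
WEIGHT_EXTENSIONS = (".safetensors",)


def _on_domain(host: str, domain: str) -> bool:
    """True if host is the domain itself or one of its subdomains."""
    return host == domain or host.endswith("." + domain)


def lora_entry(url: str, scale: float = 1.0, civitai_token: Optional[str] = None) -> dict:
    """Build one item of the `loras` array, applying the two link rules."""
    if not 0 <= scale <= 4:
        raise ValueError("scale must be between 0 and 4")
    parts = urlparse(url)
    host = (parts.hostname or "").lower()
    if _on_domain(host, "civitai.com"):
        if not civitai_token:
            raise ValueError("Civitai links need an API token")
        # Drop any stale token, then append the current one to the query string.
        query = [(k, v) for k, v in parse_qsl(parts.query, keep_blank_values=True) if k != "token"]
        query.append(("token", civitai_token))
        url = urlunparse(parts._replace(query=urlencode(query)))
    elif _on_domain(host, "huggingface.co"):
        if not parts.path.lower().endswith(WEIGHT_EXTENSIONS):
            raise ValueError("Hugging Face links must point to a file, not a repository")
    return {"path": url, "scale": scale}
```

The returned dictionary has the `path` and `scale` keys that each `loras` item expects, so up to three results can be placed directly in the request's `loras` list.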

Each run costs $0.01, so $10 covers about 1,000 runs.

Code example
import os
import time

import requests

# Read the key from the environment. Python does not expand "$ATLASCLOUD_API_KEY" inside a string.
API_KEY = os.environ["ATLASCLOUD_API_KEY"]

# Step 1: Start image generation
generate_url = "https://api.atlascloud.ai/api/v1/model/generateImage"
headers = {
    "Content-Type": "application/json",
    "Authorization": f"Bearer {API_KEY}"
}
data = {
    "model": "z-image/turbo-lora",  # Required. Model name
    "prompt": "A beautiful landscape with mountains and lake",  # Required. The positive prompt for the generation
    "loras": [],  # List of LoRA weights to apply. default: [] (max items: 3)
    "size": "1024*1024",  # Output image size in width*height format. (max: 1536)
    "seed": -1,  # Seed for reproducible output. Use -1 for random
    "num_inference_steps": 9,  # Number of inference steps used for generation. (min: 1)
    "guidance_scale": 1,  # Classifier-free guidance scale
    "output_format": "png",  # Image output format. options: png | jpeg | webp
    "enable_sync_mode": False,  # Wait for the result to be generated and uploaded before returning the response
    "enable_base64_output": False,  # Return base64 image data instead of an image URL
}

generate_response = requests.post(generate_url, headers=headers, json=data, timeout=30)
generate_response.raise_for_status()  # Stop early on HTTP errors such as 401
generate_result = generate_response.json()
prediction_id = generate_result["data"]["id"]

# Step 2: Poll for result
poll_url = f"https://api.atlascloud.ai/api/v1/model/prediction/{prediction_id}"

def check_status():
    while True:
        response = requests.get(poll_url, headers=headers, timeout=30)
        result = response.json()
        status = result["data"]["status"]

        # Both "completed" and "succeeded" are documented as finished states
        if status in ("completed", "succeeded"):
            print("Generated image:", result["data"]["outputs"][0])
            return result["data"]["outputs"][0]
        elif status == "failed":
            raise Exception(result["data"].get("error") or "Generation failed")
        else:
            # Still processing, wait 2 seconds
            time.sleep(2)

image_url = check_status()

Installation

Install the required dependency.

pip install requests

Authentication

All API requests are authenticated with an API key, which you can obtain from the Atlas Cloud console.

export ATLASCLOUD_API_KEY="your-api-key-here"

HTTP headers

import os

API_KEY = os.environ.get("ATLASCLOUD_API_KEY")
headers = {
    "Content-Type": "application/json",
    "Authorization": f"Bearer {API_KEY}"
}

Keep your API key safe

Never expose your API key in client-side code or public repositories. Use environment variables or a backend proxy.

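Submit a request

import os

import requests

url = "https://api.atlascloud.ai/api/v1/model/generateImage"
headers = {
    "Content-Type": "application/json",
    # Read the key from the environment; "$ATLASCLOUD_API_KEY" is not expanded inside a Python string
    "Authorization": f"Bearer {os.environ['ATLASCLOUD_API_KEY']}"
}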
data = {
    "model": "your-model",
    "prompt": "A beautiful landscape"
}

response = requests.post(url, headers=headers, json=data)
print(response.json())

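Submit a request

Submits an asynchronous generation request. The API returns a prediction ID that you can use to check the status and fetch the result.

POST /api/v1/model/generateImage

Request body

import os

import requests

url = "https://api.atlascloud.ai/api/v1/model/generateImage"
headers = {
    "Content-Type": "application/json",
    # Read the key from the environment; "$ATLASCLOUD_API_KEY" is not expanded inside a Python string
    "Authorization": f"Bearer {os.environ['ATLASCLOUD_API_KEY']}"
}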

data = {
    "model": "z-image/turbo-lora",
    "prompt": "A beautiful landscape with mountains and lake"
}

response = requests.post(url, headers=headers, json=data)
result = response.json()

print(f"Prediction ID: {result['data']['id']}")
print(f"Status: {result['data']['status']}")

Response

{
  "code": 200,
  "data": {
    "id": "pred_abc123",
    "status": "processing",
    "model": "model-name",
    "created_at": "2025-01-01T00:00:00Z"
  }
}
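
A caller usually needs only the `id` from this response. The sketch below shows one way to extract it defensively. The helper name `extract_prediction_id` is hypothetical, and treating any `code` other than 200 as a failure is an assumption, since the page only shows the success case.

```python
def extract_prediction_id(body: dict) -> str:
    """Return the prediction id from a submit response, or raise if it is missing."""
    # Assumption: a "code" other than 200 signals a rejected request.
    if body.get("code") != 200:
        raise RuntimeError(f"submit failed: {body}")
    data = body.get("data") or {}
    if "id" not in data:
        raise KeyError("response has no data.id field")
    return data["id"]
```

Passing the parsed JSON of the response above returns `"pred_abc123"`, which is the value to substitute into the polling URL.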

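Check status

Poll the prediction endpoint to check the current status of a request.

GET /api/v1/model/prediction/{prediction_id}

Polling example

import os
import time

import requests

prediction_id = "pred_abc123"
url = f"https://api.atlascloud.ai/api/v1/model/prediction/{prediction_id}"
# Read the key from the environment; "$ATLASCLOUD_API_KEY" is not expanded inside a Python string
headers = {"Authorization": f"Bearer {os.environ['ATLASCLOUD_API_KEY']}"}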

while True:
    response = requests.get(url, headers=headers)
    result = response.json()
    status = result["data"]["status"]
    print(f"Status: {status}")

    if status in ["completed", "succeeded"]:
        output_url = result["data"]["outputs"][0]
        print(f"Output URL: {output_url}")
        break
    elif status == "failed":
        print(f"Error: {result['data'].get('error', 'Unknown')}")
        break

    time.sleep(3)

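Status values

processing: The request is still being processed.

completed: Generation finished; the output is available.

succeeded: Generation succeeded; the output is available.

failed: Generation failed; check the error field.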
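
The status values can be turned into a polling loop that also gives up after a deadline, which the basic loop does not do. This is a sketch under assumptions: `poll_until_done` and its `fetch` callback are hypothetical names, and the default interval and timeout are arbitrary choices rather than documented limits.

```python
import time
from typing import Callable

# Both values are documented as "output available".
FINISHED = {"completed", "succeeded"}


def poll_until_done(fetch: Callable[[], dict], interval: float = 2.0, timeout: float = 120.0) -> list:
    """Call `fetch` until the prediction finishes, fails, or the timeout passes."""
    deadline = time.monotonic() + timeout
    while True:
        data = fetch()["data"]
        status = data["status"]
        if status in FINISHED:
            return data["outputs"]
        if status == "failed":
            raise RuntimeError(data.get("error") or "Generation failed")
        if time.monotonic() >= deadline:
            raise TimeoutError(f"prediction still '{status}' after {timeout} s")
        time.sleep(interval)
```

With `requests`, `fetch` could be `lambda: requests.get(url, headers=headers, timeout=30).json()`. Keeping the HTTP call outside the loop makes the status logic easy to test without network access.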

Completed response

{
  "data": {
    "id": "pred_abc123",
    "status": "completed",
    "outputs": [
      "https://storage.atlascloud.ai/outputs/result.png"
    ],
    "metrics": {
      "predict_time": 8.3
    },
    "created_at": "2025-01-01T00:00:00Z",
    "completed_at": "2025-01-01T00:00:10Z"
  }
}

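Upload a file

Upload a file to Atlas Cloud storage to get a URL that you can use in API requests. Uploads use multipart/form-data.

POST /api/v1/model/uploadMedia

Upload example

import os

import requests

url = "https://api.atlascloud.ai/api/v1/model/uploadMedia"
# Read the key from the environment; "$ATLASCLOUD_API_KEY" is not expanded inside a Python string
headers = {"Authorization": f"Bearer {os.environ['ATLASCLOUD_API_KEY']}"}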

with open("image.png", "rb") as f:
    files = {"file": ("image.png", f, "image/png")}
    response = requests.post(url, headers=headers, files=files)

result = response.json()
download_url = result["data"]["download_url"]
print(f"File URL: {download_url}")

Response

{
  "data": {
    "download_url": "https://storage.atlascloud.ai/uploads/abc123/image.png",
    "file_name": "image.png",
    "content_type": "image/png",
    "size": 1024000
  }
}

Input Schema

The request body accepts the following parameters.

Total: 10, required: 2, optional: 8

model (string, required)

Model name.

Default: "z-image/turbo-lora"

prompt (string, required)

The positive prompt for the generation.

loras (array[object])

List of LoRA weights to apply.

Default: []. Max items: 3

loras[].path (string, required)

URL or path to the LoRA weights.

loras[].scale (number)

Scale used before merging the LoRA weights.

Default: 1. Min: 0. Max: 4

size (string)

Output image size in width*height format.

Default: "1024*1024". Max: 1536

seed (integer)

Seed for reproducible output. Use -1 for random.

Default: -1

num_inference_steps (integer)

Number of inference steps used for generation.

Default: 9. Min: 1

guidance_scale (number)

Classifier-free guidance scale. The default value follows the Z-Image Turbo path.

Default: 1

output_format (string)

Image output format.

Default: "png"

Options: png, jpeg, webp

enable_sync_mode (boolean)

Wait for the result to be generated and uploaded before returning the response. This property is only available through the API.

Default: false

enable_base64_output (boolean)

Return base64 image data instead of an image URL. This property is only available through the API.

Default: false
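
When `enable_base64_output` is true, each `outputs` entry holds image data rather than a URL. The page does not specify the exact encoding, so the following is only a sketch: `save_base64_output` is a hypothetical helper, and it assumes the payload is standard base64 that may or may not carry a `data:...;base64,` prefix.

```python
import base64
from pathlib import Path


def save_base64_output(payload: str, path: str) -> int:
    """Decode one `outputs` entry and write it to `path`; returns the bytes written."""
    # Assumption: strip a data-URI prefix such as "data:image/png;base64," if present.
    if payload.startswith("data:") and "," in payload:
        payload = payload.split(",", 1)[1]
    raw = base64.b64decode(payload)
    return Path(path).write_bytes(raw)
```

The file extension passed in `path` should match the requested `output_format`, since the helper writes the decoded bytes unchanged.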

Request body example

{
  "model": "z-image/turbo-lora",
  "prompt": "A beautiful landscape",
  "loras": [],
  "size": "1024*1024",
  "seed": -1,
  "num_inference_steps": 9,
  "guidance_scale": 1,
  "output_format": "png",
  "enable_sync_mode": false,
  "enable_base64_output": false
}
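
The documented limits can be enforced on the client before a paid request is submitted. This is a minimal sketch: `build_payload` is a hypothetical helper, not an Atlas Cloud function. The 1536 maximum, the three-LoRA limit, the scale range, and the format options come from the schema; the 256 lower bound on each side is taken from the playground's size range and is an assumption for the API.

```python
ALLOWED_FORMATS = {"png", "jpeg", "webp"}


def build_payload(prompt, width=1024, height=1024, loras=(), seed=-1,
                  steps=9, output_format="png"):
    """Validate the inputs against the documented limits and return a request body."""
    if not prompt:
        raise ValueError("prompt is required")
    for side in (width, height):
        # Assumption: the playground range 256-1536 also applies to API requests.
        if not 256 <= side <= 1536:
            raise ValueError("width and height must be between 256 and 1536")
    if len(loras) > 3:
        raise ValueError("at most 3 LoRAs are allowed")
    for lora in loras:
        if "path" not in lora:
            raise ValueError("each LoRA needs a path")
        if not 0 <= lora.get("scale", 1) <= 4:
            raise ValueError("LoRA scale must be between 0 and 4")
    if steps < 1:
        raise ValueError("num_inference_steps must be at least 1")
    if output_format not in ALLOWED_FORMATS:
        raise ValueError("output_format must be png, jpeg, or webp")
    return {
        "model": "z-image/turbo-lora",
        "prompt": prompt,
        "loras": list(loras),
        "size": f"{width}*{height}",
        "seed": seed,
        "num_inference_steps": steps,
        "output_format": output_format,
    }
```

For example, `build_payload("a cat", 768, 1024, seed=42)` produces a body with `"size": "768*1024"` that can be passed as the `json` argument of the submit request.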

Output Schema

The API returns a prediction response that contains the URLs of the generated outputs.

created_at (string)

ISO timestamp when the request was created.

id (string)

Unique prediction id.

model (string)

Model id used for the prediction.

outputs (array)

Generated image URLs or base64 payloads.

status (string)

Task status.

Response example

{
  "id": "pred_abc123",
  "status": "completed",
  "model": "model-name",
  "outputs": [
    "https://storage.atlascloud.ai/outputs/result.png"
  ],
  "metrics": {
    "predict_time": 8.3
  },
  "created_at": "2025-01-01T00:00:00Z",
  "completed_at": "2025-01-01T00:00:10Z"
}
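
The response example above is flat, while the polling examples on this page wrap the same fields in a `data` object. A parser that tolerates both shapes avoids a `KeyError` in either case. The sketch below uses hypothetical names (`Prediction`, `parse_prediction`) and covers only the fields shown on this page.

```python
from dataclasses import dataclass, field
from typing import Optional


@dataclass
class Prediction:
    id: str
    status: str
    model: str = ""
    outputs: list = field(default_factory=list)
    predict_time: Optional[float] = None


def parse_prediction(body: dict) -> Prediction:
    """Build a Prediction from either the flat or the data-wrapped response shape."""
    # Assumption: when a "data" key is present, it holds the prediction fields.
    data = body.get("data", body)
    return Prediction(
        id=data["id"],
        status=data["status"],
        model=data.get("model", ""),
        outputs=list(data.get("outputs") or []),
        predict_time=(data.get("metrics") or {}).get("predict_time"),
    )
```

Fields that are absent while a request is still processing, such as `outputs` and `metrics`, fall back to an empty list and `None`.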

Atlas Cloud Skills

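Atlas Cloud Skills integrates 400+ AI models directly into your AI coding assistant. Install it with one command, then generate images and videos and chat with LLMs in natural language.

Supported clients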

Claude Code

OpenAI Codex

Gemini CLI

Cursor

Windsurf

VS Code

Trae

GitHub Copilot

Cline

Roo Code

Amp

Goose

Replit

40+ supported clients

Installation

npx skills add AtlasCloudAI/atlas-cloud-skills

Set the API key

Get an API key from the Atlas Cloud console and set it as an environment variable.

export ATLASCLOUD_API_KEY="your-api-key-here"

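Features

Once installed, you can access all Atlas Cloud models from your AI assistant using natural language.

Image generation: Generate images with models such as Nano Banana 2 and Z-Image.

Video creation: Create videos from text or images with models such as Kling, Vidu, and Veo.

LLM chat: Chat with large language models such as Qwen and DeepSeek.

Media upload: Upload local files for image-editing and image-to-video workflows.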

MCP Server

Atlas Cloud MCP Server connects your IDE to 400+ AI models through the Model Context Protocol. It works with any MCP-compatible client.

Supported clients

Cursor

VS Code

Windsurf

Claude Code

OpenAI Codex

Gemini CLI

Cline

Roo Code

100+ supported clients

Installation

npx -y atlascloud-mcp

Configuration

Add the following configuration to your IDE's MCP settings file.

{
  "mcpServers": {
    "atlascloud": {
      "command": "npx",
      "args": [
        "-y",
        "atlascloud-mcp"
      ],
      "env": {
        "ATLASCLOUD_API_KEY": "your-api-key-here"
      }
    }
  }
}

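Available tools

atlas_generate_image: Generate images from a text prompt.

atlas_generate_video: Create videos from text or images.

atlas_chat: Chat with large language models.

atlas_list_models: Browse the 400+ available AI models.

atlas_quick_generate: One-step content creation that automatically picks the best model.

atlas_upload_media: Upload local files for API workflows.

Learn more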

github.com/AtlasCloudAI/mcp-server

API Schema

{
  "components": {
    "schemas": {
      "Input": {
        "type": "object",
        "required": [
          "model",
          "prompt"
        ],
        "properties": {
          "model": {
            "type": "string",
            "description": "Model name.",
            "default": "z-image/turbo-lora"
          },
          "prompt": {
            "type": "string",
            "description": "The positive prompt for the generation."
          },
          "loras": {
            "type": "array",
            "description": "List of LoRA weights to apply.",
            "default": [],
            "maxItems": 3,
            "items": {
              "$ref": "#/components/schemas/LoraWeight"
            }
          },
          "size": {
            "type": "string",
            "description": "Output image size in width*height format.",
            "default": "1024*1024",
            "maximum": 1536
          },
          "seed": {
            "type": "integer",
            "description": "Seed for reproducible output. Use -1 for random.",
            "default": -1
          },
          "num_inference_steps": {
            "type": "integer",
            "description": "Number of inference steps used for generation.",
            "default": 9,
            "minimum": 1
          },
          "guidance_scale": {
            "type": "number",
            "description": "Classifier-free guidance scale. The default value follows the Z-Image Turbo path.",
            "default": 1
          },
          "output_format": {
            "type": "string",
            "description": "Image output format.",
            "default": "png",
            "enum": [
              "png",
              "jpeg",
              "webp"
            ]
          },
          "enable_sync_mode": {
            "type": "boolean",
            "description": "Wait for the result to be generated and uploaded before returning the response. This property is only available through the API.",
            "default": false,
            "disabled": true
          },
          "enable_base64_output": {
            "type": "boolean",
            "description": "Return base64 image data instead of an image URL. This property is only available through the API.",
            "default": false,
            "disabled": true
          }
        },
        "x-order-properties": [
          "model",
          "prompt",
          "loras",
          "size",
          "seed",
          "num_inference_steps",
          "guidance_scale",
          "output_format",
          "enable_sync_mode",
          "enable_base64_output"
        ]
      },
      "LoraWeight": {
        "type": "object",
        "required": [
          "path"
        ],
        "properties": {
          "path": {
            "type": "string",
            "description": "URL or path to the LoRA weights."
          },
          "scale": {
            "type": "number",
            "description": "Scale used before merging the LoRA weights.",
            "default": 1,
            "minimum": 0,
            "maximum": 4
          }
        },
        "x-order-properties": [
          "path",
          "scale"
        ]
      },
      "PredictionResponse": {
        "type": "object",
        "properties": {
          "created_at": {
            "type": "string",
            "format": "date-time",
            "description": "ISO timestamp when the request was created."
          },
          "has_nsfw_contents": {
            "type": "array",
            "description": "NSFW detection result for each output.",
            "items": {
              "type": "boolean"
            }
          },
          "id": {
            "type": "string",
            "description": "Unique prediction id."
          },
          "model": {
            "type": "string",
            "description": "Model id used for the prediction."
          },
          "outputs": {
            "type": "array",
            "description": "Generated image URLs or base64 payloads.",
            "items": {
              "type": "string"
            }
          },
          "status": {
            "type": "string",
            "description": "Task status."
          },
          "urls": {
            "type": "object",
            "description": "Related API endpoints."
          }
        }
      }
    },
    "securitySchemes": {
      "apiKeyAuth": {
        "type": "apiKey",
        "in": "header",
        "name": "Authorization"
      }
    }
  },
  "info": {
    "title": "AtlasCloud API",
    "version": "1.0.0",
    "description": "The AtlasCloud API."
  },
  "openapi": "3.0.0",
  "paths": {
    "/api/v1/model/prediction/{request_id}": {
      "get": {
        "parameters": [
          {
            "in": "path",
            "name": "request_id",
            "required": true,
            "schema": {
              "type": "string",
              "description": "Request ID"
            }
          }
        ],
        "responses": {
          "200": {
            "description": "Result of the request.",
            "content": {
              "application/json": {
                "schema": {
                  "$ref": "#/components/schemas/PredictionResponse"
                }
              }
            }
          }
        }
      },
      "x-api-name": "model_result"
    },
    "/api/v1/model/generateImage": {
      "post": {
        "requestBody": {
          "required": true,
          "content": {
            "application/json": {
              "schema": {
                "$ref": "#/components/schemas/Input"
              }
            }
          }
        },
        "responses": {
          "200": {
            "description": "The request status.",
            "content": {
              "application/json": {
                "schema": {
                  "$ref": "#/components/schemas/PredictionResponse"
                }
              }
            }
          }
        }
      },
      "x-api-name": "model_run"
    }
  },
  "servers": [
    {
      "url": "https://api.atlascloud.ai"
    }
  ]
}

LLM-friendly prompt template

# z-image/turbo-lora

> Z-Image Turbo text-to-image generation with user-supplied LoRA weights, custom sizes, and reproducible output.


## Overview

- **Submit endpoint (POST)**: `https://api.atlascloud.ai/api/v1/model/generateImage` — start an async generation; returns a `prediction_id`
- **Poll endpoint (GET)**: `https://api.atlascloud.ai/api/v1/model/prediction/{prediction_id}` — poll this until the prediction finishes
- **Model ID**: `z-image/turbo-lora`


## API Information

This model can be used via our HTTP API or more conveniently via our client libraries.
See the input and output schema below, as well as the usage examples.


### Input Schema

The API accepts the following input parameters:

- **`model`** (`string`, _required_):
  Model name.
  - Default: `"z-image/turbo-lora"`

- **`prompt`** (`string`, _required_):
  The positive prompt for the generation.

- **`loras`** (`array`, _optional_):
  List of LoRA weights to apply.
  - Default: `[]`
  - Max items: 3

- **`size`** (`string`, _optional_):
  Output image size in width*height format.
  - Default: `"1024*1024"`
  - Max: 1536

- **`seed`** (`integer`, _optional_):
  Seed for reproducible output. Use -1 for random.
  - Default: `-1`

- **`num_inference_steps`** (`integer`, _optional_):
  Number of inference steps used for generation.
  - Default: `9`
  - Min: 1

- **`guidance_scale`** (`number`, _optional_):
  Classifier-free guidance scale. The default value follows the Z-Image Turbo path.
  - Default: `1`

- **`output_format`** (`string`, _optional_):
  Image output format.
  - Default: `"png"`
  - Options: "png", "jpeg", "webp"

- **`enable_sync_mode`** (`boolean`, _optional_):
  Wait for the result to be generated and uploaded before returning the response. This property is only available through the API.
  - Default: `false`

- **`enable_base64_output`** (`boolean`, _optional_):
  Return base64 image data instead of an image URL. This property is only available through the API.
  - Default: `false`



**Required Parameters Example**:

```json
{
  "model": "z-image/turbo-lora",
  "prompt": ""
}
```


**Full Example**:

```json
{
  "model": "z-image/turbo-lora",
  "prompt": "",
  "loras": [],
  "size": "1024*1024",
  "seed": -1,
  "num_inference_steps": 9,
  "guidance_scale": 1,
  "output_format": "png",
  "enable_sync_mode": false,
  "enable_base64_output": false
}
```


### Output Schema

The API returns the following output format:


- **`created_at`** (`string`, _optional_):
  ISO timestamp when the request was created.

- **`has_nsfw_contents`** (`array[boolean]`, _optional_):
  NSFW detection result for each output.

- **`id`** (`string`, _optional_):
  Unique prediction id.

- **`model`** (`string`, _optional_):
  Model id used for the prediction.

- **`outputs`** (`array[string]`, _optional_):
  Generated image URLs or base64 payloads.

- **`status`** (`string`, _optional_):
  Task status.

- **`urls`** (`object`, _optional_):
  Related API endpoints.



**Example Response**:

```json
{
  "created_at": "",
  "has_nsfw_contents": [],
  "id": "",
  "model": "",
  "outputs": [
    ""
  ],
  "status": "",
  "urls": {}
}
```


## Usage Examples

### cURL

```bash
# Step 1: Start generation (async)
curl -X POST "https://api.atlascloud.ai/api/v1/model/generateImage" \
  -H "Authorization: Bearer $ATLASCLOUD_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{
  "model": "z-image/turbo-lora",
  "prompt": "",
  "loras": [],
  "size": "1024*1024",
  "seed": -1,
  "num_inference_steps": 9,
  "guidance_scale": 1,
  "output_format": "png",
  "enable_sync_mode": false,
  "enable_base64_output": false
}'

# Response will contain: {"code": 200, "data": {"id": "prediction_id", "status": "processing"}}

# Step 2: Poll for result (replace {prediction_id} with the id returned above)
curl -X GET "https://api.atlascloud.ai/api/v1/model/prediction/{prediction_id}" \
  -H "Authorization: Bearer $ATLASCLOUD_API_KEY"

# Keep polling until status is "completed", "succeeded" or "failed"
# When completed, outputs will contain the generated content URL(s)
```

## Additional Resources

### Documentation

- [Model Playground](https://www.atlascloud.ai/models/z-image/turbo-lora)

A dramatic cinematic movie poster with dark atmospheric lighting, central silhouette of a lone figure, smoky background, bold large title text “BEYOND THE SHADOWS”, smaller tagline “Every secret has a price”, credit block at bottom, lens flare, professional Hollywood poster style

A hyper-realistic close-up of a giant ramen bowl with a tiny glowing UFO slowly descending onto the noodles, steam rising, rich broth reflections, surreal but delicious-looking detail

An ultra-detailed robot sitting in a quiet garden carefully repairing a broken flower stem with tiny mechanical tools, soft sunlight, beautiful depth of field, metallic and organic textures blending naturally

A photorealistic macro shot of a fragile soap bubble floating in the air, but inside it is a vivid swirling galaxy with stars and nebulae, high detail, cosmic glow, razor-sharp reflections on bubble surface

a gigantic neon-colored snake coiled around a disco stage, mirror-ball scales reflecting rainbow lights, people dancing below, hyper-realistic glitter particles floating through the air

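Z-Image Turbo - Ultra-fast text-to-image model

Alibaba's strategic model lineup

Alibaba offers three specialized AI image generation systems, each optimized for a different use case.

Speed champion

Z-Image Turbo

Tongyi Wanxiang team

Best for: speed-critical production workloads

⚡ Fastest: 8-step inference, sub-second generation
🏆 Ranked #1 among open-source models
💰 Most cost-effective ($0.005 per image)
🎯 Rapid iteration

Quality leader

Qwen-Image

Tongyi Qianwen team

Best for: highest-quality final renders

🎨 Unmatched realism and skin texture
💡 Excellent lighting interaction
⏱️ Slower (20 s vs. 5-10 s for Z-Image)
🎯 Suited to high-end production work

Versatile specialist

Wan 2.5/2.6

Tongyi Wanxiang team

Best for: multimedia versatility

🎬 Text-to-video + image-to-video
📹 Multi-resolution support (480P-720P)
🔄 Audio-video sync
🎯 Cross-modal content generation

Key insight: Z-Image Turbo is 1.31-1.41× faster per step than Qwen-Image, which makes it well suited to applications that need fast generation. Qwen-Image has a slight edge in the realism of final renders, but Z-Image Turbo offers the best balance of speed and quality in production.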

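Technical highlights

Performance

S3-DiT architecture

Uses a single-stream diffusion Transformer (S3-DiT) architecture that handles all conditioning inputs in a unified way. The 6-billion-parameter design delivers professional-grade results without the compute overhead of larger models, while maintaining state-of-the-art quality.

Speed

Decoupled-DMD distillation

An advanced distillation algorithm with CFG augmentation and distribution matching enables 8-step inference (competitors need 20-50 steps). Generation is sub-second on an H800 GPU and runs smoothly on consumer RTX 3060/4090 cards (16 GB VRAM).

Quality

Leading open-source performance

The top-ranked open-source model on the Artificial Analysis Image Arena, ahead of FLUX.2 [dev], HunyuanImage 3.0, and Qwen-Image. It excels at bilingual Chinese/English text rendering, photorealistic image generation, and strong instruction following. It is released under the Apache 2.0 license, which permits commercial use.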

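Ideal for

🎨 Digital art creation

📸 Product photography

📊 Marketing assets

🎬 Concept design

📱 Social media content

🖼️ Stock photography

🎮 Game assets

✨ Creative prototyping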

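Why choose Z-Image Turbo

⚡ Instant generation

Sub-second generation with zero cold-start latency. Get your images immediately, with no waiting.

💰 Cost-effective

Affordable pricing at just $0.005 per image. Scale your creative projects without worrying about budget.

🔌 Ready-to-use API

Simple REST API integration. With our comprehensive documentation, you can start generating images within minutes.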

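Technical specifications

Model size: 6 billion parameters

Inference steps: 8 NFEs (number of function evaluations)

Generation speed: sub-second on H800; 5-10 seconds on consumer GPUs

VRAM requirement: 16 GB (compatible with RTX 3060/4090)

Architecture: single-stream diffusion Transformer (S3-DiT)

Distillation method: Decoupled-DMD with CFG augmentation

License: Apache 2.0 (commercial use permitted)

Ranking: #1 open-source model on Artificial Analysis Arena

Price: $0.005 per image

Get started with Z-Image Turbo

Experience ultra-fast, photorealistic image generation. No setup is required: call our API and start creating.

Zero cold starts - instant generation

Affordable pricing - $0.005 per image

Professional-quality results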

Z-Image-Turbo LoRA — 6B-parameter, ultra-fast text-to-image with custom styles

Z-Image-Turbo LoRA is a personalised version of Tongyi-MAI’s 6B-parameter Z-Image-Turbo model. It keeps the same 8-step, ultra-fast sampler and low VRAM footprint, while letting you plug in up to three LoRA adapters to inject your own styles, characters, or brand identity into each generation.

Ultra-fast generation with LoRA personalisation

Where many diffusion models need dozens of steps, Z-Image-Turbo LoRA stays aggressively optimised around 8 sampling steps. On top of that, it adds LoRA hooks so you can steer the visual style without retraining the base model—perfect for interactive products, dashboards, and large-scale backends that still need a branded look.

Why it looks so good

• Photorealistic output at speed: Generates high-fidelity, realistic images suitable for product photos, hero banners, and UI visuals, now with your own LoRA styles layered on top.

• Bilingual prompts and text: Understands prompts in English and Chinese and can render multilingual on-image text, ideal for cross-market campaigns and UI screenshots.

• LoRA-powered customisation: Attach up to 3 LoRAs per request to add a specific art style, character look, or brand aesthetic without touching the base weights.

• Low-latency, low-step design: Only 8 function evaluations per image keep latency extremely low, ideal for chatbots, configuration tools, design assistants, and any "type → image" workflow.

• Friendly VRAM footprint: Runs well in 16 GB VRAM environments, reducing hardware costs and making local or edge deployments more realistic, even with LoRAs enabled.

• Scales for bulk generation: The efficient sampler keeps large jobs (catalogues, continuous feeds, mass thumbnail generation) practical, even when every image uses one or more LoRAs.

• Reproducible generations: A controllable seed parameter lets you recreate previous images or generate small, controlled variations for brand safety and experimentation.
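
One way to use the seed for controlled experiments is to keep the prompt and LoRAs fixed and vary only the seed across a small batch. The sketch below assumes that approach. `seed_variations` is a hypothetical helper, and how much neighbouring seeds differ visually is not stated by the source.

```python
import copy


def seed_variations(base_payload: dict, base_seed: int, count: int) -> list:
    """Return `count` copies of the payload with consecutive fixed seeds."""
    if base_seed < 0:
        raise ValueError("use a non-negative seed; -1 means random")
    payloads = []
    for offset in range(count):
        payload = copy.deepcopy(base_payload)  # leave the caller's payload untouched
        payload["seed"] = base_seed + offset
        payloads.append(payload)
    return payloads
```

Each returned payload can be submitted as a separate request. Resubmitting one of them with the same seed and settings is how a previous image would be recreated.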

How to use

prompt – natural-language description of the scene, style, and any on-image text (English or Chinese).
size (width / height) – choose the output resolution that fits your use case.
seed – set to -1 for random results, or use a fixed integer to make outputs reproducible.
loras – optional list of up to three LoRA adapters, each with:
  path – a LoRA identifier such as <owner>/<model-name> or a direct .safetensors URL.
  scale – numeric strength for that LoRA; higher values apply a stronger stylistic effect.

You can click “Add Item” in the loras panel to add 1–3 LoRAs. They are combined during generation, so a single prompt can mix, for example, a character LoRA, a style LoRA, and a brand-colour LoRA.

Pricing

Simple per-image billing:

$0.01 per generated image
