
Qwen-Image Text-to-Image Max API by Alibaba
General-purpose image generation model that supports various art styles and is particularly good at rendering complex text.
INPUT
OUTPUT
IdleYour request will cost $0.052 per run. For $10 you can run this model approximately 192 times.
Here's what you can do next:
Code Example
import requests
import time
# Step 1: Start image generation
generate_url = "https://api.atlascloud.ai/api/v1/model/generateImage"
headers = {
"Content-Type": "application/json",
"Authorization": "Bearer $ATLASCLOUD_API_KEY"
}
data = {
"model": "alibaba/qwen-image/text-to-image-max",
"prompt": "A beautiful landscape with mountains and lake",
"width": 512,
"height": 512,
"steps": 20,
"guidance_scale": 7.5,
}
generate_response = requests.post(generate_url, headers=headers, json=data)
generate_result = generate_response.json()
prediction_id = generate_result["data"]["id"]
# Step 2: Poll for result
poll_url = f"https://api.atlascloud.ai/api/v1/model/prediction/{prediction_id}"
def check_status():
while True:
response = requests.get(poll_url, headers={"Authorization": "Bearer $ATLASCLOUD_API_KEY"})
result = response.json()
if result["data"]["status"] == "completed":
print("Generated image:", result["data"]["outputs"][0])
return result["data"]["outputs"][0]
elif result["data"]["status"] == "failed":
raise Exception(result["data"]["error"] or "Generation failed")
else:
# Still processing, wait 2 seconds
time.sleep(2)
image_url = check_status()Install
Install the required package for your language.
pip install requestsAuthentication
All API requests require authentication via an API key. You can get your API key from the Atlas Cloud dashboard.
export ATLASCLOUD_API_KEY="your-api-key-here"HTTP Headers
import os
API_KEY = os.environ.get("ATLASCLOUD_API_KEY")
headers = {
"Content-Type": "application/json",
"Authorization": f"Bearer {API_KEY}"
}Never expose your API key in client-side code or public repositories. Use environment variables or a backend proxy instead.
Submit a request
import requests
url = "https://api.atlascloud.ai/api/v1/model/generateImage"
headers = {
"Content-Type": "application/json",
"Authorization": "Bearer $ATLASCLOUD_API_KEY"
}
data = {
"model": "your-model",
"prompt": "A beautiful landscape"
}
response = requests.post(url, headers=headers, json=data)
print(response.json())Submit a Request
Submit an asynchronous generation request. The API returns a prediction ID that you can use to check the status and retrieve the result.
/api/v1/model/generateImageRequest Body
import requests
url = "https://api.atlascloud.ai/api/v1/model/generateImage"
headers = {
"Content-Type": "application/json",
"Authorization": "Bearer $ATLASCLOUD_API_KEY"
}
data = {
"model": "alibaba/qwen-image/text-to-image-max",
"input": {
"prompt": "A beautiful landscape with mountains and lake"
}
}
response = requests.post(url, headers=headers, json=data)
result = response.json()
print(f"Prediction ID: {result['id']}")
print(f"Status: {result['status']}")Response
{
"id": "pred_abc123",
"status": "processing",
"model": "model-name",
"created_at": "2025-01-01T00:00:00Z"
}Check Status
Poll the prediction endpoint to check the current status of your request.
/api/v1/model/prediction/{prediction_id}Polling Example
import requests
import time
prediction_id = "pred_abc123"
url = f"https://api.atlascloud.ai/api/v1/model/prediction/{prediction_id}"
headers = { "Authorization": "Bearer $ATLASCLOUD_API_KEY" }
while True:
response = requests.get(url, headers=headers)
result = response.json()
status = result["data"]["status"]
print(f"Status: {status}")
if status in ["completed", "succeeded"]:
output_url = result["data"]["outputs"][0]
print(f"Output URL: {output_url}")
break
elif status == "failed":
print(f"Error: {result['data'].get('error', 'Unknown')}")
break
time.sleep(3)Status Values
processingThe request is still being processed.completedGeneration is complete. Outputs are available.succeededGeneration succeeded. Outputs are available.failedGeneration failed. Check the error field.Completed Response
{
"data": {
"id": "pred_abc123",
"status": "completed",
"outputs": [
"https://storage.atlascloud.ai/outputs/result.png"
],
"metrics": {
"predict_time": 8.3
},
"created_at": "2025-01-01T00:00:00Z",
"completed_at": "2025-01-01T00:00:10Z"
}
}Upload Files
Upload files to Atlas Cloud storage and get a URL you can use in your API requests. Use multipart/form-data to upload.
/api/v1/model/uploadMediaUpload Example
import requests
url = "https://api.atlascloud.ai/api/v1/model/uploadMedia"
headers = { "Authorization": "Bearer $ATLASCLOUD_API_KEY" }
with open("image.png", "rb") as f:
files = {"file": ("image.png", f, "image/png")}
response = requests.post(url, headers=headers, files=files)
result = response.json()
download_url = result["data"]["download_url"]
print(f"File URL: {download_url}")Response
{
"data": {
"download_url": "https://storage.atlascloud.ai/uploads/abc123/image.png",
"file_name": "image.png",
"content_type": "image/png",
"size": 1024000
}
}Input Schema
The following parameters are accepted in the request body.
No parameters available.
Example Request Body
{
"model": "alibaba/qwen-image/text-to-image-max"
}Output Schema
The API returns a prediction response with the generated output URLs.
Example Response
{
"id": "pred_abc123",
"status": "completed",
"model": "model-name",
"outputs": [
"https://storage.atlascloud.ai/outputs/result.png"
],
"metrics": {
"predict_time": 8.3
},
"created_at": "2025-01-01T00:00:00Z",
"completed_at": "2025-01-01T00:00:10Z"
}Atlas Cloud Skills
Atlas Cloud Skills integrates 300+ AI models directly into your AI coding assistant. One command to install, then use natural language to generate images, videos, and chat with LLMs.
Supported Clients
Install
npx skills add AtlasCloudAI/atlas-cloud-skillsSetup API Key
Get your API key from the Atlas Cloud dashboard and set it as an environment variable.
export ATLASCLOUD_API_KEY="your-api-key-here"Capabilities
Once installed, you can use natural language in your AI assistant to access all Atlas Cloud models.
MCP Server
Atlas Cloud MCP Server connects your IDE with 300+ AI models via the Model Context Protocol. Works with any MCP-compatible client.
Supported Clients
Install
npx -y atlascloud-mcpConfiguration
Add the following configuration to your IDE's MCP settings file.
{
"mcpServers": {
"atlascloud": {
"command": "npx",
"args": [
"-y",
"atlascloud-mcp"
],
"env": {
"ATLASCLOUD_API_KEY": "your-api-key-here"
}
}
}
}Available Tools
API Schema
Schema not availableNo examples available
Please log in to view request history
You need to be logged in to access your model request history.
Log InAlibaba Qwen-Image Text-to-Image Max
The flagship text-to-image generation model from Alibaba Cloud, designed to deliver state-of-the-art visual quality, exceptional prompt adherence, and rich artistic detail. Qwen-Image Max represents the pinnacle of the Qwen-Image family, capable of transforming complex text descriptions into stunning, high-resolution visuals suitable for professional and creative workflows.
Overview
- Purpose: Generate premium-quality images from natural language descriptions.
- Core Capability: Industry-leading visual fidelity with deep semantic understanding of prompts.
- Foundation: Built on Alibaba's advanced large-scale multi-modal architecture.
- Typical Output: High-resolution, photorealistic or artistic images with precise lighting, texture, and composition.
- Use Cases: Professional design, advertising creatives, concept art, marketing materials, and high-end content creation.
Key Features
- Superior Visual Quality: Delivers the highest level of detail, texture, and lighting realism available in the Qwen-Image series.
- Complex Prompt Understanding: Accurately interprets long, intricate prompts, including spatial relationships, artistic styles, and specific object attributes.
- Text Rendering: Enhanced capability to render legible text within generated images (e.g., signboards, posters).
- Style Versatility: Masterfully handles a wide range of styles, from photorealism and cinematic shots to 3D render, oil painting, and illustration.
- High Resolution: Supports generation of high-definition images suitable for professional use.
Designed For
- Professional Designers: Create high-quality assets, mockups, and final visuals.
- Digital Artists: Explore complex concepts and generate detailed artwork.
- Marketing Agencies: Produce campaign-ready visuals with specific brand requirements.
- Enterprise Users: High-demand use cases requiring consistent, top-tier visual output.
Input Requirements
To achieve the best results, follow these guidelines:
Text Prompt
- Content: Detailed English descriptions of the subject, setting, lighting, style, and mood.
- Length: Supports long context, but concise and descriptive prompts often yield the best focus.
- Negative Prompt: Optional. Specify elements to exclude (e.g., "blur, low quality, distortion").
Parameters
- Aspect Ratio: Supports various standard ratios (1:1, 16:9, 9:16, 4:3, 3:4).
- Resolution: Optimized for high-resolution outputs (e.g., 1024x1024 and above).
- Steps/Guidance: Configurable for fine-tuning the balance between prompt adherence and image quality.
Pricing
Billing is typically based on the number of images generated and the resolution selected.
- Billing Logic: Per-image generation cost.
- Tier: "Max" tier commands a premium rate due to higher computational resources and output quality compared to standard models.
How to Use
- Enter Prompt: Describe the image you want to generate in detail.
- Set Parameters: Choose your desired aspect ratio and number of images.
- Generate: Submit the request to the Qwen-Image Max model.
- Refine: Use the generated image as a reference or adjust the prompt for iterations.
Best Practices
- Be Specific: Instead of "a cat," try "a fluffy white Persian cat sitting on a velvet sofa, cinematic lighting, 8k resolution."
- Define Style: Explicitly state the medium (e.g., "oil painting," "photograph," "3D render").
- Lighting & Composition: Mention lighting conditions (e.g., "golden hour," "studio lighting") and camera angles.
- Iterate: If the first result isn't perfect, tweak the prompt or use a negative prompt to remove unwanted elements.
Limitations
- Text Accuracy: While improved, complex or long text strings within the image may still occasionally have minor errors.
- Spatial Logic: Extremely complex spatial arrangements might sometimes require prompt tuning.
Version
- Model: Alibaba Qwen-Image Text-to-Image Max
- Family: Qwen-Image
- Technical Context: Large-scale diffusion transformer model optimized for maximum visual fidelity.






