
Qwen-Image Text-to-Image Max API by Alibaba
General-purpose image generation model that supports various art styles and is particularly good at rendering complex text.
الإدخال
الإخراج
في انتظار التنفيذكل مرة ستكلف $0.052 مع $10 يمكنك التشغيل حوالي 192 مرة
يمكنك المتابعة بـ:
مثال الكود
import requests
import time
# Step 1: Start image generation
generate_url = "https://api.atlascloud.ai/api/v1/model/generateImage"
headers = {
"Content-Type": "application/json",
"Authorization": "Bearer $ATLASCLOUD_API_KEY"
}
data = {
"model": "alibaba/qwen-image/text-to-image-max",
"prompt": "A beautiful landscape with mountains and lake",
"width": 512,
"height": 512,
"steps": 20,
"guidance_scale": 7.5,
}
generate_response = requests.post(generate_url, headers=headers, json=data)
generate_result = generate_response.json()
prediction_id = generate_result["data"]["id"]
# Step 2: Poll for result
poll_url = f"https://api.atlascloud.ai/api/v1/model/prediction/{prediction_id}"
def check_status():
while True:
response = requests.get(poll_url, headers={"Authorization": "Bearer $ATLASCLOUD_API_KEY"})
result = response.json()
if result["data"]["status"] == "completed":
print("Generated image:", result["data"]["outputs"][0])
return result["data"]["outputs"][0]
elif result["data"]["status"] == "failed":
raise Exception(result["data"]["error"] or "Generation failed")
else:
# Still processing, wait 2 seconds
time.sleep(2)
image_url = check_status()التثبيت
قم بتثبيت الحزمة المطلوبة للغة البرمجة الخاصة بك.
pip install requestsالمصادقة
تتطلب جميع طلبات API المصادقة عبر مفتاح API. يمكنك الحصول على مفتاح API الخاص بك من لوحة تحكم Atlas Cloud.
export ATLASCLOUD_API_KEY="your-api-key-here"ترويسات HTTP
import os
API_KEY = os.environ.get("ATLASCLOUD_API_KEY")
headers = {
"Content-Type": "application/json",
"Authorization": f"Bearer {API_KEY}"
}لا تكشف أبدًا مفتاح API الخاص بك في الكود من جانب العميل أو المستودعات العامة. استخدم متغيرات البيئة أو وكيل الخادم الخلفي بدلاً من ذلك.
إرسال طلب
import requests
url = "https://api.atlascloud.ai/api/v1/model/generateImage"
headers = {
"Content-Type": "application/json",
"Authorization": "Bearer $ATLASCLOUD_API_KEY"
}
data = {
"model": "your-model",
"prompt": "A beautiful landscape"
}
response = requests.post(url, headers=headers, json=data)
print(response.json())إرسال طلب
أرسل طلب توليد غير متزامن. تُرجع API معرّف التنبؤ الذي يمكنك استخدامه للتحقق من الحالة واسترداد النتيجة.
/api/v1/model/generateImageنص الطلب
import requests
url = "https://api.atlascloud.ai/api/v1/model/generateImage"
headers = {
"Content-Type": "application/json",
"Authorization": "Bearer $ATLASCLOUD_API_KEY"
}
data = {
"model": "alibaba/qwen-image/text-to-image-max",
"input": {
"prompt": "A beautiful landscape with mountains and lake"
}
}
response = requests.post(url, headers=headers, json=data)
result = response.json()
print(f"Prediction ID: {result['id']}")
print(f"Status: {result['status']}")الاستجابة
{
"id": "pred_abc123",
"status": "processing",
"model": "model-name",
"created_at": "2025-01-01T00:00:00Z"
}التحقق من الحالة
استعلم عن نقطة نهاية التنبؤ للتحقق من الحالة الحالية لطلبك.
/api/v1/model/prediction/{prediction_id}مثال الاستعلام
import requests
import time
prediction_id = "pred_abc123"
url = f"https://api.atlascloud.ai/api/v1/model/prediction/{prediction_id}"
headers = { "Authorization": "Bearer $ATLASCLOUD_API_KEY" }
while True:
response = requests.get(url, headers=headers)
result = response.json()
status = result["data"]["status"]
print(f"Status: {status}")
if status in ["completed", "succeeded"]:
output_url = result["data"]["outputs"][0]
print(f"Output URL: {output_url}")
break
elif status == "failed":
print(f"Error: {result['data'].get('error', 'Unknown')}")
break
time.sleep(3)قيم الحالة
processingلا يزال الطلب قيد المعالجة.completedاكتمل التوليد. المخرجات متاحة.succeededنجح التوليد. المخرجات متاحة.failedفشل التوليد. تحقق من حقل الخطأ.استجابة مكتملة
{
"data": {
"id": "pred_abc123",
"status": "completed",
"outputs": [
"https://storage.atlascloud.ai/outputs/result.png"
],
"metrics": {
"predict_time": 8.3
},
"created_at": "2025-01-01T00:00:00Z",
"completed_at": "2025-01-01T00:00:10Z"
}
}رفع الملفات
ارفع الملفات إلى تخزين Atlas Cloud واحصل على URL يمكنك استخدامه في طلبات API الخاصة بك. استخدم multipart/form-data للرفع.
/api/v1/model/uploadMediaمثال الرفع
import requests
url = "https://api.atlascloud.ai/api/v1/model/uploadMedia"
headers = { "Authorization": "Bearer $ATLASCLOUD_API_KEY" }
with open("image.png", "rb") as f:
files = {"file": ("image.png", f, "image/png")}
response = requests.post(url, headers=headers, files=files)
result = response.json()
download_url = result["data"]["download_url"]
print(f"File URL: {download_url}")الاستجابة
{
"data": {
"download_url": "https://storage.atlascloud.ai/uploads/abc123/image.png",
"file_name": "image.png",
"content_type": "image/png",
"size": 1024000
}
}Input Schema
المعاملات التالية مقبولة في نص الطلب.
لا توجد معاملات متاحة.
مثال على نص الطلب
{
"model": "alibaba/qwen-image/text-to-image-max"
}Output Schema
تُرجع API استجابة تنبؤ تحتوي على عناوين URL للمخرجات المولّدة.
مثال على الاستجابة
{
"id": "pred_abc123",
"status": "completed",
"model": "model-name",
"outputs": [
"https://storage.atlascloud.ai/outputs/result.png"
],
"metrics": {
"predict_time": 8.3
},
"created_at": "2025-01-01T00:00:00Z",
"completed_at": "2025-01-01T00:00:10Z"
}Atlas Cloud Skills
يدمج Atlas Cloud Skills أكثر من 300 نموذج ذكاء اصطناعي مباشرة في مساعد البرمجة بالذكاء الاصطناعي الخاص بك. أمر واحد للتثبيت، ثم استخدم اللغة الطبيعية لتوليد الصور ومقاطع الفيديو والدردشة مع LLM.
العملاء المدعومون
التثبيت
npx skills add AtlasCloudAI/atlas-cloud-skillsإعداد مفتاح API
احصل على مفتاح API الخاص بك من لوحة تحكم Atlas Cloud وعيّنه كمتغير بيئة.
export ATLASCLOUD_API_KEY="your-api-key-here"الإمكانيات
بمجرد التثبيت، يمكنك استخدام اللغة الطبيعية في مساعد الذكاء الاصطناعي الخاص بك للوصول إلى جميع نماذج Atlas Cloud.
MCP Server
يربط Atlas Cloud MCP Server بيئة التطوير الخاصة بك بأكثر من 300 نموذج ذكاء اصطناعي عبر Model Context Protocol. يعمل مع أي عميل متوافق مع MCP.
العملاء المدعومون
التثبيت
npx -y atlascloud-mcpالتكوين
أضف التكوين التالي إلى ملف إعدادات MCP في بيئة التطوير الخاصة بك.
{
"mcpServers": {
"atlascloud": {
"command": "npx",
"args": [
"-y",
"atlascloud-mcp"
],
"env": {
"ATLASCLOUD_API_KEY": "your-api-key-here"
}
}
}
}الأدوات المتاحة
مخطط API
المخطط غير متاحلا توجد أمثلة
Alibaba Qwen-Image Text-to-Image Max
The flagship text-to-image generation model from Alibaba Cloud, designed to deliver state-of-the-art visual quality, exceptional prompt adherence, and rich artistic detail. Qwen-Image Max represents the pinnacle of the Qwen-Image family, capable of transforming complex text descriptions into stunning, high-resolution visuals suitable for professional and creative workflows.
Overview
- Purpose: Generate premium-quality images from natural language descriptions.
- Core Capability: Industry-leading visual fidelity with deep semantic understanding of prompts.
- Foundation: Built on Alibaba's advanced large-scale multi-modal architecture.
- Typical Output: High-resolution, photorealistic or artistic images with precise lighting, texture, and composition.
- Use Cases: Professional design, advertising creatives, concept art, marketing materials, and high-end content creation.
Key Features
- Superior Visual Quality: Delivers the highest level of detail, texture, and lighting realism available in the Qwen-Image series.
- Complex Prompt Understanding: Accurately interprets long, intricate prompts, including spatial relationships, artistic styles, and specific object attributes.
- Text Rendering: Enhanced capability to render legible text within generated images (e.g., signboards, posters).
- Style Versatility: Masterfully handles a wide range of styles, from photorealism and cinematic shots to 3D render, oil painting, and illustration.
- High Resolution: Supports generation of high-definition images suitable for professional use.
Designed For
- Professional Designers: Create high-quality assets, mockups, and final visuals.
- Digital Artists: Explore complex concepts and generate detailed artwork.
- Marketing Agencies: Produce campaign-ready visuals with specific brand requirements.
- Enterprise Users: High-demand use cases requiring consistent, top-tier visual output.
Input Requirements
To achieve the best results, follow these guidelines:
Text Prompt
- Content: Detailed English descriptions of the subject, setting, lighting, style, and mood.
- Length: Supports long context, but concise and descriptive prompts often yield the best focus.
- Negative Prompt: Optional. Specify elements to exclude (e.g., "blur, low quality, distortion").
Parameters
- Aspect Ratio: Supports various standard ratios (1:1, 16:9, 9:16, 4:3, 3:4).
- Resolution: Optimized for high-resolution outputs (e.g., 1024x1024 and above).
- Steps/Guidance: Configurable for fine-tuning the balance between prompt adherence and image quality.
Pricing
Billing is typically based on the number of images generated and the resolution selected.
- Billing Logic: Per-image generation cost.
- Tier: "Max" tier commands a premium rate due to higher computational resources and output quality compared to standard models.
How to Use
- Enter Prompt: Describe the image you want to generate in detail.
- Set Parameters: Choose your desired aspect ratio and number of images.
- Generate: Submit the request to the Qwen-Image Max model.
- Refine: Use the generated image as a reference or adjust the prompt for iterations.
Best Practices
- Be Specific: Instead of "a cat," try "a fluffy white Persian cat sitting on a velvet sofa, cinematic lighting, 8k resolution."
- Define Style: Explicitly state the medium (e.g., "oil painting," "photograph," "3D render").
- Lighting & Composition: Mention lighting conditions (e.g., "golden hour," "studio lighting") and camera angles.
- Iterate: If the first result isn't perfect, tweak the prompt or use a negative prompt to remove unwanted elements.
Limitations
- Text Accuracy: While improved, complex or long text strings within the image may still occasionally have minor errors.
- Spatial Logic: Extremely complex spatial arrangements might sometimes require prompt tuning.
Version
- Model: Alibaba Qwen-Image Text-to-Image Max
- Family: Qwen-Image
- Technical Context: Large-scale diffusion transformer model optimized for maximum visual fidelity.






