alibaba/qwen-image/text-to-image-max

General-purpose image generation model that supports various art styles and is particularly good at rendering complex text.

TEXT-TO-IMAGEHOTNEW
टेक्स्ट-से-इमेज

General-purpose image generation model that supports various art styles and is particularly good at rendering complex text.

इनपुट

पैरामीटर कॉन्फ़िगरेशन लोड हो रहा है...

आउटपुट

निष्क्रिय
जेनरेट की गई छवियां यहां दिखाई देंगी
अपनी सेटिंग्स कॉन्फ़िगर करें और शुरू करने के लिए चलाएं पर क्लिक करें

आपके अनुरोध की लागत 0.052 प्रति रन होगी। $10 के साथ आप इस मॉडल को लगभग 192 बार चला सकते हैं।

आगे आप यह कर सकते हैं:

1. कोड उदाहरण

2. स्कीमा

3. LLM प्रॉम्प्ट

4. पैरामीटर

import requests
import time

# Step 1: Start image generation
generate_url = "https://api.atlascloud.ai/api/v1/model/generateImage"
headers = {
    "Content-Type": "application/json",
    "Authorization": "Bearer $ATLASCLOUD_API_KEY"
}
data = {
    "model": "alibaba/qwen-image/text-to-image-max",
    "prompt": "A beautiful landscape with mountains and lake",
    "width": 512,
    "height": 512,
    "steps": 20,
    "guidance_scale": 7.5
}

generate_response = requests.post(generate_url, headers=headers, json=data)
generate_result = generate_response.json()
prediction_id = generate_result["data"]["id"]

# Step 2: Poll for result
poll_url = f"https://api.atlascloud.ai/api/v1/model/prediction/{prediction_id}"

def check_status():
    while True:
        response = requests.get(poll_url, headers={"Authorization": "Bearer $ATLASCLOUD_API_KEY"})
        result = response.json()

        if result["data"]["status"] == "completed":
            print("Generated image:", result["data"]["outputs"][0])
            return result["data"]["outputs"][0]
        elif result["data"]["status"] == "failed":
            raise Exception(result["data"]["error"] or "Generation failed")
        else:
            # Still processing, wait 2 seconds
            time.sleep(2)

image_url = check_status()

अनुरोध इतिहास देखने के लिए कृपया साइन इन करें

अपने मॉडल अनुरोध इतिहास तक पहुंचने के लिए आपको साइन इन करना होगा।

साइन इन करें

Alibaba Qwen-Image Text-to-Image Max

The flagship text-to-image generation model from Alibaba Cloud, designed to deliver state-of-the-art visual quality, exceptional prompt adherence, and rich artistic detail. Qwen-Image Max represents the pinnacle of the Qwen-Image family, capable of transforming complex text descriptions into stunning, high-resolution visuals suitable for professional and creative workflows.

Overview

  • Purpose: Generate premium-quality images from natural language descriptions.
  • Core Capability: Industry-leading visual fidelity with deep semantic understanding of prompts.
  • Foundation: Built on Alibaba's advanced large-scale multi-modal architecture.
  • Typical Output: High-resolution, photorealistic or artistic images with precise lighting, texture, and composition.
  • Use Cases: Professional design, advertising creatives, concept art, marketing materials, and high-end content creation.

Key Features

  • Superior Visual Quality: Delivers the highest level of detail, texture, and lighting realism available in the Qwen-Image series.
  • Complex Prompt Understanding: Accurately interprets long, intricate prompts, including spatial relationships, artistic styles, and specific object attributes.
  • Text Rendering: Enhanced capability to render legible text within generated images (e.g., signboards, posters).
  • Style Versatility: Masterfully handles a wide range of styles, from photorealism and cinematic shots to 3D render, oil painting, and illustration.
  • High Resolution: Supports generation of high-definition images suitable for professional use.

Designed For

  • Professional Designers: Create high-quality assets, mockups, and final visuals.
  • Digital Artists: Explore complex concepts and generate detailed artwork.
  • Marketing Agencies: Produce campaign-ready visuals with specific brand requirements.
  • Enterprise Users: High-demand use cases requiring consistent, top-tier visual output.

Input Requirements

To achieve the best results, follow these guidelines:

Text Prompt

  • Content: Detailed English descriptions of the subject, setting, lighting, style, and mood.
  • Length: Supports long context, but concise and descriptive prompts often yield the best focus.
  • Negative Prompt: Optional. Specify elements to exclude (e.g., "blur, low quality, distortion").

Parameters

  • Aspect Ratio: Supports various standard ratios (1:1, 16:9, 9:16, 4:3, 3:4).
  • Resolution: Optimized for high-resolution outputs (e.g., 1024x1024 and above).
  • Steps/Guidance: Configurable for fine-tuning the balance between prompt adherence and image quality.

Pricing

Billing is typically based on the number of images generated and the resolution selected.

  • Billing Logic: Per-image generation cost.
  • Tier: "Max" tier commands a premium rate due to higher computational resources and output quality compared to standard models.

How to Use

  1. Enter Prompt: Describe the image you want to generate in detail.
  2. Set Parameters: Choose your desired aspect ratio and number of images.
  3. Generate: Submit the request to the Qwen-Image Max model.
  4. Refine: Use the generated image as a reference or adjust the prompt for iterations.

Best Practices

  • Be Specific: Instead of "a cat," try "a fluffy white Persian cat sitting on a velvet sofa, cinematic lighting, 8k resolution."
  • Define Style: Explicitly state the medium (e.g., "oil painting," "photograph," "3D render").
  • Lighting & Composition: Mention lighting conditions (e.g., "golden hour," "studio lighting") and camera angles.
  • Iterate: If the first result isn't perfect, tweak the prompt or use a negative prompt to remove unwanted elements.

Limitations

  • Text Accuracy: While improved, complex or long text strings within the image may still occasionally have minor errors.
  • Spatial Logic: Extremely complex spatial arrangements might sometimes require prompt tuning.

Version

  • Model: Alibaba Qwen-Image Text-to-Image Max
  • Family: Qwen-Image
  • Technical Context: Large-scale diffusion transformer model optimized for maximum visual fidelity.

विस्तृत विनिर्देश

अवलोकन:

मॉडल प्रदाता:QWEN
मॉडल प्रकार:text-to-image
डिप्लॉयमेंट:इंफरेंसिंग API; प्लेग्राउंड
मूल्य निर्धारण:$0.052/pic

मुख्य विनिर्देश:

साइज़ सीमा:चौड़ाई × ऊंचाई तक (उपयोगकर्ता द्वारा कॉन्फ़िगर करने योग्य)
LoRA समर्थन:नहीं
सीड विकल्प:N/A

अपनी अगली उत्कृष्ट कृति बनाएं

300+ मॉडल से शुरू करें,

सभी मॉडल एक्सप्लोर करें