235B-parameter MoE thinking model in Qwen3 series.

235B-parameter MoE thinking model in Qwen3 series.
import os
from openai import OpenAI
client = OpenAI(
api_key=os.getenv("ATLASCLOUD_API_KEY"),
base_url="https://api.atlascloud.ai/v1"
)
response = client.chat.completions.create(
model="Qwen/Qwen3-235B-A22B-Instruct-2507",
messages=[
{
"role": "user",
"content": "hello"
}
],
max_tokens=1024,
temperature=0.7
)
print(response.choices[0].message.content)為您的程式語言安裝所需的套件。
pip install requests所有 API 請求都需要透過 API 金鑰進行驗證。您可以從 Atlas Cloud 儀表板取得 API 金鑰。
export ATLASCLOUD_API_KEY="your-api-key-here"import os
API_KEY = os.environ.get("ATLASCLOUD_API_KEY")
headers = {
"Content-Type": "application/json",
"Authorization": f"Bearer {API_KEY}"
}切勿在客戶端程式碼或公開儲存庫中暴露您的 API 金鑰。請改用環境變數或後端代理。
import requests
url = "https://api.atlascloud.ai/v1/chat/completions"
headers = {
"Content-Type": "application/json",
"Authorization": "Bearer $ATLASCLOUD_API_KEY"
}
data = {
"model": "your-model",
"messages": [{"role": "user", "content": "Hello"}],
"max_tokens": 1024
}
response = requests.post(url, headers=headers, json=data)
print(response.json())以下參數可在請求主體中使用。
{
"model": "Qwen/Qwen3-235B-A22B-Instruct-2507",
"messages": [
{
"role": "user",
"content": "Hello"
}
],
"max_tokens": 1024,
"temperature": 0.7,
"stream": false
}API 傳回與 ChatCompletion 相容的回應。
{
"id": "chatcmpl-abc123",
"object": "chat.completion",
"created": 1700000000,
"model": "model-name",
"choices": [
{
"index": 0,
"message": {
"role": "assistant",
"content": "Hello! How can I assist you today?"
},
"finish_reason": "stop"
}
],
"usage": {
"prompt_tokens": 10,
"completion_tokens": 20,
"total_tokens": 30
}
}Atlas Cloud Skills 將 300 多個 AI 模型直接整合至您的 AI 程式碼助手。一鍵安裝,即可使用自然語言生成圖片、影片,以及與 LLM 對話。
npx skills add AtlasCloudAI/atlas-cloud-skills從 Atlas Cloud 儀表板取得 API 金鑰,並設為環境變數。
export ATLASCLOUD_API_KEY="your-api-key-here"安裝完成後,您可以在 AI 助手中使用自然語言存取所有 Atlas Cloud 模型。
Atlas Cloud MCP Server 透過 Model Context Protocol 將您的 IDE 與 300 多個 AI 模型連接。支援任何 MCP 相容的客戶端。
npx -y atlascloud-mcp將以下設定新增至您 IDE 的 MCP 設定檔中。
{
"mcpServers": {
"atlascloud": {
"command": "npx",
"args": [
"-y",
"atlascloud-mcp"
],
"env": {
"ATLASCLOUD_API_KEY": "your-api-key-here"
}
}
}
}Advanced multilingual AI with 128K-token context, excelling in coding, reasoning, and enterprise applications.
Qwen3-235B-A22B, developed by Alibaba Cloud, is a flagship large language model leveraging a Mixture-of-Experts (MoE) architecture. With 235 billion total parameters and 22 billion active per inference, it delivers top-tier performance in coding, math, and reasoning across 119 languages. Optimized for enterprise tasks like software development and research, it’s accessible via AI/ML API.
Qwen3-235B-A22B uses a Transformer-based MoE architecture, activating 22 billion of its 235 billion parameters per token via top-8 expert selection, reducing compute costs. It features Rotary Positional Embeddings and Group-Query Attention for efficiency. Pre-trained on 36 trillion tokens across 119 languages, it uses RLHF and a four-stage post-training process for hybrid reasoning.
Context Window: 32K tokens natively, extendable to 128K with YaRN.
Benchmarks:
Performance: 40.1 tokens/second output speed, 0.54s latency (TTFT).
API Pricing:

Qwen3-235B-A22B comparison
Qwen3-235B-A22B excels in hybrid reasoning, toggling between thinking mode (/think) for step-by-step problem-solving and non-thinking mode (/no_think) for rapid responses. It supports 119 languages, enabling seamless global applications like multilingual chatbots and translation. With a 128K-token context, it processes large datasets, codebases, and documents with high coherence, using XML delimiters for structure retention.
Qwen3-235B-A22B is tailored for high-complexity enterprise scenarios requiring deep reasoning and scalability:
Qwen3-235B-A22B stands out among leading models due to its MoE efficiency and multilingual capabilities: