235B-parameter MoE thinking model in Qwen3 series.

import os
from openai import OpenAI

client = OpenAI(
    api_key=os.getenv("ATLASCLOUD_API_KEY"),
    base_url="https://api.atlascloud.ai/v1"
)

response = client.chat.completions.create(
    model="Qwen/Qwen3-235B-A22B-Instruct-2507",
    messages=[
        {
            "role": "user",
            "content": "hello"
        }
    ],
    max_tokens=1024,
    temperature=0.7
)
print(response.choices[0].message.content)

Install the required package for your language.
pip install requests

All API requests require authentication via an API key. You can get your API key from the Atlas Cloud dashboard.
export ATLASCLOUD_API_KEY="your-api-key-here"

import os

API_KEY = os.environ.get("ATLASCLOUD_API_KEY")
headers = {
    "Content-Type": "application/json",
    "Authorization": f"Bearer {API_KEY}"
}

Never expose your API key in client-side code or public repositories. Use environment variables or a backend proxy instead.
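As a small sketch of the advice above, the headers can be built by a helper that fails fast when the key is missing, rather than silently sending "Bearer None". The function name build_auth_headers is hypothetical and not part of any Atlas Cloud SDK; only the ATLASCLOUD_API_KEY variable and the header layout come from the examples in this document.

```python
import os


def build_auth_headers(env=None):
    """Build request headers from ATLASCLOUD_API_KEY, failing fast if it is unset.

    build_auth_headers is a hypothetical helper used for illustration only.
    """
    # Allow a custom mapping for testing; default to the process environment.
    env = os.environ if env is None else env
    api_key = env.get("ATLASCLOUD_API_KEY", "").strip()
    if not api_key:
        raise RuntimeError("ATLASCLOUD_API_KEY is not set")
    return {
        "Content-Type": "application/json",
        "Authorization": f"Bearer {api_key}",
    }
```

Raising early keeps a missing key from surfacing later as an opaque authentication error from the server.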
import os
import requests

url = "https://api.atlascloud.ai/v1/chat/completions"
headers = {
    "Content-Type": "application/json",
    # Read the key from the environment; Python does not expand "$ATLASCLOUD_API_KEY" inside a string.
    "Authorization": f"Bearer {os.environ['ATLASCLOUD_API_KEY']}"
}
data = {
    "model": "your-model",
    "messages": [{"role": "user", "content": "Hello"}],
    "max_tokens": 1024
}
response = requests.post(url, headers=headers, json=data)
print(response.json())

The following parameters are accepted in the request body.
{
  "model": "Qwen/Qwen3-235B-A22B-Instruct-2507",
  "messages": [
    {
      "role": "user",
      "content": "Hello"
    }
  ],
  "max_tokens": 1024,
  "temperature": 0.7,
  "stream": false
}

The API returns a ChatCompletion-compatible response.
{
  "id": "chatcmpl-abc123",
  "object": "chat.completion",
  "created": 1700000000,
  "model": "model-name",
  "choices": [
    {
      "index": 0,
      "message": {
        "role": "assistant",
        "content": "Hello! How can I assist you today?"
      },
      "finish_reason": "stop"
    }
  ],
  "usage": {
    "prompt_tokens": 10,
    "completion_tokens": 20,
    "total_tokens": 30
  }
}

Atlas Cloud Skills integrates 300+ AI models directly into your AI coding assistant. One command to install, then use natural language to generate images, videos, and chat with LLMs.
npx skills add AtlasCloudAI/atlas-cloud-skills

Get your API key from the Atlas Cloud dashboard and set it as an environment variable.
export ATLASCLOUD_API_KEY="your-api-key-here"

Once installed, you can use natural language in your AI assistant to access all Atlas Cloud models.
Atlas Cloud MCP Server connects your IDE with 300+ AI models via the Model Context Protocol. Works with any MCP-compatible client.
npx -y atlascloud-mcp

Add the following configuration to your IDE's MCP settings file.
{
  "mcpServers": {
    "atlascloud": {
      "command": "npx",
      "args": [
        "-y",
        "atlascloud-mcp"
      ],
      "env": {
        "ATLASCLOUD_API_KEY": "your-api-key-here"
      }
    }
  }
}

Advanced multilingual AI with 128K-token context, excelling in coding, reasoning, and enterprise applications.
Qwen3-235B-A22B, developed by Alibaba Cloud, is a flagship large language model built on a Mixture-of-Experts (MoE) architecture. With 235 billion total parameters, of which 22 billion are active per token, it delivers top-tier performance in coding, math, and reasoning across 119 languages. Optimized for enterprise tasks like software development and research, it’s accessible via AI/ML API.
Qwen3-235B-A22B uses a Transformer-based MoE architecture that activates 22 billion of its 235 billion parameters per token via top-8 expert selection, which reduces the compute cost per token. It features Rotary Positional Embeddings and Grouped-Query Attention for efficiency. Pre-trained on 36 trillion tokens across 119 languages, it uses RLHF and a four-stage post-training process for hybrid reasoning.
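To make the top-8 expert selection concrete, here is a minimal, illustrative sketch of top-k routing for a single token in plain Python. It is not Qwen3's actual implementation: the function name route_top_k, the toy router scores, the expert count of 16, and the choice to apply softmax over only the selected experts are all assumptions made for illustration.

```python
import math


def route_top_k(router_logits, k=8):
    """Pick the k highest-scoring experts and return {expert_index: weight}.

    Weights are a softmax over the selected logits only (an assumption;
    real MoE layers differ in how they normalize router scores).
    """
    if k > len(router_logits):
        raise ValueError("k cannot exceed the number of experts")
    # Indices of the k largest router scores.
    top = sorted(
        range(len(router_logits)),
        key=lambda i: router_logits[i],
        reverse=True,
    )[:k]
    # Numerically stable softmax restricted to the selected experts.
    max_logit = max(router_logits[i] for i in top)
    exps = {i: math.exp(router_logits[i] - max_logit) for i in top}
    total = sum(exps.values())
    return {i: e / total for i, e in exps.items()}


# Toy example: 16 hypothetical experts, only 8 are evaluated for this token.
logits = [0.1 * i for i in range(16)]
weights = route_top_k(logits, k=8)
print(sorted(weights))  # indices of the selected experts
```

Only the selected experts' feed-forward blocks run for a given token, which is why the active parameter count (22 billion) is far smaller than the total (235 billion).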
Context Window: 32K tokens natively, extendable to 128K with YaRN.
Benchmarks:
Performance: 40.1 tokens/second output speed, 0.54s latency (TTFT).
API Pricing:

Qwen3-235B-A22B comparison
Qwen3-235B-A22B excels in hybrid reasoning, toggling between thinking mode (/think) for step-by-step problem-solving and non-thinking mode (/no_think) for rapid responses. It supports 119 languages, enabling seamless global applications like multilingual chatbots and translation. With a 128K-token context, it processes large datasets, codebases, and documents with high coherence, using XML delimiters for structure retention.
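As a rough sketch of the mode toggle described above, the switch can be appended to the user message before it is sent to the chat completions endpoint. The helper name with_thinking_mode is hypothetical, and whether a particular hosted variant honors /think and /no_think is an assumption to verify against that model's documentation.

```python
def with_thinking_mode(prompt, thinking):
    """Return a chat `messages` list with the /think or /no_think switch appended.

    with_thinking_mode is a hypothetical helper, not part of any SDK.
    """
    switch = "/think" if thinking else "/no_think"
    # Trim trailing whitespace so the switch is separated by exactly one space.
    return [{"role": "user", "content": f"{prompt.rstrip()} {switch}"}]


messages = with_thinking_mode("Prove that 17 is prime.", thinking=True)
print(messages[0]["content"])
```

The returned list has the same shape as the messages field in the request body shown earlier, so it can be passed directly as that argument.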
Qwen3-235B-A22B is tailored for high-complexity enterprise scenarios requiring deep reasoning and scalability:
Qwen3-235B-A22B stands out among leading models due to its MoE efficiency and multilingual capabilities: