qwen/qwen3-max-2026-01-23

LLM

Qwen3 Max 20260123 API by Alibaba

qwen/qwen3-max-2026-01-23

Qwen3-max-2026-01-23

Qwen3-Max is a flagship large language model designed for ultra-long context understanding, powerful reasoning, and high-performance text and code generation, making it well suited for complex, large-scale, and production-grade AI applications.

參數

程式碼範例
import os
from openai import OpenAI

client = OpenAI(
    api_key=os.getenv("ATLASCLOUD_API_KEY"),
    base_url="https://api.atlascloud.ai/v1"
)

response = client.chat.completions.create(
    model="qwen/qwen3-max-2026-01-23",
    messages=[
    {
        "role": "user",
        "content": "hello"
    }
],
    max_tokens=1024,
    temperature=0.7
)

print(response.choices[0].message.content)

安裝

安裝所需的相依套件。

pip install requests

驗證

所有 API 請求都需要透過 API Key 進行認證。您可以在 Atlas Cloud 控制台取得 API Key。

export ATLASCLOUD_API_KEY="your-api-key-here"

HTTP 標頭

import os

API_KEY = os.environ.get("ATLASCLOUD_API_KEY")
headers = {
    "Content-Type": "application/json",
    "Authorization": f"Bearer {API_KEY}"
}

保護好您的 API Key

切勿在客戶端程式碼或公開儲存庫中暴露您的 API Key。請使用環境變數或後端代理。

提交請求

import requests

url = "https://api.atlascloud.ai/v1/chat/completions"
headers = {
    "Content-Type": "application/json",
    "Authorization": "Bearer $ATLASCLOUD_API_KEY"
}
data = {
    "model": "your-model",
    "messages": [{"role": "user", "content": "Hello"}],
    "max_tokens": 1024
}

response = requests.post(url, headers=headers, json=data)
print(response.json())

Input Schema

以下參數在請求主體中被接受。

總計: 9必填: 2選填: 7

modelstringrequired

The model ID to use for the completion.

Example: "qwen/qwen3-max-2026-01-23"

messagesarray[object]required

A list of messages comprising the conversation so far.

rolestringrequired

The role of the message author. One of "system", "user", or "assistant".

systemuserassistant

contentstringrequired

The content of the message.

max_tokensinteger

The maximum number of tokens to generate in the completion.

Default: 1024Min: 1

temperaturenumber

Sampling temperature between 0 and 2. Higher values make output more random, lower values more focused and deterministic.

Default: 0.7Min: 0Max: 2

top_pnumber

Nucleus sampling parameter. The model considers the tokens with top_p probability mass.

Default: 1Min: 0Max: 1

streamboolean

If set to true, partial message deltas will be sent as server-sent events.

Default: false

stoparray[string]

Up to 4 sequences where the API will stop generating further tokens.

frequency_penaltynumber

Penalizes new tokens based on their existing frequency in the text so far. Between -2.0 and 2.0.

Default: 0Min: -2Max: 2

presence_penaltynumber

Penalizes new tokens based on whether they appear in the text so far. Between -2.0 and 2.0.

Default: 0Min: -2Max: 2

範例請求主體

{
  "model": "qwen/qwen3-max-2026-01-23",
  "messages": [
    {
      "role": "user",
      "content": "Hello"
    }
  ],
  "max_tokens": 1024,
  "temperature": 0.7,
  "stream": false
}

Output Schema

API 傳回相容 ChatCompletion 的回應格式。

idstringrequired

Unique identifier for the completion.

objectstringrequired

Object type, always "chat.completion".

Default: "chat.completion"

createdintegerrequired

Unix timestamp of when the completion was created.

modelstringrequired

The model used for the completion.

choicesarray[object]required

List of completion choices.

indexintegerrequired

Index of the choice.

messageobjectrequired

The generated message.

finish_reasonstringrequired

The reason generation stopped.

stoplengthcontent_filter

usageobjectrequired

Token usage statistics.

prompt_tokensintegerrequired

Number of tokens in the prompt.

completion_tokensintegerrequired

Number of tokens in the completion.

total_tokensintegerrequired

Total tokens used.

範例回應

{
  "id": "chatcmpl-abc123",
  "object": "chat.completion",
  "created": 1700000000,
  "model": "model-name",
  "choices": [
    {
      "index": 0,
      "message": {
        "role": "assistant",
        "content": "Hello! How can I assist you today?"
      },
      "finish_reason": "stop"
    }
  ],
  "usage": {
    "prompt_tokens": 10,
    "completion_tokens": 20,
    "total_tokens": 30
  }
}

Atlas Cloud Skills

Atlas Cloud Skills 將 300+ AI 模型直接整合到您的 AI 程式碼助手中。一條命令安裝，即可用自然語言生成圖片、影片，以及與 LLM 對話。

支援的客戶端

Claude Code

OpenAI Codex

Gemini CLI

Cursor

Windsurf

VS Code

Trae

GitHub Copilot

Cline

Roo Code

Amp

Goose

Replit

40+ 支援的客戶端

安裝

npx skills add AtlasCloudAI/atlas-cloud-skills

設定 API Key

從 Atlas Cloud 控制台取得 API Key，並將其設定為環境變數。

export ATLASCLOUD_API_KEY="your-api-key-here"

功能

安裝完成後，您可以在 AI 助手中使用自然語言存取所有 Atlas Cloud 模型。

圖片生成使用 Nano Banana 2、Z-Image 等模型生成圖片。

影片創作使用 Kling、Vidu、Veo 等從文字或圖片創建影片。

LLM 對話與 Qwen、DeepSeek 及其他大型語言模型對話。

媒體上傳上傳本機檔案用於圖片編輯和圖生影片工作流程。

MCP Server

Atlas Cloud MCP Server 透過 Model Context Protocol 將您的 IDE 與 300+ AI 模型連接。支援任何相容 MCP 的客戶端。

支援的客戶端

Cursor

VS Code

Windsurf

Claude Code

OpenAI Codex

Gemini CLI

Cline

Roo Code

100+ 支援的客戶端

安裝

npx -y atlascloud-mcp

設定

將以下設定新增到您的 IDE 的 MCP 設定檔中。

{
  "mcpServers": {
    "atlascloud": {
      "command": "npx",
      "args": [
        "-y",
        "atlascloud-mcp"
      ],
      "env": {
        "ATLASCLOUD_API_KEY": "your-api-key-here"
      }
    }
  }
}

可用工具

atlas_generate_image根據文字提示生成圖片。

atlas_generate_video從文字或圖片創建影片。

atlas_chat與大型語言模型對話。

atlas_list_models瀏覽 300+ 可用 AI 模型。

atlas_quick_generate一步式內容創建，自動選擇最佳模型。

atlas_upload_media上傳本機檔案用於 API 工作流程。

了解更多

github.com/AtlasCloudAI/mcp-server

Qwen3-Max Large Language Model

Overview

Qwen3-Max is a state-of-the-art large language model (LLM) developed by Alibaba Cloud as the flagship model of the Qwen3 series. It is designed to deliver high-performance reasoning, ultra-long context processing, and general-purpose AI capabilities for enterprise and developer use cases.

As a trillion-scale Mixture-of-Experts (MoE) model, Qwen3-Max combines massive model capacity with efficient inference, making it suitable for both complex reasoning tasks and large-scale production deployment.

What is Qwen3-Max?

Qwen3-Max is a general-purpose foundation model optimized for:

Advanced reasoning and analytical tasks
Long-context understanding and document processing
Multilingual natural language understanding and generation
Code generation, explanation, and debugging
Instruction-following and conversational AI systems

It is part of the Qwen (Tongyi Qianwen) model family and represents the highest-capacity model available in the Qwen3 lineup.

Core Features

Trillion-Scale MoE Architecture

Built with a Mixture-of-Experts (MoE) architecture to enable scalable intelligence
Trillion-level total parameters, activating only a subset of experts per request for efficiency
Designed for high throughput, stability, and performance at scale
Suitable for cloud-based deployment and enterprise workloads

Ultra-Long Context Capability

Supports extremely long context windows, enabling:
- Long document understanding
- Multi-file code analysis
- Large knowledge base ingestion
- Retrieval-augmented generation (RAG) workflows
Well-suited for legal, financial, technical, and research documents

Advanced Reasoning & Intelligence

Strong performance in:
- Logical reasoning
- Mathematical problem solving
- Multi-step instruction execution
- Knowledge-intensive question answering
Optimized for both fast responses and deep reasoning depending on task complexity

Multilingual & Cross-Domain Support

Trained on large-scale multilingual data
Capable of understanding and generating content across many languages
Performs well across domains such as:
- Technology
- Programming
- Science
- Business
- Education

Model Capabilities

Qwen3-Max can be used as a foundation model for a wide range of AI applications, including but not limited to:

Chatbots and conversational AI
AI assistants and copilots
Code generation and software engineering tools
Enterprise knowledge assistants
Document analysis and summarization
Search, QA, and reasoning systems
Multilingual content generation

API & Integration

OpenAI-Compatible API

Qwen3-Max is available through a cloud API that follows the OpenAI-compatible interface, allowing developers to:

Reuse existing OpenAI-style SDKs
Integrate with minimal code changes
Deploy across backend services, agents, and AI workflows

Example Usage (Python)

from openai import OpenAI
import os

client = OpenAI(
    api_key=os.getenv("API_KEY"),
    base_url="https://dashscope-intl.aliyuncs.com/compatible-mode/v1",
)

response = client.chat.completions.create(
    model="qwen3-max",
    messages=[
        {"role": "user", "content": "Summarize the key capabilities of Qwen3-Max."}
    ],
)

print(response.choices[0].message.content)

Technical Specifications

Specification	Details
Model Name	qwen3-max
Model Family	Qwen3
Model Type	Large Language Model (LLM)
Architecture	Mixture-of-Experts (MoE)
Parameter Scale	Trillion-scale
Context Length	Ultra-long context support
Training Data	Large-scale multilingual corpus
Supported Tasks	Chat, reasoning, coding, multilingual generation, long-document analysis
Deployment	Cloud API, OpenAI-compatible

Performance Characteristics

Designed for high accuracy and robustness on complex tasks
Efficient inference despite large model size
Scales well across concurrent requests
Suitable for production environments requiring stability and reliability

Use Cases & Scenarios

Enterprise Applications

Internal knowledge bases
Intelligent document processing
Customer support automation
Business intelligence assistants

Developer & AI Products

AI coding assistants
Agent frameworks and tool-using LLMs
Search-augmented and RAG systems
Multi-step reasoning pipelines

Research & Advanced Workloads

Long-form reasoning experiments
Large-context evaluation
Multilingual NLP research

Why Choose Qwen3-Max?

Trillion-scale intelligence with efficient MoE design
Strong reasoning and instruction-following performance
Ultra-long context handling for complex real-world data
OpenAI-compatible API for easy integration
Suitable for both enterprise and developer-focused AI systems

Summary

Qwen3-Max is a powerful, large-scale language model designed for modern AI applications that demand reasoning ability, long-context understanding, and production-ready performance. As the flagship model of the Qwen3 series, it provides a strong foundation for building advanced AI products, intelligent assistants, and enterprise-grade solutions.

探索類似模型

NEW

The latest Qwen reasoning model.

Qwen3 Max 20260123 API by Alibaba

參數

程式碼範例

安裝

驗證

HTTP 標頭

提交請求

Input Schema

範例請求主體

Output Schema

範例回應

Atlas Cloud Skills

支援的客戶端

安裝

設定 API Key

功能

MCP Server

支援的客戶端

安裝

設定

可用工具

Qwen3-Max Large Language Model

Overview

What is Qwen3-Max?

Core Features

Trillion-Scale MoE Architecture

Ultra-Long Context Capability

Advanced Reasoning & Intelligence

Multilingual & Cross-Domain Support

Model Capabilities

API & Integration

OpenAI-Compatible API

Example Usage (Python)

Technical Specifications

Performance Characteristics

Use Cases & Scenarios

Enterprise Applications

Developer & AI Products

Research & Advanced Workloads

Why Choose Qwen3-Max?

Summary

探索類似模型

Qwen3.6 35B A3B

Qwen3.6 Plus

Qwen3.5 122B A10B

Qwen3.5 35B A3B

Qwen3.5 27B

Qwen3 Coder Next

Qwen3.5 397BA17B

Qwen3 VL 30B A3B Thinking

Qwen3 VL 8B Instruct

Qwen3 VL 30B A3B Instruct

Qwen3.5 Flash

Qwen3.5 Plus

Qwen3.7 Plus

Qwen3.7 Max

Qwen3-VL-235B-A22B-Instruct

Qwen3 30B A3B Instruct 2507

一個 API，暢享全模態 AI。

Join our Discord community

參數

程式碼範例

安裝

驗證

HTTP 標頭

提交請求

Input Schema

範例請求主體

Output Schema

範例回應

Atlas Cloud Skills

支援的客戶端

安裝

設定 API Key

功能

MCP Server

支援的客戶端

安裝

設定

可用工具