qwen/qwen3-max-2026-01-23

LLM

Qwen3 Max 20260123 API by Alibaba

qwen/qwen3-max-2026-01-23

Qwen3-max-2026-01-23

Qwen3-Max is a flagship large language model designed for ultra-long context understanding, powerful reasoning, and high-performance text and code generation, making it well suited for complex, large-scale, and production-grade AI applications.

Tham số

Ví dụ mã
import os
from openai import OpenAI

client = OpenAI(
    api_key=os.getenv("ATLASCLOUD_API_KEY"),
    base_url="https://api.atlascloud.ai/v1"
)

response = client.chat.completions.create(
    model="qwen/qwen3-max-2026-01-23",
    messages=[
    {
        "role": "user",
        "content": "hello"
    }
],
    max_tokens=1024,
    temperature=0.7
)

print(response.choices[0].message.content)

Cài đặt

Cài đặt gói cần thiết cho ngôn ngữ lập trình của bạn.

pip install requests

Xác thực

Tất cả các yêu cầu API đều cần xác thực thông qua khóa API. Bạn có thể lấy khóa API từ bảng điều khiển Atlas Cloud.

export ATLASCLOUD_API_KEY="your-api-key-here"

HTTP Headers

import os

API_KEY = os.environ.get("ATLASCLOUD_API_KEY")
headers = {
    "Content-Type": "application/json",
    "Authorization": f"Bearer {API_KEY}"
}

Bảo mật khóa API của bạn

Không bao giờ để lộ khóa API trong mã phía máy khách hoặc kho lưu trữ công khai. Thay vào đó, hãy sử dụng biến môi trường hoặc proxy phía máy chủ.

Gửi yêu cầu

import requests

url = "https://api.atlascloud.ai/v1/chat/completions"
headers = {
    "Content-Type": "application/json",
    "Authorization": "Bearer $ATLASCLOUD_API_KEY"
}
data = {
    "model": "your-model",
    "messages": [{"role": "user", "content": "Hello"}],
    "max_tokens": 1024
}

response = requests.post(url, headers=headers, json=data)
print(response.json())

Input Schema

Các tham số sau được chấp nhận trong nội dung yêu cầu.

Tổng cộng: 9Bắt buộc: 2Tùy chọn: 7

modelstringrequired

The model ID to use for the completion.

Example: "qwen/qwen3-max-2026-01-23"

messagesarray[object]required

A list of messages comprising the conversation so far.

rolestringrequired

The role of the message author. One of "system", "user", or "assistant".

systemuserassistant

contentstringrequired

The content of the message.

max_tokensinteger

The maximum number of tokens to generate in the completion.

Default: 1024Min: 1

temperaturenumber

Sampling temperature between 0 and 2. Higher values make output more random, lower values more focused and deterministic.

Default: 0.7Min: 0Max: 2

top_pnumber

Nucleus sampling parameter. The model considers the tokens with top_p probability mass.

Default: 1Min: 0Max: 1

streamboolean

If set to true, partial message deltas will be sent as server-sent events.

Default: false

stoparray[string]

Up to 4 sequences where the API will stop generating further tokens.

frequency_penaltynumber

Penalizes new tokens based on their existing frequency in the text so far. Between -2.0 and 2.0.

Default: 0Min: -2Max: 2

presence_penaltynumber

Penalizes new tokens based on whether they appear in the text so far. Between -2.0 and 2.0.

Default: 0Min: -2Max: 2

Ví dụ nội dung yêu cầu

{
  "model": "qwen/qwen3-max-2026-01-23",
  "messages": [
    {
      "role": "user",
      "content": "Hello"
    }
  ],
  "max_tokens": 1024,
  "temperature": 0.7,
  "stream": false
}

Output Schema

API trả về phản hồi tương thích với ChatCompletion.

idstringrequired

Unique identifier for the completion.

objectstringrequired

Object type, always "chat.completion".

Default: "chat.completion"

createdintegerrequired

Unix timestamp of when the completion was created.

modelstringrequired

The model used for the completion.

choicesarray[object]required

List of completion choices.

indexintegerrequired

Index of the choice.

messageobjectrequired

The generated message.

finish_reasonstringrequired

The reason generation stopped.

stoplengthcontent_filter

usageobjectrequired

Token usage statistics.

prompt_tokensintegerrequired

Number of tokens in the prompt.

completion_tokensintegerrequired

Number of tokens in the completion.

total_tokensintegerrequired

Total tokens used.

Ví dụ phản hồi

{
  "id": "chatcmpl-abc123",
  "object": "chat.completion",
  "created": 1700000000,
  "model": "model-name",
  "choices": [
    {
      "index": 0,
      "message": {
        "role": "assistant",
        "content": "Hello! How can I assist you today?"
      },
      "finish_reason": "stop"
    }
  ],
  "usage": {
    "prompt_tokens": 10,
    "completion_tokens": 20,
    "total_tokens": 30
  }
}

Atlas Cloud Skills

Atlas Cloud Skills tích hợp hơn 300 mô hình AI trực tiếp vào trợ lý lập trình AI của bạn. Một lệnh để cài đặt, sau đó sử dụng ngôn ngữ tự nhiên để tạo hình ảnh, video và trò chuyện với LLM.

Ứng dụng được hỗ trợ

Claude Code

OpenAI Codex

Gemini CLI

Cursor

Windsurf

VS Code

Trae

GitHub Copilot

Cline

Roo Code

Amp

Goose

Replit

40+ ứng dụng được hỗ trợ

Cài đặt

npx skills add AtlasCloudAI/atlas-cloud-skills

Thiết lập khóa API

Lấy khóa API từ bảng điều khiển Atlas Cloud và đặt nó làm biến môi trường.

export ATLASCLOUD_API_KEY="your-api-key-here"

Khả năng

Sau khi cài đặt, bạn có thể sử dụng ngôn ngữ tự nhiên trong trợ lý AI để truy cập tất cả các mô hình Atlas Cloud.

Tạo hình ảnhTạo hình ảnh với các mô hình như Nano Banana 2, Z-Image và nhiều hơn nữa.

Tạo videoTạo video từ văn bản hoặc hình ảnh với Kling, Vidu, Veo, v.v.

Trò chuyện LLMTrò chuyện với Qwen, DeepSeek và các mô hình ngôn ngữ lớn khác.

Tải lên phương tiệnTải tệp cục bộ lên để chỉnh sửa hình ảnh và quy trình chuyển hình ảnh sang video.

Tìm hiểu thêm

github.com/AtlasCloudAI/atlas-cloud-skills

MCP Server

Atlas Cloud MCP Server kết nối IDE của bạn với hơn 300 mô hình AI thông qua Model Context Protocol. Hoạt động với bất kỳ ứng dụng tương thích MCP nào.

Ứng dụng được hỗ trợ

Cursor

VS Code

Windsurf

Claude Code

OpenAI Codex

Gemini CLI

Cline

Roo Code

100+ ứng dụng được hỗ trợ

Cài đặt

npx -y atlascloud-mcp

Cấu hình

Thêm cấu hình sau vào tệp cài đặt MCP của IDE.

{
  "mcpServers": {
    "atlascloud": {
      "command": "npx",
      "args": [
        "-y",
        "atlascloud-mcp"
      ],
      "env": {
        "ATLASCLOUD_API_KEY": "your-api-key-here"
      }
    }
  }
}

Công cụ khả dụng

atlas_generate_imageTạo hình ảnh từ mô tả văn bản.

atlas_generate_videoTạo video từ văn bản hoặc hình ảnh.

atlas_chatTrò chuyện với các mô hình ngôn ngữ lớn.

atlas_list_modelsDuyệt hơn 300 mô hình AI khả dụng.

atlas_quick_generateTạo nội dung một bước với khả năng tự động chọn mô hình tốt nhất.

atlas_upload_mediaTải tệp cục bộ lên cho quy trình API.

Tìm hiểu thêm

github.com/AtlasCloudAI/mcp-server

Qwen3-Max Large Language Model

Overview

Qwen3-Max is a state-of-the-art large language model (LLM) developed by Alibaba Cloud as the flagship model of the Qwen3 series. It is designed to deliver high-performance reasoning, ultra-long context processing, and general-purpose AI capabilities for enterprise and developer use cases.

As a trillion-scale Mixture-of-Experts (MoE) model, Qwen3-Max combines massive model capacity with efficient inference, making it suitable for both complex reasoning tasks and large-scale production deployment.

What is Qwen3-Max?

Qwen3-Max is a general-purpose foundation model optimized for:

Advanced reasoning and analytical tasks
Long-context understanding and document processing
Multilingual natural language understanding and generation
Code generation, explanation, and debugging
Instruction-following and conversational AI systems

It is part of the Qwen (Tongyi Qianwen) model family and represents the highest-capacity model available in the Qwen3 lineup.

Core Features

Trillion-Scale MoE Architecture

Built with a Mixture-of-Experts (MoE) architecture to enable scalable intelligence
Trillion-level total parameters, activating only a subset of experts per request for efficiency
Designed for high throughput, stability, and performance at scale
Suitable for cloud-based deployment and enterprise workloads

Ultra-Long Context Capability

Supports extremely long context windows, enabling:
- Long document understanding
- Multi-file code analysis
- Large knowledge base ingestion
- Retrieval-augmented generation (RAG) workflows
Well-suited for legal, financial, technical, and research documents

Advanced Reasoning & Intelligence

Strong performance in:
- Logical reasoning
- Mathematical problem solving
- Multi-step instruction execution
- Knowledge-intensive question answering
Optimized for both fast responses and deep reasoning depending on task complexity

Multilingual & Cross-Domain Support

Trained on large-scale multilingual data
Capable of understanding and generating content across many languages
Performs well across domains such as:
- Technology
- Programming
- Science
- Business
- Education

Model Capabilities

Qwen3-Max can be used as a foundation model for a wide range of AI applications, including but not limited to:

Chatbots and conversational AI
AI assistants and copilots
Code generation and software engineering tools
Enterprise knowledge assistants
Document analysis and summarization
Search, QA, and reasoning systems
Multilingual content generation

API & Integration

OpenAI-Compatible API

Qwen3-Max is available through a cloud API that follows the OpenAI-compatible interface, allowing developers to:

Reuse existing OpenAI-style SDKs
Integrate with minimal code changes
Deploy across backend services, agents, and AI workflows

Example Usage (Python)

from openai import OpenAI
import os

client = OpenAI(
    api_key=os.getenv("API_KEY"),
    base_url="https://dashscope-intl.aliyuncs.com/compatible-mode/v1",
)

response = client.chat.completions.create(
    model="qwen3-max",
    messages=[
        {"role": "user", "content": "Summarize the key capabilities of Qwen3-Max."}
    ],
)

print(response.choices[0].message.content)

Technical Specifications

Specification	Details
Model Name	qwen3-max
Model Family	Qwen3
Model Type	Large Language Model (LLM)
Architecture	Mixture-of-Experts (MoE)
Parameter Scale	Trillion-scale
Context Length	Ultra-long context support
Training Data	Large-scale multilingual corpus
Supported Tasks	Chat, reasoning, coding, multilingual generation, long-document analysis
Deployment	Cloud API, OpenAI-compatible

Performance Characteristics

Designed for high accuracy and robustness on complex tasks
Efficient inference despite large model size
Scales well across concurrent requests
Suitable for production environments requiring stability and reliability

Use Cases & Scenarios

Enterprise Applications

Internal knowledge bases
Intelligent document processing
Customer support automation
Business intelligence assistants

Developer & AI Products

AI coding assistants
Agent frameworks and tool-using LLMs
Search-augmented and RAG systems
Multi-step reasoning pipelines

Research & Advanced Workloads

Long-form reasoning experiments
Large-context evaluation
Multilingual NLP research

Why Choose Qwen3-Max?

Trillion-scale intelligence with efficient MoE design
Strong reasoning and instruction-following performance
Ultra-long context handling for complex real-world data
OpenAI-compatible API for easy integration
Suitable for both enterprise and developer-focused AI systems

Summary

Qwen3-Max is a powerful, large-scale language model designed for modern AI applications that demand reasoning ability, long-context understanding, and production-ready performance. As the flagship model of the Qwen3 series, it provides a strong foundation for building advanced AI products, intelligent assistants, and enterprise-grade solutions.

Khám phá Các Mô hình Tương tự

NEW

The latest Qwen reasoning model.

Qwen3 Max 20260123 API by Alibaba

Tham số

Ví dụ mã

Cài đặt

Xác thực

HTTP Headers

Gửi yêu cầu

Input Schema

Ví dụ nội dung yêu cầu

Output Schema

Ví dụ phản hồi

Atlas Cloud Skills

Ứng dụng được hỗ trợ

Cài đặt

Thiết lập khóa API

Khả năng

MCP Server

Ứng dụng được hỗ trợ

Cài đặt

Cấu hình

Công cụ khả dụng

Qwen3-Max Large Language Model

Overview

What is Qwen3-Max?

Core Features

Trillion-Scale MoE Architecture

Ultra-Long Context Capability

Advanced Reasoning & Intelligence

Multilingual & Cross-Domain Support

Model Capabilities

API & Integration

OpenAI-Compatible API

Example Usage (Python)

Technical Specifications

Performance Characteristics

Use Cases & Scenarios

Enterprise Applications

Developer & AI Products

Research & Advanced Workloads

Why Choose Qwen3-Max?

Summary

Khám phá Các Mô hình Tương tự

Qwen3.6 35B A3B

Qwen3.6 Plus

Qwen3.5 122B A10B

Qwen3.5 35B A3B

Qwen3.5 27B

Qwen3 Coder Next

Qwen3.5 397BA17B

Qwen3 VL 30B A3B Thinking

Qwen3 VL 8B Instruct

Qwen3 VL 30B A3B Instruct

Qwen3.5 Flash

Qwen3.5 Plus

Qwen3.7 Plus

Qwen3.7 Max

Qwen3-VL-235B-A22B-Instruct

Qwen3 30B A3B Instruct 2507

Một API cho mọi AI đa phương tiện.

Join our Discord community

Tham số

Ví dụ mã

Cài đặt

Xác thực

HTTP Headers

Gửi yêu cầu

Input Schema

Ví dụ nội dung yêu cầu

Output Schema

Ví dụ phản hồi

Atlas Cloud Skills

Ứng dụng được hỗ trợ

Cài đặt

Thiết lập khóa API

Khả năng

MCP Server

Ứng dụng được hỗ trợ

Cài đặt

Cấu hình

Công cụ khả dụng