DeepSeek V4 Pro

GLM 5.2

Agent-oriented model built for complex reasoning, tool use, and autonomous task execution.

Input: $1.4/M tokens

Output: $4.4/M tokens

$1.4/$4.4 per M tokens (Input/Output)
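
As a rough illustration of how per-million-token rates like the ones listed here translate into a per-request charge, the following is a minimal sketch. The function name `request_cost`, the token counts, and the assumption that billing is strictly linear in tokens (no tiers, caching discounts, or minimums) are hypothetical and are not taken from the listing; only the GLM 5.2 rates are.

```python
from decimal import Decimal

TOKENS_PER_MILLION = Decimal(1_000_000)


def request_cost(input_tokens, output_tokens, input_price_per_m, output_price_per_m):
    """Return the USD cost of one request.

    Assumes simple linear billing: tokens / 1M * listed price.
    Prices are passed as strings so Decimal keeps them exact.
    """
    input_cost = Decimal(input_tokens) * Decimal(input_price_per_m) / TOKENS_PER_MILLION
    output_cost = Decimal(output_tokens) * Decimal(output_price_per_m) / TOKENS_PER_MILLION
    return input_cost + output_cost


# GLM 5.2 rates from the listing ($1.4 input, $4.4 output per M tokens);
# the token counts below are made up for illustration.
cost = request_cost(200_000, 50_000, "1.4", "4.4")
print(f"Estimated cost: ${cost:.2f}")
```

`Decimal` is used instead of floats so that small per-token amounts do not pick up binary rounding noise when summed over many requests.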

Kimi K2.7 Code

Powerful coding model for programming, debugging, and AI developer workflows.

Input: $0.95/M tokens

$0.95/$4 per M tokens (Input/Output)

Grok Build 0.1

Specialized coding model optimized for software development, code generation, debugging, refactoring, and developer workflows.

Grok 4.3

Advanced conversational AI model optimized for natural dialogue, knowledge exploration, reasoning, and interactive chat experiences.

Input: $1.25/M tokens

Output: $2.5/M tokens

Max Output: 1000.00K

$1.25/$2.5 per M tokens (Input/Output)

Claude Opus 4.8

Anthropic's most capable model, built for advanced reasoning, complex workflows, deep analysis, and high-quality content generation.

Gemini 3.5 Flash

Fast and cost-efficient multimodal model designed for high-throughput applications, real-time interactions, and everyday AI tasks.

Input: $1.5/M tokens

Output: $9/M tokens

$1.5/$9 per M tokens (Input/Output)

DeepSeek V4 Pro

Input: $1.68/M tokens

Output: $3.38/M tokens

Max Output: 393.22K

DeepSeek V4 Flash

DeepSeek V4 Flash is a state-of-the-art large language model combining efficient sparse attention, strong reasoning, and integrated agent capabilities for robust long-context understanding and versatile AI applications.

Input: $0.14/M tokens

Output: $0.28/M tokens

Max Output: 393.22K

$0.14/$0.28 per M tokens (Input/Output)

OWL

No description

Kimi K2.6

Enhanced model for reasoning, coding, and productivity.

Input: $0.95/M tokens

$0.95/$4 per M tokens (Input/Output)

Qwen3.6 35B A3B

The latest Qwen reasoning model.

Input: $0.161/M tokens

Output: $0.965/M tokens

$0.161/$0.965 per M tokens (Input/Output)

Qwen3.6 Plus

Versatile model for chat and productivity workflows.

Input: $0.325/M tokens

Output: $1.95/M tokens

$0.325/$1.95 per M tokens (Input/Output)

Doubao Seed 2.0 Pro

Professional-grade model built for advanced workloads, complex analysis, and enterprise AI applications.

Input: $0.5/M tokens

Output: $3/M tokens

$0.5/$3 per M tokens (Input/Output)

Doubao Seed 2.0 Code Preview

Developer-focused model specialized in coding agents, repository understanding, and software engineering.

Input: $0.5/M tokens

Output: $3/M tokens

$0.5/$3 per M tokens (Input/Output)

Doubao Seed 2.0 Lite

Ultra-efficient model focused on lightweight AI tasks, rapid inference, and large-scale deployment.

Input: $0.25/M tokens

Output: $2/M tokens

$0.25/$2 per M tokens (Input/Output)

Doubao Seed 2.0 Mini

Small yet capable model designed for edge scenarios, automation, and cost-sensitive services.

Input: $0.1/M tokens

Output: $0.4/M tokens

$0.1/$0.4 per M tokens (Input/Output)

Doubao Seed 1.8

Next-generation assistant model with improved instruction following and deeper contextual understanding.

Input: $0.25/M tokens

Output: $2/M tokens

$0.25/$2 per M tokens (Input/Output)

Doubao Seed 1.6 Flash

High-speed model engineered for instant responses, real-time interaction, and massive request workloads.

Input: $0.075/M tokens

Output: $0.3/M tokens

Max Output: 32.77K

Doubao Seed 1.6

Versatile foundation model providing reliable conversation, knowledge understanding, and content creation.

Input: $0.25/M tokens

Output: $2/M tokens

$0.25/$2 per M tokens (Input/Output)

GLM 5.1

GLM-5.1 is Z.AI’s latest flagship model, featuring upgrades in two key areas: enhanced programming capabilities and more stable multi-step reasoning/execution. It demonstrates significant improvements in executing complex agent tasks while delivering more natural conversational experiences and superior front-end aesthetics.

Input: $1.26/M tokens

Output: $3.96/M tokens

$1.26/$3.96 per M tokens (Input/Output)

MiniMax M2.7

MiniMax-M2.7 is a lightweight, state-of-the-art large language model optimized for coding, agentic workflows, and modern application development. With only 10 billion activated parameters, it delivers a major jump in real-world capability while maintaining exceptional latency, scalability, and cost efficiency.

Input: $0.3/M tokens

Output: $1.2/M tokens

$0.3/$1.2 per M tokens (Input/Output)

MiniMax M3

MiniMax M3 is a lightweight, state-of-the-art large language model optimized for coding, agentic workflows, and modern application development. With only 10 billion activated parameters, it delivers a major jump in real-world capability while maintaining exceptional latency, scalability, and cost efficiency.

Input: $0.42/M tokens

Output: $1.68/M tokens

Max Output: 524.29K

Qwen3.5 122B A10B

Qwen3.5 represents a significant leap forward, integrating breakthroughs in multimodal learning, architectural efficiency, reinforcement learning scale, and global accessibility to empower developers and enterprises with unprecedented capability and efficiency.

Input: $0.3/M tokens

Output: $2.4/M tokens

$0.3/$2.4 per M tokens (Input/Output)

Qwen3.5 35B A3B

Input: $0.225/M tokens

Output: $1.8/M tokens

$0.225/$1.8 per M tokens (Input/Output)

Qwen3.5 27B

Input: $0.27/M tokens

Output: $2.16/M tokens

$0.27/$2.16 per M tokens (Input/Output)

Qwen3 Coder Next

Qwen3 Coder represents a significant leap forward, integrating breakthroughs in multimodal learning, architectural efficiency, reinforcement learning scale, and global accessibility to empower developers and enterprises with unprecedented capability and efficiency.

Input: $0.18/M tokens

Output: $1.35/M tokens

$0.18/$1.35 per M tokens (Input/Output)

Gradient-Based

Qwen3.5 397B A17B

Input: $0.55/M tokens

Output: $3.5/M tokens

$0.55/$3.5 per M tokens (Input/Output)

MiniMax M2.5

MiniMax-M2.5 is a lightweight, state-of-the-art large language model optimized for coding, agentic workflows, and modern application development. With only 10 billion activated parameters, it delivers a major jump in real-world capability while maintaining exceptional latency, scalability, and cost efficiency.

Input: $0.295/M tokens

Output: $1.2/M tokens

$0.295/$1.2 per M tokens (Input/Output)

GLM 5v Turbo

GLM-5v Turbo is Z.AI’s latest flagship model, featuring upgrades in two key areas: enhanced programming capabilities and more stable multi-step reasoning/execution. It demonstrates significant improvements in executing complex agent tasks while delivering more natural conversational experiences and superior front-end aesthetics.

Input: $1.2/M tokens

$1.2/$4 per M tokens (Input/Output)

GLM 5 Turbo

GLM-5 Turbo is Z.AI’s latest flagship model, featuring upgrades in two key areas: enhanced programming capabilities and more stable multi-step reasoning/execution. It demonstrates significant improvements in executing complex agent tasks while delivering more natural conversational experiences and superior front-end aesthetics.

Input: $1.2/M tokens

$1.2/$4 per M tokens (Input/Output)

GLM 5

GLM-5 is Z.AI’s latest flagship model, featuring upgrades in two key areas: enhanced programming capabilities and more stable multi-step reasoning/execution. It demonstrates significant improvements in executing complex agent tasks while delivering more natural conversational experiences and superior front-end aesthetics.

Input: $0.95/M tokens

Output: $3.15/M tokens

$0.95/$3.15 per M tokens (Input/Output)

Qwen3 VL 30B A3B Thinking

The latest Qwen reasoning model.

Input: $0.15/M tokens

Output: $1.5/M tokens

$0.15/$1.5 per M tokens (Input/Output)

Qwen3 VL 8B Instruct

The latest Qwen reasoning model.

Input: $0.08/M tokens

Output: $0.5/M tokens

$0.08/$0.5 per M tokens (Input/Output)

Qwen3 VL 30B A3B Instruct

The latest Qwen reasoning model.

Input: $0.15/M tokens

Output: $0.6/M tokens

$0.15/$0.6 per M tokens (Input/Output)

Kimi K2.5

Powerful model for long-context and intelligent workflows.

Input: $0.49/M tokens

Output: $2.5/M tokens

$0.49/$2.5 per M tokens (Input/Output)

Qwen3.7 Max

Flagship model for advanced reasoning, coding, and complex tasks.

Input: $2.5/M tokens

Output: $7.5/M tokens

$2.5/$7.5 per M tokens (Input/Output)

Qwen3.7 Plus

Balanced model combining strong capability, speed, and efficiency.

Input: $0.4/M tokens

Output: $1.6/M tokens

$0.4/$1.6 per M tokens (Input/Output)

Qwen3.5 Plus

Efficient model for everyday tasks and AI assistants.

Input: $0.4/M tokens

Output: $2.4/M tokens

$0.4/$2.4 per M tokens (Input/Output)

Qwen3.5 Flash

Fast model optimized for instant responses and large-scale usage.

Input: $0.1/M tokens

Output: $0.4/M tokens

$0.1/$0.4 per M tokens (Input/Output)

Qwen3 Max 20260123

Qwen3-Max is a flagship large language model designed for ultra-long context understanding, powerful reasoning, and high-performance text and code generation, making it well suited for complex, large-scale, and production-grade AI applications.

Input: $1.2/M tokens

Output: $6/M tokens

$1.2/$6 per M tokens (Input/Output)

Gradient-Based

MiniMax M2.1

MiniMax-M2.1 is a lightweight, state-of-the-art large language model optimized for coding, agentic workflows, and modern application development. With only 10 billion activated parameters, it delivers a major jump in real-world capability while maintaining exceptional latency, scalability, and cost efficiency.

Input: $0.29/M tokens

Output: $0.95/M tokens

$0.29/$0.95 per M tokens (Input/Output)

GLM 4.7

GLM-4.7 is Z.AI’s latest flagship model, featuring upgrades in two key areas: enhanced programming capabilities and more stable multi-step reasoning/execution. It demonstrates significant improvements in executing complex agent tasks while delivering more natural conversational experiences and superior front-end aesthetics.

Input: $0.52/M tokens

Output: $1.85/M tokens

$0.52/$1.85 per M tokens (Input/Output)

DeepSeek V3.2

DeepSeek V3.2 is a state-of-the-art large language model combining efficient sparse attention, strong reasoning, and integrated agent capabilities for robust long-context understanding and versatile AI applications.

Input: $0.26/M tokens

Output: $0.38/M tokens

Max Output: 163.84K

$0.26/$0.38 per M tokens (Input/Output)

GPT 5.4

Advanced multimodal model optimized for reasoning, coding, content generation, and complex problem-solving with strong accuracy and reliability.

Input: $2.5/M tokens

Output: $15/M tokens

Max Output: 128.00K

$2.5/$15 per M tokens (Input/Output)

GPT 5.5

Advanced multimodal model optimized for reasoning, coding, content generation, and complex problem-solving with strong accuracy and reliability.

KAT Coder Pro V2

KAT Coder Pro is KwaiKAT's most advanced agentic coding model in the KAT-Coder series. Designed specifically for agentic coding tasks, it excels in real-world software engineering scenarios, achieving a 73.4% solve rate on the SWE-Bench Verified benchmark.

Input: $0.3/M tokens

Output: $1.2/M tokens

Max Output: 144.00K

$0.3/$1.2 per M tokens (Input/Output)

Gemini 3.1 Pro Preview

Preview version of Google's flagship reasoning model, offering enhanced analytical capabilities, long-context understanding, and advanced multimodal performance.

MiniMax M2

MiniMax-M2 is a lightweight, state-of-the-art large language model optimized for coding, agentic workflows, and modern application development. With only 10 billion activated parameters, it delivers a major jump in real-world capability while maintaining exceptional latency, scalability, and cost efficiency.

Input: $0.255/M tokens

Output: $1/M tokens

$0.255/$1 per M tokens (Input/Output)