GPU Cloud

On-Demand GPUs

Bare Metal

Pricing

Developers

Company

Model Platform

Create and scale visual imagination.

Large Language Models

Power intelligent reasoning and conversation.

Serverless

Deploy, customize, and build with Dedicated Endpoint, Fine-Tuning, and DevPod.

Soluzioni Agent

GPU Cloud

On-Demand GPUs

Scale AI workloads with high-performance cloud GPUs.

Bare Metal

Run AI models faster on dedicated GPU servers.

Pricing

Developers

Company

minimaxai/minimax-m2.1

LLMHOTCODE

Home

Explore

LLM

All You Need to Know About This Model

Overview:

Model Provider:MINIMAX

Model Type:Large Language Model

Deployment:Inferencing API; Playground

Pricing:$0.3/M Input & $1.2/M Output

Key Specs:

Parameters:-

Context:196k tokens

Architecture Type:-

Knowledge Cutoff:-

Explore Similar Models

HOT

No description available.

LLM

MiniMax-M2

196.6K CONTEXT:

Input type:

Output type:

Context:196.61K

Input:$0.2/M tokens

Output:$1/M tokens

Max Output:131.07K

$0.2/1M in/out

NEW

HOT

Gemini 3 Flash is Googles state-of-the-art AI model designed for advanced reasoning, coding, mathematics, and scientific tasks.

LLM

PREVIEW

Gemini 3 Flash Preview

200.0K CONTEXT:

Input type:

Output type:

Context:200.00K

Input:$0.4/M tokens

Output:$2.4/M tokens

Max Output:65.54K

$0.4/2.4M in/out

NEW

HOT

Gemini 3 Flash is Googles state-of-the-art AI model designed for advanced reasoning, coding, mathematics, and scientific tasks.

LLM

DEV

Gemini 3 Flash Preview Developer

200.0K CONTEXT:

Input type:

Output type:

Context:200.00K

Input:$0.25/M tokens

Output:$1.5/M tokens

Max Output:65.54K

$0.25/1.5M in/out

NEW

HOT

GLM-4.7 is Z.AI’s latest flagship model, featuring upgrades in two key areas: enhanced programming capabilities and more stable multi-step reasoning/execution. It demonstrates significant improvements in executing complex agent tasks while delivering more natural conversational experiences and superior front-end aesthetics.

LLM