Modeller

ExplorePriser Utvecklare Enterprise

Modeller

Bild- & Videomodeller

Skapa och skala visuell fantasi.

Driva intelligent resonemang och konversation.

Explore

Utvalda kollektioner

Bild- & Videomodeller

Inställningar

Fakturering & påfyllning

Användningshistorik

Priser Utvecklare Enterprise

DeepSeek LLM Models

deepseek-ai/deepseek-ocr

DeepSeek OCR

LLM

DeepSeek OCR API by DeepSeek

deepseek-ai/deepseek-ocr

Deepseek-ocr

The latest Deepseek model.

DeepSeek-V3.1

Introduction

DeepSeek-V3.1 is a hybrid model that supports both thinking mode and non-thinking mode. Compared to the previous version, this upgrade brings improvements in multiple aspects:

Hybrid thinking mode: One model supports both thinking mode and non-thinking mode by changing the chat template.
Smarter tool calling: Through post-training optimization, the model's performance in tool usage and agent tasks has significantly improved.
Higher thinking efficiency: DeepSeek-V3.1-Think achieves comparable answer quality to DeepSeek-R1-0528, while responding more quickly.

DeepSeek-V3.1 is post-trained on the top of DeepSeek-V3.1-Base, which is built upon the original V3 base checkpoint through a two-phase long context extension approach, following the methodology outlined in the original DeepSeek-V3 report. We have expanded our dataset by collecting additional long documents and substantially extending both training phases. The 32K extension phase has been increased 10-fold to 630B tokens, while the 128K extension phase has been extended by 3.3x to 209B tokens. Additionally, DeepSeek-V3.1 is trained using the UE8M0 FP8 scale data format to ensure compatibility with microscaling data formats.

Model Downloads

Model	#Total Params	#Activated Params	Context Length	Download
DeepSeek-V3.1-Base	671B	37B	128K	HuggingFace \| ModelScope
DeepSeek-V3.1	671B	37B	128K	HuggingFace \| ModelScope

Chat Template

The details of our chat template is described in tokenizer_config.json and assets/chat_template.jinja. Here is a brief description.

Thinking

v3.1 support both thinking and no-thinking. we now both support turning thinking with qwen mode

"chat_template_kwargs": {"enable_thinking": true}

and zai mode

"thinking":{"type":"enabled"}

ToolCall

Toolcall is supported in non-thinking mode. The format is:

<｜begin▁of▁sentence｜>{system prompt}{tool_description}<｜User｜>{query}<｜Assistant｜></think> where the tool_description is

## Tools
You have access to the following tools:

### {tool_name1}
Description: {description}

Parameters: {json.dumps(parameters)}

IMPORTANT: ALWAYS adhere to this exact format for tool use:
<｜tool▁calls▁begin｜><｜tool▁call▁begin｜>tool_call_name<｜tool▁sep｜>tool_call_arguments<｜tool▁call▁end｜>{{additional_tool_calls}}<｜tool▁calls▁end｜>

Where:
- `tool_call_name` must be an exact match to one of the available tools
- `tool_call_arguments` must be valid JSON that strictly follows the tool's Parameters Schema
- For multiple tool calls, chain them directly without separators or spaces

Code-Agent

We support various code agent frameworks. Please refer to the above toolcall format to create your own code agents. An example is shown in assets/code_agent_trajectory.html.

Search-Agent

We design a specific format for searching toolcall in thinking mode, to support search agent.

For complex questions that require accessing external or up-to-date information, DeepSeek-V3.1 can leverage a user-provided search tool through a multi-turn tool-calling process.

Please refer to the assets/search_tool_trajectory.html and assets/search_python_tool_trajectory.html for the detailed template.

Evaluation

Category	Benchmark (Metric)	DeepSeek V3.1-NonThinking	DeepSeek V3 0324	DeepSeek V3.1-Thinking	DeepSeek R1 0528
General
	MMLU-Redux (EM)	91.8	90.5	93.7	93.4
	MMLU-Pro (EM)	83.7	81.2	84.8	85.0
	GPQA-Diamond (Pass@1)	74.9	68.4	80.1	81.0
	Humanity's Last Exam (Pass@1)	-	-	15.9	17.7
Search Agent
	BrowseComp	-	-	30.0	8.9
	BrowseComp_zh	-	-	49.2	35.7
	Humanity's Last Exam (Python + Search)	-	-	29.8	24.8
	SimpleQA	-	-	93.4	92.3
Code
	LiveCodeBench (2408-2505) (Pass@1)	56.4	43.0	74.8	73.3
	Codeforces-Div1 (Rating)	-	-	2091	1930
	Aider-Polyglot (Acc.)	68.4	55.1	76.3	71.6
Code Agent
	SWE Verified (Agent mode)	66.0	45.4	-	44.6
	SWE-bench Multilingual (Agent mode)	54.5	29.3	-	30.5
	Terminal-bench (Terminus 1 framework)	31.3	13.3	-	5.7
Math
	AIME 2024 (Pass@1)	66.3	59.4	93.1	91.4
	AIME 2025 (Pass@1)	49.8	51.3	88.4	87.5
	HMMT 2025 (Pass@1)	33.5	29.2	84.2	79.4

Note:

Search agents are evaluated with our internal search framework, which uses a commercial search API + webpage filter + 128K context window. Seach agent results of R1-0528 are evaluated with a pre-defined workflow.
SWE-bench is evaluated with our internal code agent framework.
HLE is evaluated with the text-only subset.

Usage Example

import transformers

tokenizer = transformers.AutoTokenizer.from_pretrained("deepseek-ai/DeepSeek-V3.1")

messages = [
    {"role": "system", "content": "You are a helpful assistant"},
    {"role": "user", "content": "Who are you?"},
    {"role": "assistant", "content": "<think>Hmm</think>I am DeepSeek"},
    {"role": "user", "content": "1+1=?"}
]

tokenizer.apply_chat_template(messages, tokenize=False, thinking=True, add_generation_prompt=True)
# '<｜begin▁of▁sentence｜>You are a helpful assistant<｜User｜>Who are you?<｜Assistant｜></think>I am DeepSeek<｜end▁of▁sentence｜><｜User｜>1+1=?<｜Assistant｜><think>'

tokenizer.apply_chat_template(messages, tokenize=False, thinking=False, add_generation_prompt=True)
# '<｜begin▁of▁sentence｜>You are a helpful assistant<｜User｜>Who are you?<｜Assistant｜></think>I am DeepSeek<｜end▁of▁sentence｜><｜User｜>1+1=?<｜Assistant｜></think>'

How to Run Locally

The model structure of DeepSeek-V3.1 is the same as DeepSeek-V3. Please visit DeepSeek-V3 repo for more information about running this model locally.

License

This repository and the model weights are licensed under the MIT License.

Citation

@misc{deepseekai2024deepseekv3technicalreport,
      title={DeepSeek-V3 Technical Report}, 
      author={DeepSeek-AI},
      year={2024},
      eprint={2412.19437},
      archivePrefix={arXiv},
      primaryClass={cs.CL},
      url={https://arxiv.org/abs/2412.19437}, 
}

Contact

If you have any questions, please raise an issue or contact us at [email protected].

Utforska Liknande Modeller

DeepSeek V4 Pro

DeepSeek V4 Pro is a state-of-the-art large language model combining efficient sparse attention, strong reasoning, and integrated agent capabilities for robust long-context understanding and versatile AI applications.

DeepSeek V4 Pro

1048.6K KONTEXT:

Kontext:1048.58K

Inmatning:$1.68/M tokens

Utmatning:$3.38/M tokens

Max Utmatning:393.22K

$1.68/3.38M in/ut

DeepSeek V4 Flash

DeepSeek V4 Flash is a state-of-the-art large language model combining efficient sparse attention, strong reasoning, and integrated agent capabilities for robust long-context understanding and versatile AI applications.

DeepSeek V4 Flash

1048.6K KONTEXT:

Kontext:1048.58K

Inmatning:$0.14/M tokens

Utmatning:$0.28/M tokens

Max Utmatning:393.22K

$0.14/0.28M in/ut

DeepSeek V3.2 Speciale

Fastest, most cost-effective model from DeepSeek Ai.

DeepSeek V3.2 Speciale

163.8K KONTEXT:

Kontext:163.84K

Inmatning:$0.287/M tokens

Utmatning:$0.431/M tokens

Max Utmatning:163.84K

$0.287/0.431M in/ut

DeepSeek V3.2

DeepSeek V3.2 is a state-of-the-art large language model combining efficient sparse attention, strong reasoning, and integrated agent capabilities for robust long-context understanding and versatile AI applications.

DeepSeek V3.2

163.8K KONTEXT:

Kontext:163.84K

Inmatning:$0.26/M tokens

Utmatning:$0.38/M tokens

Max Utmatning:163.84K

$0.26/0.38M in/ut

Börja från 300+ Modeller,

Utforska alla modeller

Join our Discord community

Join the Discord community for the latest model updates, prompts, and support.

Parametrar

Kodexempel
import os
from openai import OpenAI

client = OpenAI(
    api_key=os.getenv("ATLASCLOUD_API_KEY"),
    base_url="https://api.atlascloud.ai/v1"
)

response = client.chat.completions.create(
    model="deepseek-ai/deepseek-ocr",
    messages=[
    {
        "role": "user",
        "content": "hello"
    }
],
    max_tokens=1024,
    temperature=0.7
)

print(response.choices[0].message.content)

Installera

Installera det nödvändiga paketet för ditt programmeringsspråk.

pip install requests

Autentisering

Alla API-förfrågningar kräver autentisering via en API key. Du kan hämta din API key från Atlas Cloud-instrumentpanelen.

export ATLASCLOUD_API_KEY="your-api-key-here"

HTTP Headers

import os

API_KEY = os.environ.get("ATLASCLOUD_API_KEY")
headers = {
    "Content-Type": "application/json",
    "Authorization": f"Bearer {API_KEY}"
}

Håll din API key säker

Exponera aldrig din API key i klientkod eller publika arkiv. Använd miljövariabler eller en backend-proxy istället.

Skicka en förfrågan

import requests

url = "https://api.atlascloud.ai/v1/chat/completions"
headers = {
    "Content-Type": "application/json",
    "Authorization": "Bearer $ATLASCLOUD_API_KEY"
}
data = {
    "model": "your-model",
    "messages": [{"role": "user", "content": "Hello"}],
    "max_tokens": 1024
}

response = requests.post(url, headers=headers, json=data)
print(response.json())

Input Schema

Följande parametrar accepteras i förfrågningsinnehållet.

Totalt: 9Obligatorisk: 2Valfri: 7

modelstringrequired

The model ID to use for the completion.

Example: "deepseek-ai/deepseek-ocr"

messagesarray[object]required

A list of messages comprising the conversation so far.

rolestringrequired

The role of the message author. One of "system", "user", or "assistant".

systemuserassistant

contentstringrequired

The content of the message.

max_tokensinteger

The maximum number of tokens to generate in the completion.

Default: 1024Min: 1

temperaturenumber

Sampling temperature between 0 and 2. Higher values make output more random, lower values more focused and deterministic.

Default: 0.7Min: 0Max: 2

top_pnumber

Nucleus sampling parameter. The model considers the tokens with top_p probability mass.

Default: 1Min: 0Max: 1

streamboolean

If set to true, partial message deltas will be sent as server-sent events.

Default: false

stoparray[string]

Up to 4 sequences where the API will stop generating further tokens.

frequency_penaltynumber

Penalizes new tokens based on their existing frequency in the text so far. Between -2.0 and 2.0.

Default: 0Min: -2Max: 2

presence_penaltynumber

Penalizes new tokens based on whether they appear in the text so far. Between -2.0 and 2.0.

Default: 0Min: -2Max: 2

Exempel på förfrågningsinnehåll

{
  "model": "deepseek-ai/deepseek-ocr",
  "messages": [
    {
      "role": "user",
      "content": "Hello"
    }
  ],
  "max_tokens": 1024,
  "temperature": 0.7,
  "stream": false
}

Output Schema

API:et returnerar ett ChatCompletion-kompatibelt svar.

idstringrequired

Unique identifier for the completion.

objectstringrequired

Object type, always "chat.completion".

Default: "chat.completion"

createdintegerrequired

Unix timestamp of when the completion was created.

modelstringrequired

The model used for the completion.

choicesarray[object]required

List of completion choices.

indexintegerrequired

Index of the choice.

messageobjectrequired

The generated message.

finish_reasonstringrequired

The reason generation stopped.

stoplengthcontent_filter

usageobjectrequired

Token usage statistics.

prompt_tokensintegerrequired

Number of tokens in the prompt.

completion_tokensintegerrequired

Number of tokens in the completion.

total_tokensintegerrequired

Total tokens used.

Exempelsvar

{
  "id": "chatcmpl-abc123",
  "object": "chat.completion",
  "created": 1700000000,
  "model": "model-name",
  "choices": [
    {
      "index": 0,
      "message": {
        "role": "assistant",
        "content": "Hello! How can I assist you today?"
      },
      "finish_reason": "stop"
    }
  ],
  "usage": {
    "prompt_tokens": 10,
    "completion_tokens": 20,
    "total_tokens": 30
  }
}

Atlas Cloud Skills

Atlas Cloud Skills integrerar 300+ AI-modeller direkt i din AI-kodassistent. Ett kommando för att installera, sedan använd naturligt språk för att generera bilder, videor och chatta med LLM.

Stödda klienter

Claude Code

OpenAI Codex

Gemini CLI

Cursor

Windsurf

VS Code

Trae

GitHub Copilot

Cline

Roo Code

Amp

Goose

Replit

40+ stödda klienter

Installera

npx skills add AtlasCloudAI/atlas-cloud-skills

Konfigurera API Key

Hämta din API key från Atlas Cloud-instrumentpanelen och ställ in den som en miljövariabel.

export ATLASCLOUD_API_KEY="your-api-key-here"

Funktioner

När det är installerat kan du använda naturligt språk i din AI-assistent för att komma åt alla Atlas Cloud-modeller.

BildgenereringGenerera bilder med modeller som Nano Banana 2, Z-Image och fler.

VideoskapandeSkapa videor från text eller bilder med Kling, Vidu, Veo m.fl.

LLM-chattChatta med Qwen, DeepSeek och andra stora språkmodeller.

MediauppladdningLadda upp lokala filer för bildredigering och bild-till-video-arbetsflöden.

Läs mer

github.com/AtlasCloudAI/atlas-cloud-skills

MCP Server

Atlas Cloud MCP Server ansluter din IDE med 300+ AI-modeller via Model Context Protocol. Fungerar med alla MCP-kompatibla klienter.

Stödda klienter

Cursor

VS Code

Windsurf

Claude Code

OpenAI Codex

Gemini CLI

Cline

Roo Code

100+ stödda klienter

Installera

npx -y atlascloud-mcp

Konfiguration

Lägg till följande konfiguration i din IDE:s MCP-inställningsfil.

{
  "mcpServers": {
    "atlascloud": {
      "command": "npx",
      "args": [
        "-y",
        "atlascloud-mcp"
      ],
      "env": {
        "ATLASCLOUD_API_KEY": "your-api-key-here"
      }
    }
  }
}

Tillgängliga verktyg

atlas_generate_imageGenerera bilder från textpromptar.

atlas_generate_videoSkapa videor från text eller bilder.

atlas_chatChatta med stora språkmodeller.

atlas_list_modelsBläddra bland 300+ tillgängliga AI-modeller.

atlas_quick_generateInnehållsskapande i ett steg med automatiskt modellval.

atlas_upload_mediaLadda upp lokala filer för API-arbetsflöden.

Läs mer

github.com/AtlasCloudAI/mcp-server