deepseek-ai/deepseek-ocr

LLM

DeepSeek OCR API by DeepSeek

deepseek-ai/deepseek-ocr

Deepseek-ocr

The latest Deepseek model.

Parámetros

Ejemplo de código
import os
from openai import OpenAI

client = OpenAI(
    api_key=os.getenv("ATLASCLOUD_API_KEY"),
    base_url="https://api.atlascloud.ai/v1"
)

response = client.chat.completions.create(
    model="deepseek-ai/deepseek-ocr",
    messages=[
    {
        "role": "user",
        "content": "hello"
    }
],
    max_tokens=1024,
    temperature=0.7
)

print(response.choices[0].message.content)

Instalar

Instala el paquete de dependencias necesario.

pip install requests

Autenticación

Todas las solicitudes de API requieren autenticación mediante una clave de API. Puedes obtener tu clave de API desde el panel de Atlas Cloud.

export ATLASCLOUD_API_KEY="your-api-key-here"

Encabezados HTTP

import os

API_KEY = os.environ.get("ATLASCLOUD_API_KEY")
headers = {
    "Content-Type": "application/json",
    "Authorization": f"Bearer {API_KEY}"
}

Mantén tu clave de API segura

Nunca expongas tu clave de API en código del lado del cliente ni en repositorios públicos. Usa variables de entorno o un proxy de backend en su lugar.

Enviar una solicitud

import requests

url = "https://api.atlascloud.ai/v1/chat/completions"
headers = {
    "Content-Type": "application/json",
    "Authorization": "Bearer $ATLASCLOUD_API_KEY"
}
data = {
    "model": "your-model",
    "messages": [{"role": "user", "content": "Hello"}],
    "max_tokens": 1024
}

response = requests.post(url, headers=headers, json=data)
print(response.json())

Schema de entrada

Los siguientes parámetros se aceptan en el cuerpo de la solicitud.

Total: 5Obligatorio: 2Opcional: 3

modelstringrequired

The model ID to use for the completion.

Example: "deepseek-ai/deepseek-ocr"

messagesarray[object]required

A list of messages comprising the conversation so far.

rolestringrequired

The role of the message author. One of "system", "user", or "assistant".

systemuserassistant

contentstringrequired

The content of the message.

max_tokensinteger

The maximum number of tokens to generate in the completion.

Default: 1024Min: 1

temperaturenumber

Sampling temperature between 0 and 2. Higher values make output more random, lower values more focused and deterministic.

Default: 0.7Min: 0Max: 2

streamboolean

If set to true, partial message deltas will be sent as server-sent events.

Default: false

Ejemplo de cuerpo de solicitud

{
  "model": "deepseek-ai/deepseek-ocr",
  "messages": [
    {
      "role": "user",
      "content": "Hello"
    }
  ],
  "max_tokens": 1024,
  "temperature": 0.7,
  "stream": false
}

Schema de salida

La API devuelve una respuesta compatible con ChatCompletion.

idstringrequired

Unique identifier for the completion.

objectstringrequired

Object type, always "chat.completion".

Default: "chat.completion"

createdintegerrequired

Unix timestamp of when the completion was created.

modelstringrequired

The model used for the completion.

choicesarray[object]required

List of completion choices.

indexintegerrequired

Index of the choice.

messageobjectrequired

The generated message.

finish_reasonstringrequired

The reason generation stopped.

stoplengthcontent_filter

usageobjectrequired

Token usage statistics.

prompt_tokensintegerrequired

Number of tokens in the prompt.

completion_tokensintegerrequired

Number of tokens in the completion.

total_tokensintegerrequired

Total tokens used.

Ejemplo de respuesta

{
  "id": "chatcmpl-abc123",
  "object": "chat.completion",
  "created": 1700000000,
  "model": "model-name",
  "choices": [
    {
      "index": 0,
      "message": {
        "role": "assistant",
        "content": "Hello! How can I assist you today?"
      },
      "finish_reason": "stop"
    }
  ],
  "usage": {
    "prompt_tokens": 10,
    "completion_tokens": 20,
    "total_tokens": 30
  }
}

Atlas Cloud Skills

Atlas Cloud Skills integra más de 400 modelos de IA directamente en tu asistente de codificación con IA. Un solo comando para instalar y luego usa lenguaje natural para generar imágenes, videos y chatear con LLM.

Clientes compatibles

Claude Code

OpenAI Codex

Gemini CLI

Cursor

Windsurf

VS Code

Trae

GitHub Copilot

Cline

Roo Code

Amp

Goose

Replit

40+ clientes compatibles

Instalar

npx skills add AtlasCloudAI/atlas-cloud-skills

Configurar clave de API

Obtén tu clave de API desde el panel de Atlas Cloud y configúrala como variable de entorno.

export ATLASCLOUD_API_KEY="your-api-key-here"

Funcionalidades

Una vez instalado, puedes usar lenguaje natural en tu asistente de IA para acceder a todos los modelos de Atlas Cloud.

Generación de imágenesGenera imágenes con modelos como Nano Banana 2, Z-Image y más.

Creación de videosCrea videos a partir de texto o imágenes con Kling, Vidu, Veo, etc.

Chat con LLMChatea con Qwen, DeepSeek y otros modelos de lenguaje de gran escala.

Carga de mediosSube archivos locales para flujos de trabajo de edición de imágenes e imagen a video.

Más información

github.com/AtlasCloudAI/atlas-cloud-skills

MCP Server

Atlas Cloud MCP Server conecta tu IDE con más de 400 modelos de IA a través del Model Context Protocol. Funciona con cualquier cliente compatible con MCP.

Clientes compatibles

Cursor

VS Code

Windsurf

Claude Code

OpenAI Codex

Gemini CLI

Cline

Roo Code

100+ clientes compatibles

Instalar

npx -y atlascloud-mcp

Configuración

Agrega la siguiente configuración al archivo de configuración de MCP de tu IDE.

{
  "mcpServers": {
    "atlascloud": {
      "command": "npx",
      "args": [
        "-y",
        "atlascloud-mcp"
      ],
      "env": {
        "ATLASCLOUD_API_KEY": "your-api-key-here"
      }
    }
  }
}

Herramientas disponibles

atlas_generate_imageGenera imágenes a partir de indicaciones de texto.

atlas_generate_videoCrea videos a partir de texto o imágenes.

atlas_chatChatea con modelos de lenguaje de gran escala.

atlas_list_modelsExplora más de 400 modelos de IA disponibles.

atlas_quick_generateCreación de contenido en un solo paso con selección automática de modelo.

atlas_upload_mediaSube archivos locales para flujos de trabajo de API.

Más información

github.com/AtlasCloudAI/mcp-server

DeepSeek-V3.1

Introduction

DeepSeek-V3.1 is a hybrid model that supports both thinking mode and non-thinking mode. Compared to the previous version, this upgrade brings improvements in multiple aspects:

Hybrid thinking mode: One model supports both thinking mode and non-thinking mode by changing the chat template.
Smarter tool calling: Through post-training optimization, the model's performance in tool usage and agent tasks has significantly improved.
Higher thinking efficiency: DeepSeek-V3.1-Think achieves comparable answer quality to DeepSeek-R1-0528, while responding more quickly.

DeepSeek-V3.1 is post-trained on the top of DeepSeek-V3.1-Base, which is built upon the original V3 base checkpoint through a two-phase long context extension approach, following the methodology outlined in the original DeepSeek-V3 report. We have expanded our dataset by collecting additional long documents and substantially extending both training phases. The 32K extension phase has been increased 10-fold to 630B tokens, while the 128K extension phase has been extended by 3.3x to 209B tokens. Additionally, DeepSeek-V3.1 is trained using the UE8M0 FP8 scale data format to ensure compatibility with microscaling data formats.

Model Downloads

Model	#Total Params	#Activated Params	Context Length	Download
DeepSeek-V3.1-Base	671B	37B	128K	HuggingFace \| ModelScope
DeepSeek-V3.1	671B	37B	128K	HuggingFace \| ModelScope

Chat Template

The details of our chat template is described in tokenizer_config.json and assets/chat_template.jinja. Here is a brief description.

Thinking

v3.1 support both thinking and no-thinking. we now both support turning thinking with qwen mode

"chat_template_kwargs": {"enable_thinking": true}

and zai mode

"thinking":{"type":"enabled"}

ToolCall

Toolcall is supported in non-thinking mode. The format is:

<｜begin▁of▁sentence｜>{system prompt}{tool_description}<｜User｜>{query}<｜Assistant｜></think> where the tool_description is

## Tools
You have access to the following tools:

### {tool_name1}
Description: {description}

Parameters: {json.dumps(parameters)}

IMPORTANT: ALWAYS adhere to this exact format for tool use:
<｜tool▁calls▁begin｜><｜tool▁call▁begin｜>tool_call_name<｜tool▁sep｜>tool_call_arguments<｜tool▁call▁end｜>{{additional_tool_calls}}<｜tool▁calls▁end｜>

Where:
- `tool_call_name` must be an exact match to one of the available tools
- `tool_call_arguments` must be valid JSON that strictly follows the tool's Parameters Schema
- For multiple tool calls, chain them directly without separators or spaces

Code-Agent

We support various code agent frameworks. Please refer to the above toolcall format to create your own code agents. An example is shown in assets/code_agent_trajectory.html.

Search-Agent

We design a specific format for searching toolcall in thinking mode, to support search agent.

For complex questions that require accessing external or up-to-date information, DeepSeek-V3.1 can leverage a user-provided search tool through a multi-turn tool-calling process.

Please refer to the assets/search_tool_trajectory.html and assets/search_python_tool_trajectory.html for the detailed template.

Evaluation

Category	Benchmark (Metric)	DeepSeek V3.1-NonThinking	DeepSeek V3 0324	DeepSeek V3.1-Thinking	DeepSeek R1 0528
General
	MMLU-Redux (EM)	91.8	90.5	93.7	93.4
	MMLU-Pro (EM)	83.7	81.2	84.8	85.0
	GPQA-Diamond (Pass@1)	74.9	68.4	80.1	81.0
	Humanity's Last Exam (Pass@1)	-	-	15.9	17.7
Search Agent
	BrowseComp	-	-	30.0	8.9
	BrowseComp_zh	-	-	49.2	35.7
	Humanity's Last Exam (Python + Search)	-	-	29.8	24.8
	SimpleQA	-	-	93.4	92.3
Code
	LiveCodeBench (2408-2505) (Pass@1)	56.4	43.0	74.8	73.3
	Codeforces-Div1 (Rating)	-	-	2091	1930
	Aider-Polyglot (Acc.)	68.4	55.1	76.3	71.6
Code Agent
	SWE Verified (Agent mode)	66.0	45.4	-	44.6
	SWE-bench Multilingual (Agent mode)	54.5	29.3	-	30.5
	Terminal-bench (Terminus 1 framework)	31.3	13.3	-	5.7
Math
	AIME 2024 (Pass@1)	66.3	59.4	93.1	91.4
	AIME 2025 (Pass@1)	49.8	51.3	88.4	87.5
	HMMT 2025 (Pass@1)	33.5	29.2	84.2	79.4

Note:

Search agents are evaluated with our internal search framework, which uses a commercial search API + webpage filter + 128K context window. Seach agent results of R1-0528 are evaluated with a pre-defined workflow.
SWE-bench is evaluated with our internal code agent framework.
HLE is evaluated with the text-only subset.

Usage Example

import transformers

tokenizer = transformers.AutoTokenizer.from_pretrained("deepseek-ai/DeepSeek-V3.1")

messages = [
    {"role": "system", "content": "You are a helpful assistant"},
    {"role": "user", "content": "Who are you?"},
    {"role": "assistant", "content": "<think>Hmm</think>I am DeepSeek"},
    {"role": "user", "content": "1+1=?"}
]

tokenizer.apply_chat_template(messages, tokenize=False, thinking=True, add_generation_prompt=True)
# '<｜begin▁of▁sentence｜>You are a helpful assistant<｜User｜>Who are you?<｜Assistant｜></think>I am DeepSeek<｜end▁of▁sentence｜><｜User｜>1+1=?<｜Assistant｜><think>'

tokenizer.apply_chat_template(messages, tokenize=False, thinking=False, add_generation_prompt=True)
# '<｜begin▁of▁sentence｜>You are a helpful assistant<｜User｜>Who are you?<｜Assistant｜></think>I am DeepSeek<｜end▁of▁sentence｜><｜User｜>1+1=?<｜Assistant｜></think>'

How to Run Locally

The model structure of DeepSeek-V3.1 is the same as DeepSeek-V3. Please visit DeepSeek-V3 repo for more information about running this model locally.

License

This repository and the model weights are licensed under the MIT License.

Citation

@misc{deepseekai2024deepseekv3technicalreport,
      title={DeepSeek-V3 Technical Report}, 
      author={DeepSeek-AI},
      year={2024},
      eprint={2412.19437},
      archivePrefix={arXiv},
      primaryClass={cs.CL},
      url={https://arxiv.org/abs/2412.19437}, 
}

Contact

If you have any questions, please raise an issue or contact us at [email protected].

DeepSeek OCR API by DeepSeek

Parámetros

Ejemplo de código

Instalar

Autenticación

Encabezados HTTP

Enviar una solicitud

Schema de entrada

Ejemplo de cuerpo de solicitud

Schema de salida

Ejemplo de respuesta

Atlas Cloud Skills

Clientes compatibles

Instalar

Configurar clave de API

Funcionalidades

MCP Server

Clientes compatibles

Instalar

Configuración

Herramientas disponibles

DeepSeek-V3.1

Introduction

Model Downloads

Chat Template

Thinking

ToolCall

Code-Agent

Search-Agent

Evaluation

Usage Example

How to Run Locally

License

Citation

Contact

Explorar Modelos Similares

DeepSeek V4 Pro

DeepSeek V4 Flash

DeepSeek V3.2

DeepSeek V3.2 Exp

DeepSeek-V3.1

DeepSeek V3.1 Terminus

Grok 4.5

KAT Coder Air V2.5

KAT Coder Pro V2.5

Doubao Seed 2.1 Turbo

Doubao Seed 2.1 Pro

GLM 5.2

Kimi K2.7 Code

MiniMax M3

Grok Build 0.1

Grok 4.3

Una sola API para toda la IA multimedia.

Join our Discord community

Parámetros

Ejemplo de código

Instalar

Autenticación

Encabezados HTTP

Enviar una solicitud

Schema de entrada

Ejemplo de cuerpo de solicitud

Schema de salida

Ejemplo de respuesta

Atlas Cloud Skills

Clientes compatibles

Instalar

Configurar clave de API

Funcionalidades

MCP Server

Clientes compatibles

Instalar

Configuración

Herramientas disponibles