kwaivgi/kling-video-o3-std/text-to-video

Texto a Video

Kling Video O3 Std Text-to-Video API by Kuaishou

kwaivgi/kling-video-o3-std/text-to-video

Text-to-video

Kling Omni Video O3 (Standard) is Kuaishou's advanced unified multi-modal video model with MVL (Multi-modal Visual Language) technology. Generates high-quality videos from text prompts with natural motion and audio generation support.

Entrada

Prompt

Escribe @ para referenciar sujetos.

Multi-toma

Sujetos

MÁX: 6

Relación de Aspecto

Duración

Sonido

Salida

Inactivo

Los videos generados se mostrarán aquí

Configura los parámetros y haz clic en ejecutar para comenzar a generar

Cada ejecución costará $0.071. Con $10 puedes ejecutar aproximadamente 140 veces.

Puedes continuar con:

Seedance 2.0 Kling v3 Vidu Wan2.7

Parámetros

Ejemplo de código
import requests
import time

# Step 1: Start video generation
generate_url = "https://api.atlascloud.ai/api/v1/model/generateVideo"
headers = {
    "Content-Type": "application/json",
    "Authorization": "Bearer $ATLASCLOUD_API_KEY"
}
data = {
    "model": "kwaivgi/kling-video-o3-std/text-to-video",  # Required. model name. default: "kwaivgi/kling-video-o3-std/text-to-video-test"
    "aspect_ratio": "16:9",  # The aspect ratio of the generated video. options: 16:9 | 9:16 | 1:1
    "duration": 5,  # The duration of the generated media in seconds (3-15)
    "prompt": "A beautiful sunset over the ocean with gentle waves",  # The positive prompt for the generation
    "sound": True,  # Whether to generate audio for the video
    "multi_shot": False,  # Whether to enable multi-shot generation
    "shot_type": "example_value",  # Multi-shot mode. options: customize | intelligence
    "multi_prompt": [
        {
            "prompt": "example_prompt",
            "duration": 1
        }
    ],  # Per-shot storyboards
    "elements": [
        {
            "reference_type": "image_refer",
            "frontal_image": "example_frontal_image",
            "refer_images": [
                "https://example.com/image1.jpg"
            ],
            "refer_videos": [
                "https://example.com/image1.jpg"
            ],
            "element_name": "example_element_name",
            "element_description": "example_element_description",
            "element_id": 0
        }
    ],  # Subject references (Atlas naming; maps to Kling 'element_list')
}

generate_response = requests.post(generate_url, headers=headers, json=data)
generate_result = generate_response.json()
prediction_id = generate_result["data"]["id"]

# Step 2: Poll for result
poll_url = f"https://api.atlascloud.ai/api/v1/model/prediction/{prediction_id}"

def check_status():
    while True:
        response = requests.get(poll_url, headers={"Authorization": "Bearer $ATLASCLOUD_API_KEY"})
        result = response.json()

        if result["data"]["status"] in ["completed", "succeeded"]:
            print("Generated video:", result["data"]["outputs"][0])
            return result["data"]["outputs"][0]
        elif result["data"]["status"] == "failed":
            raise Exception(result["data"]["error"] or "Generation failed")
        else:
            # Still processing, wait 2 seconds
            time.sleep(2)

video_url = check_status()

Instalar

Instala el paquete de dependencias necesario.

pip install requests

Autenticación

Todas las solicitudes de API requieren autenticación mediante una clave de API. Puedes obtener tu clave de API desde el panel de Atlas Cloud.

export ATLASCLOUD_API_KEY="your-api-key-here"

Encabezados HTTP

import os

API_KEY = os.environ.get("ATLASCLOUD_API_KEY")
headers = {
    "Content-Type": "application/json",
    "Authorization": f"Bearer {API_KEY}"
}

Mantén tu clave de API segura

Nunca expongas tu clave de API en código del lado del cliente ni en repositorios públicos. Usa variables de entorno o un proxy de backend en su lugar.

Enviar una solicitud

import requests

url = "https://api.atlascloud.ai/api/v1/model/generateVideo"
headers = {
    "Content-Type": "application/json",
    "Authorization": "Bearer $ATLASCLOUD_API_KEY"
}
data = {
    "model": "your-model",
    "prompt": "A beautiful landscape"
}

response = requests.post(url, headers=headers, json=data)
print(response.json())

Enviar una solicitud

Envía una solicitud de generación asíncrona. La API devuelve un ID de predicción que puedes usar para verificar el estado y obtener el resultado.

POST/api/v1/model/generateVideo

Cuerpo de la solicitud

import requests

url = "https://api.atlascloud.ai/api/v1/model/generateVideo"
headers = {
    "Content-Type": "application/json",
    "Authorization": "Bearer $ATLASCLOUD_API_KEY"
}

data = {
    "model": "kwaivgi/kling-video-o3-std/text-to-video",
    "prompt": "A beautiful sunset over the ocean with gentle waves"
}

response = requests.post(url, headers=headers, json=data)
result = response.json()

print(f"Prediction ID: {result['data']['id']}")
print(f"Status: {result['data']['status']}")

Respuesta

{
  "code": 200,
  "data": {
    "id": "pred_abc123",
    "status": "processing",
    "model": "model-name",
    "created_at": "2025-01-01T00:00:00Z"
  }
}

Verificar estado

Consulta el endpoint de predicción para verificar el estado actual de tu solicitud.

GET/api/v1/model/prediction/{prediction_id}

Ejemplo de polling

import requests
import time

prediction_id = "pred_abc123"
url = f"https://api.atlascloud.ai/api/v1/model/prediction/{prediction_id}"
headers = { "Authorization": "Bearer $ATLASCLOUD_API_KEY" }

while True:
    response = requests.get(url, headers=headers)
    result = response.json()
    status = result["data"]["status"]
    print(f"Status: {status}")

    if status in ["completed", "succeeded"]:
        output_url = result["data"]["outputs"][0]
        print(f"Output URL: {output_url}")
        break
    elif status == "failed":
        print(f"Error: {result['data'].get('error', 'Unknown')}")
        break

    time.sleep(3)

Valores de estado

processingLa solicitud aún se está procesando.

completedLa generación está completa. Las salidas están disponibles.

succeededLa generación fue exitosa. Las salidas están disponibles.

failedLa generación falló. Verifica el campo de error.

Respuesta completada

{
  "data": {
    "id": "pred_abc123",
    "status": "completed",
    "outputs": [
      "https://storage.atlascloud.ai/outputs/result.mp4"
    ],
    "metrics": {
      "predict_time": 45.2
    },
    "created_at": "2025-01-01T00:00:00Z",
    "completed_at": "2025-01-01T00:00:10Z"
  }
}

Subir archivos

Sube archivos al almacenamiento de Atlas Cloud y obtén una URL que puedes usar en tus solicitudes de API. Usa multipart/form-data para subir.

POST/api/v1/model/uploadMedia

Ejemplo de carga

import requests

url = "https://api.atlascloud.ai/api/v1/model/uploadMedia"
headers = { "Authorization": "Bearer $ATLASCLOUD_API_KEY" }

with open("image.png", "rb") as f:
    files = {"file": ("image.png", f, "image/png")}
    response = requests.post(url, headers=headers, files=files)

result = response.json()
download_url = result["data"]["download_url"]
print(f"File URL: {download_url}")

Respuesta

{
  "data": {
    "download_url": "https://storage.atlascloud.ai/uploads/abc123/image.png",
    "file_name": "image.png",
    "content_type": "image/png",
    "size": 1024000
  }
}

Schema de entrada

Los siguientes parámetros se aceptan en el cuerpo de la solicitud.

Total: 9Obligatorio: 1Opcional: 8

modelstringrequired

model name

Default: "kwaivgi/kling-video-o3-std/text-to-video-test"

aspect_ratiostring

The aspect ratio of the generated video.

Default: "16:9"

16:99:161:1

durationinteger

The duration of the generated media in seconds (3-15).

Default: 5

3456789101112131415

promptstring

The positive prompt for the generation.

soundboolean

Whether to generate audio for the video.

Default: true

multi_shotboolean

Whether to enable multi-shot generation.

Default: false

shot_typestring

Multi-shot mode. customize = caller provides per-shot prompts; intelligence = model auto-splits the top-level prompt into shots. Required when multi_shot=true.

customizeintelligence

multi_promptarray[object]

Per-shot storyboards. Required when multi_shot=true and shot_type=customize. Sum of each shot's duration must equal the top-level duration; each shot duration must be >= 1.

Min items: 1Max items: 6

promptstringrequired

Prompt for this shot. Supports subject mentions like <<<element_1>>>.

durationintegerrequired

Duration of this shot in seconds (string, '1'~'15'). Sum of all shots must equal the top-level duration. Each shot duration >= 1.

Min: 1

elementsarray[object]

Subject references (Atlas naming; maps to Kling 'element_list'). Each item either references an existing subject by element_id, or creates a new one inline with element_name + reference_type + frontal_image / refer_images / refer_videos (Atlas wrapper feature — backend creates the element via Kling's element API, then injects the resulting element_id). Mention subjects in prompt with <<<element_N>>> (1-based, matches array position). For kling-v3-omni: up to 3 elements.

Max items: 6

reference_typestring

Reference media type for the new subject (Atlas wrapper).

Default: "image_refer"

image_refervideo_refer

frontal_imagestring

Frontal image URL of the new subject (required when reference_type=image_refer).

refer_imagesarray[string]

Reference image URLs of the new subject (used with reference_type=image_refer).

Max items: 4

refer_videosarray[string]

Reference video URLs of the new subject (used with reference_type=video_refer).

Max items: 4

element_namestring

Name of the new subject to create inline (Atlas wrapper). Mutually exclusive with element_id.

element_descriptionstring

Optional description of the new subject (Atlas wrapper).

element_idinteger

ID of an existing subject from the Kling element library (long / 64-bit integer). Mutually exclusive with element_name.

Ejemplo de cuerpo de solicitud

{
  "model": "kwaivgi/kling-video-o3-std/text-to-video",
  "aspect_ratio": "16:9",
  "duration": 5,
  "sound": true,
  "multi_shot": false
}

Schema de salida

La API devuelve una respuesta de predicción con las URL de salida generadas.

created_atstring

ISO timestamp of when the request was created.

idstring

Unique identifier for the prediction, the ID of the prediction to get.

modelstring

Model ID used for the prediction.

outputsarray

Array of URLs to the generated content (empty when status is not completed).

statusstring

Status of the task: created, processing, completed, or failed.

Ejemplo de respuesta

{
  "id": "pred_abc123",
  "status": "completed",
  "model": "model-name",
  "outputs": [
    "https://storage.atlascloud.ai/outputs/result.mp4"
  ],
  "metrics": {
    "predict_time": 45.2
  },
  "created_at": "2025-01-01T00:00:00Z",
  "completed_at": "2025-01-01T00:00:10Z"
}

Atlas Cloud Skills

Atlas Cloud Skills integra más de 400 modelos de IA directamente en tu asistente de codificación con IA. Un solo comando para instalar y luego usa lenguaje natural para generar imágenes, videos y chatear con LLM.

Clientes compatibles

Claude Code

OpenAI Codex

Gemini CLI

Cursor

Windsurf

VS Code

Trae

GitHub Copilot

Cline

Roo Code

Amp

Goose

Replit

40+ clientes compatibles

Instalar

npx skills add AtlasCloudAI/atlas-cloud-skills

Configurar clave de API

Obtén tu clave de API desde el panel de Atlas Cloud y configúrala como variable de entorno.

export ATLASCLOUD_API_KEY="your-api-key-here"

Funcionalidades

Una vez instalado, puedes usar lenguaje natural en tu asistente de IA para acceder a todos los modelos de Atlas Cloud.

Generación de imágenesGenera imágenes con modelos como Nano Banana 2, Z-Image y más.

Creación de videosCrea videos a partir de texto o imágenes con Kling, Vidu, Veo, etc.

Chat con LLMChatea con Qwen, DeepSeek y otros modelos de lenguaje de gran escala.

Carga de mediosSube archivos locales para flujos de trabajo de edición de imágenes e imagen a video.

Más información

github.com/AtlasCloudAI/atlas-cloud-skills

MCP Server

Atlas Cloud MCP Server conecta tu IDE con más de 400 modelos de IA a través del Model Context Protocol. Funciona con cualquier cliente compatible con MCP.

Clientes compatibles

Cursor

VS Code

Windsurf

Claude Code

OpenAI Codex

Gemini CLI

Cline

Roo Code

100+ clientes compatibles

Instalar

npx -y atlascloud-mcp

Configuración

Agrega la siguiente configuración al archivo de configuración de MCP de tu IDE.

{
  "mcpServers": {
    "atlascloud": {
      "command": "npx",
      "args": [
        "-y",
        "atlascloud-mcp"
      ],
      "env": {
        "ATLASCLOUD_API_KEY": "your-api-key-here"
      }
    }
  }
}

Herramientas disponibles

atlas_generate_imageGenera imágenes a partir de indicaciones de texto.

atlas_generate_videoCrea videos a partir de texto o imágenes.

atlas_chatChatea con modelos de lenguaje de gran escala.

atlas_list_modelsExplora más de 400 modelos de IA disponibles.

atlas_quick_generateCreación de contenido en un solo paso con selección automática de modelo.

atlas_upload_mediaSube archivos locales para flujos de trabajo de API.

Más información

github.com/AtlasCloudAI/mcp-server

API Schema

{
  "components": {
    "schemas": {
      "Input": {
        "properties": {
          "model": {
            "type": "string",
            "description": "model name",
            "default": "kwaivgi/kling-video-o3-std/text-to-video-test"
          },
          "aspect_ratio": {
            "default": "16:9",
            "description": "The aspect ratio of the generated video.",
            "enum": [
              "16:9",
              "9:16",
              "1:1"
            ],
            "type": "string"
          },
          "duration": {
            "default": 5,
            "description": "The duration of the generated media in seconds (3-15).",
            "enum": [
              3,
              4,
              5,
              6,
              7,
              8,
              9,
              10,
              11,
              12,
              13,
              14,
              15
            ],
            "type": "integer",
            "x-ui-component": "select"
          },
          "prompt": {
            "description": "The positive prompt for the generation.",
            "type": "string"
          },
          "sound": {
            "default": true,
            "description": "Whether to generate audio for the video.",
            "type": "boolean"
          },
          "multi_shot": {
            "type": "boolean",
            "description": "Whether to enable multi-shot generation.",
            "default": false
          },
          "shot_type": {
            "type": "string",
            "description": "Multi-shot mode. customize = caller provides per-shot prompts; intelligence = model auto-splits the top-level prompt into shots. Required when multi_shot=true.",
            "enum": [
              "customize",
              "intelligence"
            ],
            "x-ui-component": "select"
          },
          "multi_prompt": {
            "type": "array",
            "description": "Per-shot storyboards. Required when multi_shot=true and shot_type=customize. Sum of each shot's duration must equal the top-level duration; each shot duration must be >= 1.",
            "minItems": 1,
            "maxItems": 6,
            "items": {
              "type": "object",
              "required": [
                "prompt",
                "duration"
              ],
              "properties": {
                "prompt": {
                  "type": "string",
                  "description": "Prompt for this shot. Supports subject mentions like <<<element_1>>>.",
                  "maxLength": 512
                },
                "duration": {
                  "type": "integer",
                  "description": "Duration of this shot in seconds (string, '1'~'15'). Sum of all shots must equal the top-level duration. Each shot duration >= 1.",
                  "minimum": 1
                }
              }
            }
          },
          "elements": {
            "type": "array",
            "description": "Subject references (Atlas naming; maps to Kling 'element_list'). Each item either references an existing subject by element_id, or creates a new one inline with element_name + reference_type + frontal_image / refer_images / refer_videos (Atlas wrapper feature — backend creates the element via Kling's element API, then injects the resulting element_id). Mention subjects in prompt with <<<element_N>>> (1-based, matches array position). For kling-v3-omni: up to 3 elements.",
            "maxItems": 6,
            "items": {
              "type": "object",
              "properties": {
                "reference_type": {
                  "type": "string",
                  "description": "Reference media type for the new subject (Atlas wrapper).",
                  "enum": [
                    "image_refer",
                    "video_refer"
                  ],
                  "default": "image_refer"
                },
                "frontal_image": {
                  "type": "string",
                  "description": "Frontal image URL of the new subject (required when reference_type=image_refer)."
                },
                "refer_images": {
                  "type": "array",
                  "description": "Reference image URLs of the new subject (used with reference_type=image_refer).",
                  "items": {
                    "type": "string"
                  },
                  "maxItems": 4
                },
                "refer_videos": {
                  "type": "array",
                  "description": "Reference video URLs of the new subject (used with reference_type=video_refer).",
                  "items": {
                    "type": "string"
                  },
                  "maxItems": 4
                },
                "element_name": {
                  "type": "string",
                  "description": "Name of the new subject to create inline (Atlas wrapper). Mutually exclusive with element_id."
                },
                "element_description": {
                  "type": "string",
                  "description": "Optional description of the new subject (Atlas wrapper).",
                  "x-hidden": true
                },
                "element_id": {
                  "type": "integer",
                  "description": "ID of an existing subject from the Kling element library (long / 64-bit integer). Mutually exclusive with element_name.",
                  "x-hidden": true
                }
              },
              "allOf": [
                {
                  "if": {
                    "properties": {
                      "reference_type": {
                        "const": "video_refer"
                      }
                    },
                    "required": [
                      "reference_type"
                    ]
                  },
                  "then": {
                    "required": [
                      "refer_videos"
                    ],
                    "properties": {
                      "frontal_image": {
                        "x-hidden": true
                      },
                      "refer_images": {
                        "x-hidden": true
                      },
                      "refer_videos": {
                        "minItems": 1
                      }
                    }
                  },
                  "else": {
                    "required": [
                      "frontal_image",
                      "refer_images"
                    ],
                    "properties": {
                      "refer_videos": {
                        "x-hidden": true
                      },
                      "refer_images": {
                        "minItems": 1
                      }
                    }
                  }
                }
              ]
            }
          }
        },
        "allOf": [
          {
            "if": {
              "properties": {
                "multi_shot": {
                  "const": true
                }
              },
              "required": [
                "multi_shot"
              ]
            },
            "then": {
              "required": [
                "shot_type"
              ]
            },
            "else": {
              "required": [
                "prompt"
              ],
              "properties": {
                "shot_type": {
                  "x-hidden": true
                },
                "multi_prompt": {
                  "x-hidden": true
                }
              }
            }
          },
          {
            "if": {
              "properties": {
                "shot_type": {
                  "const": "customize"
                }
              },
              "required": [
                "shot_type"
              ]
            },
            "then": {
              "required": [
                "multi_prompt"
              ],
              "properties": {
                "prompt": {
                  "x-hidden": true,
                  "x-hidden-hint": "Customize mode uses the per-shot prompts below. Your prompt is saved and reappears when you leave Customize mode."
                }
              }
            }
          },
          {
            "if": {
              "properties": {
                "shot_type": {
                  "const": "intelligence"
                }
              },
              "required": [
                "shot_type"
              ]
            },
            "then": {
              "required": [
                "prompt"
              ],
              "properties": {
                "multi_prompt": {
                  "x-hidden": true
                }
              }
            }
          }
        ],
        "required": [
          "model"
        ],
        "type": "object",
        "x-order-properties": [
          "model",
          "prompt",
          "multi_shot",
          "shot_type",
          "multi_prompt",
          "elements",
          "aspect_ratio",
          "duration",
          "sound"
        ]
      },
      "PredictionResponse": {
        "properties": {
          "created_at": {
            "description": "ISO timestamp of when the request was created.",
            "format": "date-time",
            "type": "string"
          },
          "has_nsfw_contents": {
            "description": "Array of boolean values indicating NSFW detection for each output.",
            "items": {
              "type": "boolean"
            },
            "type": "array"
          },
          "id": {
            "description": "Unique identifier for the prediction, the ID of the prediction to get.",
            "type": "string"
          },
          "model": {
            "description": "Model ID used for the prediction.",
            "type": "string"
          },
          "outputs": {
            "description": "Array of URLs to the generated content (empty when status is not completed).",
            "items": {
              "type": "string"
            },
            "type": "array"
          },
          "status": {
            "description": "Status of the task: created, processing, completed, or failed.",
            "type": "string"
          },
          "urls": {
            "description": "Object containing related API endpoints.",
            "type": "object"
          }
        },
        "type": "object"
      }
    },
    "securitySchemes": {
      "apiKeyAuth": {
        "in": "header",
        "name": "Authorization",
        "type": "apiKey"
      }
    }
  },
  "info": {
    "description": "The AtlasCloud API.",
    "title": "AtlasCloud API",
    "version": "1.0.0"
  },
  "openapi": "3.0.0",
  "paths": {
    "/api/v1/model/generateVideo": {
      "post": {
        "requestBody": {
          "content": {
            "application/json": {
              "schema": {
                "$ref": "#/components/schemas/Input"
              }
            }
          },
          "required": true
        },
        "responses": {
          "200": {
            "content": {
              "application/json": {
                "schema": {
                  "$ref": "#/components/schemas/PredictionResponse"
                }
              }
            },
            "description": "The request status."
          }
        }
      },
      "x-api-name": "model_run"
    },
    "/api/v1/model/result/{request_id}": {
      "get": {
        "parameters": [
          {
            "in": "path",
            "name": "request_id",
            "required": true,
            "schema": {
              "description": "Request ID",
              "type": "string"
            }
          }
        ],
        "responses": {
          "200": {
            "content": {
              "application/json": {
                "schema": {
                  "$ref": "#/components/schemas/PredictionResponse"
                }
              }
            },
            "description": "Result of the request."
          }
        }
      },
      "x-api-name": "model_result"
    }
  },
  "servers": [
    {
      "url": "https://api.atlascloud.ai"
    }
  ],
  "model": "kwaivgi/kling-video-o3-std/text-to-video"
}

Plantilla de Prompt Compatible con LLM

# kwaivgi/kling-video-o3-std/text-to-video

> Kling Omni Video O3 (Standard) is Kuaishou's advanced unified multi-modal video model with MVL (Multi-modal Visual Language) technology. Generates high-quality videos from text prompts with natural motion and audio generation support.


## Overview

- **Submit endpoint (POST)**: `https://api.atlascloud.ai/api/v1/model/generateVideo` — start an async generation; returns a `prediction_id`
- **Poll endpoint (GET)**: `https://api.atlascloud.ai/api/v1/model/prediction/{prediction_id}` — poll this until the prediction finishes
- **Model ID**: `kwaivgi/kling-video-o3-std/text-to-video`


## API Information

This model can be used via our HTTP API or more conveniently via our client libraries.
See the input and output schema below, as well as the usage examples.


### Input Schema

The API accepts the following input parameters:

- **`model`** (`string`, _required_):
  model name
  - Default: `"kwaivgi/kling-video-o3-std/text-to-video-test"`

- **`prompt`** (`string`, _optional_):
  The positive prompt for the generation.

- **`multi_shot`** (`boolean`, _optional_):
  Whether to enable multi-shot generation.
  - Default: `false`

- **`shot_type`** (`string`, _optional_):
  Multi-shot mode. customize = caller provides per-shot prompts; intelligence = model auto-splits the top-level prompt into shots. Required when multi_shot=true.
  - Options: "customize", "intelligence"

- **`multi_prompt`** (`array[object]`, _optional_):
  Per-shot storyboards. Required when multi_shot=true and shot_type=customize. Sum of each shot's duration must equal the top-level duration; each shot duration must be >= 1.
  - Min items: 1
  - Max items: 6
  - Item properties:
    - **`prompt`** (`string`, _required_):
      Prompt for this shot. Supports subject mentions like <<<element_1>>>.

    - **`duration`** (`integer`, _required_):
      Duration of this shot in seconds (string, '1'~'15'). Sum of all shots must equal the top-level duration. Each shot duration >= 1.
      - Min: 1


- **`elements`** (`array[object]`, _optional_):
  Subject references (Atlas naming; maps to Kling 'element_list'). Each item either references an existing subject by element_id, or creates a new one inline with element_name + reference_type + frontal_image / refer_images / refer_videos (Atlas wrapper feature — backend creates the element via Kling's element API, then injects the resulting element_id). Mention subjects in prompt with <<<element_N>>> (1-based, matches array position). For kling-v3-omni: up to 3 elements.
  - Max items: 6
  - Item properties:
    - **`reference_type`** (`string`, _optional_):
      Reference media type for the new subject (Atlas wrapper).
      - Default: `"image_refer"`
      - Options: "image_refer", "video_refer"

    - **`frontal_image`** (`string`, _optional_):
      Frontal image URL of the new subject (required when reference_type=image_refer).

    - **`refer_images`** (`array[string]`, _optional_):
      Reference image URLs of the new subject (used with reference_type=image_refer).
      - Max items: 4

    - **`refer_videos`** (`array[string]`, _optional_):
      Reference video URLs of the new subject (used with reference_type=video_refer).
      - Max items: 4

    - **`element_name`** (`string`, _optional_):
      Name of the new subject to create inline (Atlas wrapper). Mutually exclusive with element_id.

    - **`element_description`** (`string`, _optional_):
      Optional description of the new subject (Atlas wrapper).

    - **`element_id`** (`integer`, _optional_):
      ID of an existing subject from the Kling element library (long / 64-bit integer). Mutually exclusive with element_name.


- **`aspect_ratio`** (`string`, _optional_):
  The aspect ratio of the generated video.
  - Default: `"16:9"`
  - Options: "16:9", "9:16", "1:1"

- **`duration`** (`integer`, _optional_):
  The duration of the generated media in seconds (3-15).
  - Default: `5`
  - Options: 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15

- **`sound`** (`boolean`, _optional_):
  Whether to generate audio for the video.
  - Default: `true`



**Required Parameters Example**:

```json
{
  "model": "kwaivgi/kling-video-o3-std/text-to-video"
}
```


**Full Example**:

```json
{
  "model": "kwaivgi/kling-video-o3-std/text-to-video",
  "prompt": "",
  "multi_shot": false,
  "shot_type": "customize",
  "multi_prompt": [
    {
      "prompt": "",
      "duration": 1
    }
  ],
  "elements": [
    {
      "reference_type": "image_refer",
      "frontal_image": "",
      "refer_images": [
        ""
      ],
      "refer_videos": [
        ""
      ],
      "element_name": "",
      "element_description": "",
      "element_id": 0
    }
  ],
  "aspect_ratio": "16:9",
  "duration": 5,
  "sound": true
}
```


### Output Schema

The API returns the following output format:


- **`created_at`** (`string`, _optional_):
  ISO timestamp of when the request was created.

- **`has_nsfw_contents`** (`array[boolean]`, _optional_):
  Array of boolean values indicating NSFW detection for each output.

- **`id`** (`string`, _optional_):
  Unique identifier for the prediction, the ID of the prediction to get.

- **`model`** (`string`, _optional_):
  Model ID used for the prediction.

- **`outputs`** (`array[string]`, _optional_):
  Array of URLs to the generated content (empty when status is not completed).

- **`status`** (`string`, _optional_):
  Status of the task: created, processing, completed, or failed.

- **`urls`** (`object`, _optional_):
  Object containing related API endpoints.



**Example Response**:

```json
{
  "created_at": "",
  "has_nsfw_contents": [],
  "id": "",
  "model": "",
  "outputs": [
    ""
  ],
  "status": "",
  "urls": {}
}
```


## Usage Examples

### cURL

```bash
# Step 1: Start generation (async)
curl -X POST "https://api.atlascloud.ai/api/v1/model/generateVideo" \
  -H "Authorization: Bearer $ATLASCLOUD_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{
  "model": "kwaivgi/kling-video-o3-std/text-to-video",
  "prompt": "",
  "multi_shot": false,
  "shot_type": "customize",
  "multi_prompt": [
    {
      "prompt": "",
      "duration": 1
    }
  ],
  "elements": [
    {
      "reference_type": "image_refer",
      "frontal_image": "",
      "refer_images": [
        ""
      ],
      "refer_videos": [
        ""
      ],
      "element_name": "",
      "element_description": "",
      "element_id": 0
    }
  ],
  "aspect_ratio": "16:9",
  "duration": 5,
  "sound": true
}'

# Response will contain: {"code": 200, "data": {"id": "prediction_id", "status": "processing"}}

# Step 2: Poll for result (replace {prediction_id} with the id returned above)
curl -X GET "https://api.atlascloud.ai/api/v1/model/prediction/{prediction_id}" \
  -H "Authorization: Bearer $ATLASCLOUD_API_KEY"

# Keep polling until status is "completed", "succeeded" or "failed"
# When completed, outputs will contain the generated content URL(s)
```

## Additional Resources

### Documentation

- [Model Playground](https://www.atlascloud.ai/models/kwaivgi/kling-video-o3-std/text-to-video)

A rain-soaked cyberpunk metropolis at night, massive holographic advertisements glowing above crowded streets. Flying vehicles move between towering buildings under dark clouds. High contrast lighting, deep shadows, reflective surfaces, atmospheric haze, cinematic composition, anamorphic lens flare, ultra detailed.

Cargando...

Kling Video O3 Standard Text-to-Video

Kling Video O3 Standard is Kuaishou's advanced text-to-video model in the O3 family, delivering high-quality cinematic video from text descriptions. With optional synchronized sound generation, multiple aspect ratios, and flexible duration from 3 to 15 seconds, it offers a strong balance of quality and cost.

Why Choose This?

O3-level quality Advanced visual fidelity and motion realism beyond V3.0 models.

Sound generation Optional synchronized sound effects generated alongside the video.

Flexible duration Generate videos from 3 to 15 seconds — any length you need.

Multiple aspect ratios Support for 16:9, 9:16, and 1:1 to fit any platform.

Prompt Enhancer Built-in tool to automatically improve your video descriptions.

Parameters

Parameter	Required	Description
prompt	Yes	Text description of the video scene and motion
aspect_ratio	No	Output ratio: 16:9 (default), 9:16, 1:1
duration	No	Video length: 3-15 seconds (default: 5)
sound	No	Generate synchronized sound (default: disabled)

How to Use

Run — submit and download your video.
Enable sound (optional) — generate synchronized audio with the video.
Set duration — choose any length from 3 to 15 seconds.
Select aspect ratio — match your target platform.
Write your prompt — describe the scene, characters, motion, and style in detail.

Best Use Cases

Long-Form Scenes — Up to 15 seconds for extended scene development.
Concept Visualization — Bring creative ideas to life from text.
Marketing Videos — Produce promotional content with optional sound.
Social Media — Create engaging videos for TikTok, Reels, and Stories.
Professional Content — High-quality videos at a more accessible price than O3 Pro.

Pro Tips

Use O3 Standard for regular production; upgrade to O3 Pro for maximum quality.
Use shorter durations (3-5s) for testing, longer (10-15s) for final production.
Be specific about camera movements, lighting, and atmosphere for best results.
Enable sound for a complete video experience with synchronized audio.
Match aspect ratio to your platform: 16:9 for YouTube, 9:16 for TikTok/Reels, 1:1 for Instagram.
Use the Prompt Enhancer to refine your descriptions automatically.

Notes

Duration supports any value from 3 to 15 seconds.
Only prompt is required; other parameters have defaults.

Kling V3.0 Pro Text-to-Video — V3.0 Pro quality text-to-video.
Kling V3.0 Standard Text-to-Video — V3.0 Standard at lower cost.
Kling Video O3 Pro Image-to-Video — O3 Pro quality image-to-video.

Explorar Modelos Similares

NEW

Texto a Video

TURBO

Kling V3.0 Turbo Text-to-Video

Kling V3.0 Turbo Text-to-Video generates dynamic cinematic videos from text prompts using MVL technology. Supports first/last frame control and audio generation.

Kling V3.0 Turbo Image-to-Video

Kling V3.0 Turbo Image-to-Video transforms static images into dynamic cinematic videos using MVL technology. Supports first/last frame control and audio generation.

Kling Video O3 4K Text-to-Video

Kling Omni Video O3 (4K) is Kuaishou advanced unified multi-modal video model with MVL (Multi-modal Visual Language) technology. Generates high-quality videos from text prompts with natural motion and audio generation support.

Kling Video O3 4K Image-to-Video

Kling Omni Video O3 (4K) Image-to-Video transforms static images into dynamic cinematic videos using MVL technology. Supports first/last frame control and audio generation.

Kling v3.0 4K Image-to-Video

Kling v3.0 4K Image-to-Video model by Kuaishou. High-quality video generation from images.

Kling v3.0 Std Image-to-Video

Kling v3.0 Standard Image-to-Video model by Kuaishou. High-quality video generation from images.

Kling v3.0 Pro Image-to-Video

Kling v3.0 Professional Image-to-Video model by Kuaishou. Premium quality video generation from images with advanced features.

Kling v3.0 Pro Text-to-Video

Kling v3.0 Professional Text-to-Video model by Kuaishou. Premium quality video generation from text prompts with advanced features.

Kling v3.0 4K Text-to-Video

Kling v3.0 4K Text-to-Video model by Kuaishou. High-quality video generation from text prompts.

Kling v3.0 Std Text-to-Video

Kling v3.0 Standard Text-to-Video model by Kuaishou. High-quality video generation from text prompts.

Kling v2.6 Pro Avatar

Kling V2 AI Avatar Pro generates high-quality AI avatar videos with clean detail, stable motion, and strong identity consistency—ideal for profiles, intros, and social content.

Kling v2.6 Std Avatar

Kling AI Avatar generates high-quality AI avatar videos for profiles, intros, and social content, delivering clean detail and cinematic motion with reliable prompt adherence.

Kling v2.6 Pro Motion Control

Kling 2.6 Pro Motion Control turns reference motion clips (dance, action, gesture) into smooth, realistic animations. Upload a character image (or source video) and a motion video; the model transfers the movement while preserving identity and temporal consistency.

Kling v2.6 Std Motion Control

Kling 2.6 Standard Motion Control transfers motion from reference videos to animate still images. Upload a character image and a motion clip (dance, action, gesture), and the model extracts the movement to generate smooth, realistic video.

Kling Video O3 Pro Video-Edit

Kling Omni Video O3 Video-Edit enables conversational video editing through natural language commands. Professional quality with object removal/replacement, background changes, and effects.

Kling Video O3 Pro Text-to-Video

Kling Omni Video O3 is Kuaishou's advanced unified multi-modal video model with MVL (Multi-modal Visual Language) technology. Professional quality with enhanced motion and detail.

From$0.112/segundo

$0.095/segundo

-15%

Una sola API para toda la IA multimedia.

Explorar Todos los Modelos

Kling Video O3 Std Text-to-Video API by Kuaishou

Entrada

Salida

Parámetros

Ejemplo de código

Instalar

Autenticación

Encabezados HTTP

Enviar una solicitud

Enviar una solicitud

Cuerpo de la solicitud

Respuesta

Verificar estado

Ejemplo de polling

Valores de estado

Respuesta completada

Subir archivos

Ejemplo de carga

Respuesta

Schema de entrada

Ejemplo de cuerpo de solicitud

Schema de salida

Ejemplo de respuesta

Atlas Cloud Skills

Clientes compatibles

Instalar

Configurar clave de API

Funcionalidades

MCP Server

Clientes compatibles

Instalar

Configuración

Herramientas disponibles

API Schema

Plantilla de Prompt Compatible con LLM

Kling Video O3 Standard Text-to-Video

Why Choose This?

Parameters

How to Use

Best Use Cases

Pro Tips

Notes

Related Models

Explorar Modelos Similares

Kling V3.0 Turbo Text-to-Video

Kling V3.0 Turbo Image-to-Video

Kling Video O3 4K Text-to-Video

Kling Video O3 4K Image-to-Video

Kling v3.0 4K Image-to-Video

Kling v3.0 Std Image-to-Video

Kling v3.0 Pro Image-to-Video

Kling v3.0 Pro Text-to-Video

Kling v3.0 4K Text-to-Video

Kling v3.0 Std Text-to-Video

Kling v2.6 Pro Avatar

Kling v2.6 Std Avatar

Kling v2.6 Pro Motion Control

Kling v2.6 Std Motion Control

Kling Video O3 Pro Video-Edit

Kling Video O3 Pro Text-to-Video

Una sola API para toda la IA multimedia.

Join our Discord community

Entrada

Salida

Parámetros

Ejemplo de código

Instalar

Autenticación

Encabezados HTTP

Enviar una solicitud

Enviar una solicitud

Cuerpo de la solicitud

Respuesta

Verificar estado

Ejemplo de polling

Valores de estado

Respuesta completada

Subir archivos

Ejemplo de carga

Respuesta