kwaivgi/kling-video-o3-std/text-to-video

testo-in-video

Kling Video O3 Std Text-to-Video API by Kuaishou

kwaivgi/kling-video-o3-std/text-to-video

Text-to-video

Kling Omni Video O3 (Standard) is Kuaishou's advanced unified multi-modal video model with MVL (Multi-modal Visual Language) technology. Generates high-quality videos from text prompts with natural motion and audio generation support.

INPUT

Prompt

Digita @ per citare un soggetto.

Multi-inquadratura

Soggetti

MAX: 6

Proporzioni

Durata

Audio

OUTPUT

In attesa

I video generati appariranno qui

Configura le impostazioni e clicca Esegui per iniziare

La tua richiesta costerà $0.071 per esecuzione. Con $10 puoi eseguire questo modello circa 140 volte.

Ecco cosa puoi fare dopo:

Seedance 2.0 Kling v3 Vidu Wan2.7

Parametri

Esempio di codice
import requests
import time

# Step 1: Start video generation
generate_url = "https://api.atlascloud.ai/api/v1/model/generateVideo"
headers = {
    "Content-Type": "application/json",
    "Authorization": "Bearer $ATLASCLOUD_API_KEY"
}
data = {
    "model": "kwaivgi/kling-video-o3-std/text-to-video",  # Required. model name. default: "kwaivgi/kling-video-o3-std/text-to-video-test"
    "aspect_ratio": "16:9",  # The aspect ratio of the generated video. options: 16:9 | 9:16 | 1:1
    "duration": 5,  # The duration of the generated media in seconds (3-15)
    "prompt": "A beautiful sunset over the ocean with gentle waves",  # The positive prompt for the generation
    "sound": True,  # Whether to generate audio for the video
    "multi_shot": False,  # Whether to enable multi-shot generation
    "shot_type": "example_value",  # Multi-shot mode. options: customize | intelligence
    "multi_prompt": [
        {
            "prompt": "example_prompt",
            "duration": 1
        }
    ],  # Per-shot storyboards
    "elements": [
        {
            "reference_type": "image_refer",
            "frontal_image": "example_frontal_image",
            "refer_images": [
                "https://example.com/image1.jpg"
            ],
            "refer_videos": [
                "https://example.com/image1.jpg"
            ],
            "element_name": "example_element_name",
            "element_description": "example_element_description",
            "element_id": 0
        }
    ],  # Subject references (Atlas naming; maps to Kling 'element_list')
}

generate_response = requests.post(generate_url, headers=headers, json=data)
generate_result = generate_response.json()
prediction_id = generate_result["data"]["id"]

# Step 2: Poll for result
poll_url = f"https://api.atlascloud.ai/api/v1/model/prediction/{prediction_id}"

def check_status():
    while True:
        response = requests.get(poll_url, headers={"Authorization": "Bearer $ATLASCLOUD_API_KEY"})
        result = response.json()

        if result["data"]["status"] in ["completed", "succeeded"]:
            print("Generated video:", result["data"]["outputs"][0])
            return result["data"]["outputs"][0]
        elif result["data"]["status"] == "failed":
            raise Exception(result["data"]["error"] or "Generation failed")
        else:
            # Still processing, wait 2 seconds
            time.sleep(2)

video_url = check_status()

Installa

Installa il pacchetto di dipendenze richiesto.

pip install requests

Autenticazione

Tutte le richieste API richiedono l'autenticazione tramite una chiave API. Puoi ottenere la tua chiave API dalla dashboard di Atlas Cloud.

export ATLASCLOUD_API_KEY="your-api-key-here"

Header HTTP

import os

API_KEY = os.environ.get("ATLASCLOUD_API_KEY")
headers = {
    "Content-Type": "application/json",
    "Authorization": f"Bearer {API_KEY}"
}

Proteggi la tua chiave API

Non esporre mai la tua chiave API nel codice lato client o nei repository pubblici. Utilizza invece variabili d'ambiente o un proxy backend.

Invia una richiesta

import requests

url = "https://api.atlascloud.ai/api/v1/model/generateVideo"
headers = {
    "Content-Type": "application/json",
    "Authorization": "Bearer $ATLASCLOUD_API_KEY"
}
data = {
    "model": "your-model",
    "prompt": "A beautiful landscape"
}

response = requests.post(url, headers=headers, json=data)
print(response.json())

Invia una richiesta

Invia una richiesta di generazione asincrona. L'API restituisce un ID di previsione che puoi usare per controllare lo stato e recuperare il risultato.

POST/api/v1/model/generateVideo

Corpo della richiesta

import requests

url = "https://api.atlascloud.ai/api/v1/model/generateVideo"
headers = {
    "Content-Type": "application/json",
    "Authorization": "Bearer $ATLASCLOUD_API_KEY"
}

data = {
    "model": "kwaivgi/kling-video-o3-std/text-to-video",
    "prompt": "A beautiful sunset over the ocean with gentle waves"
}

response = requests.post(url, headers=headers, json=data)
result = response.json()

print(f"Prediction ID: {result['data']['id']}")
print(f"Status: {result['data']['status']}")

Risposta

{
  "code": 200,
  "data": {
    "id": "pred_abc123",
    "status": "processing",
    "model": "model-name",
    "created_at": "2025-01-01T00:00:00Z"
  }
}

Controlla lo stato

Interroga l'endpoint di previsione per verificare lo stato attuale della tua richiesta.

GET/api/v1/model/prediction/{prediction_id}

Esempio di polling

import requests
import time

prediction_id = "pred_abc123"
url = f"https://api.atlascloud.ai/api/v1/model/prediction/{prediction_id}"
headers = { "Authorization": "Bearer $ATLASCLOUD_API_KEY" }

while True:
    response = requests.get(url, headers=headers)
    result = response.json()
    status = result["data"]["status"]
    print(f"Status: {status}")

    if status in ["completed", "succeeded"]:
        output_url = result["data"]["outputs"][0]
        print(f"Output URL: {output_url}")
        break
    elif status == "failed":
        print(f"Error: {result['data'].get('error', 'Unknown')}")
        break

    time.sleep(3)

Valori di stato

processingLa richiesta è ancora in fase di elaborazione.

completedGenerazione completata. Gli output sono disponibili.

succeededGenerazione riuscita. Gli output sono disponibili.

failedLa generazione è fallita. Controlla il campo errore.

Risposta completata

{
  "data": {
    "id": "pred_abc123",
    "status": "completed",
    "outputs": [
      "https://storage.atlascloud.ai/outputs/result.mp4"
    ],
    "metrics": {
      "predict_time": 45.2
    },
    "created_at": "2025-01-01T00:00:00Z",
    "completed_at": "2025-01-01T00:00:10Z"
  }
}

Carica file

Carica file nello storage Atlas Cloud e ottieni un URL utilizzabile nelle tue richieste API. Usa multipart/form-data per il caricamento.

POST/api/v1/model/uploadMedia

Esempio di caricamento

import requests

url = "https://api.atlascloud.ai/api/v1/model/uploadMedia"
headers = { "Authorization": "Bearer $ATLASCLOUD_API_KEY" }

with open("image.png", "rb") as f:
    files = {"file": ("image.png", f, "image/png")}
    response = requests.post(url, headers=headers, files=files)

result = response.json()
download_url = result["data"]["download_url"]
print(f"File URL: {download_url}")

Risposta

{
  "data": {
    "download_url": "https://storage.atlascloud.ai/uploads/abc123/image.png",
    "file_name": "image.png",
    "content_type": "image/png",
    "size": 1024000
  }
}

Schema di input

I seguenti parametri sono accettati nel corpo della richiesta.

Totale: 9Obbligatorio: 1Opzionale: 8

modelstringrequired

model name

Default: "kwaivgi/kling-video-o3-std/text-to-video-test"

aspect_ratiostring

The aspect ratio of the generated video.

Default: "16:9"

16:99:161:1

durationinteger

The duration of the generated media in seconds (3-15).

Default: 5

3456789101112131415

promptstring

The positive prompt for the generation.

soundboolean

Whether to generate audio for the video.

Default: true

multi_shotboolean

Whether to enable multi-shot generation.

Default: false

shot_typestring

Multi-shot mode. customize = caller provides per-shot prompts; intelligence = model auto-splits the top-level prompt into shots. Required when multi_shot=true.

customizeintelligence

multi_promptarray[object]

Per-shot storyboards. Required when multi_shot=true and shot_type=customize. Sum of each shot's duration must equal the top-level duration; each shot duration must be >= 1.

Min items: 1Max items: 6

promptstringrequired

Prompt for this shot. Supports subject mentions like <<<element_1>>>.

durationintegerrequired

Duration of this shot in seconds (string, '1'~'15'). Sum of all shots must equal the top-level duration. Each shot duration >= 1.

Min: 1

elementsarray[object]

Subject references (Atlas naming; maps to Kling 'element_list'). Each item either references an existing subject by element_id, or creates a new one inline with element_name + reference_type + frontal_image / refer_images / refer_videos (Atlas wrapper feature — backend creates the element via Kling's element API, then injects the resulting element_id). Mention subjects in prompt with <<<element_N>>> (1-based, matches array position). For kling-v3-omni: up to 3 elements.

Max items: 6

reference_typestring

Reference media type for the new subject (Atlas wrapper).

Default: "image_refer"

image_refervideo_refer

frontal_imagestring

Frontal image URL of the new subject (required when reference_type=image_refer).

refer_imagesarray[string]

Reference image URLs of the new subject (used with reference_type=image_refer).

Max items: 4

refer_videosarray[string]

Reference video URLs of the new subject (used with reference_type=video_refer).

Max items: 4

element_namestring

Name of the new subject to create inline (Atlas wrapper). Mutually exclusive with element_id.

element_descriptionstring

Optional description of the new subject (Atlas wrapper).

element_idinteger

ID of an existing subject from the Kling element library (long / 64-bit integer). Mutually exclusive with element_name.

Esempio di corpo della richiesta

{
  "model": "kwaivgi/kling-video-o3-std/text-to-video",
  "aspect_ratio": "16:9",
  "duration": 5,
  "sound": true,
  "multi_shot": false
}

Schema di output

L'API restituisce una risposta di previsione con gli URL degli output generati.

created_atstring

ISO timestamp of when the request was created.

idstring

Unique identifier for the prediction, the ID of the prediction to get.

modelstring

Model ID used for the prediction.

outputsarray

Array of URLs to the generated content (empty when status is not completed).

statusstring

Status of the task: created, processing, completed, or failed.

Esempio di risposta

{
  "id": "pred_abc123",
  "status": "completed",
  "model": "model-name",
  "outputs": [
    "https://storage.atlascloud.ai/outputs/result.mp4"
  ],
  "metrics": {
    "predict_time": 45.2
  },
  "created_at": "2025-01-01T00:00:00Z",
  "completed_at": "2025-01-01T00:00:10Z"
}

Atlas Cloud Skills

Atlas Cloud Skills integra oltre 400 modelli di IA direttamente nel tuo assistente di codifica IA. Un comando per installare, poi usa il linguaggio naturale per generare immagini, video e chattare con LLM.

Client supportati

Claude Code

OpenAI Codex

Gemini CLI

Cursor

Windsurf

VS Code

Trae

GitHub Copilot

Cline

Roo Code

Amp

Goose

Replit

40+ client supportati

Installa

npx skills add AtlasCloudAI/atlas-cloud-skills

Configura chiave API

Ottieni la tua chiave API dalla dashboard di Atlas Cloud e impostala come variabile d'ambiente.

export ATLASCLOUD_API_KEY="your-api-key-here"

Funzionalità

Una volta installato, puoi usare il linguaggio naturale nel tuo assistente IA per accedere a tutti i modelli Atlas Cloud.

Generazione di immaginiGenera immagini con modelli come Nano Banana 2, Z-Image e altri.

Creazione di videoCrea video da testo o immagini con Kling, Vidu, Veo, ecc.

Chat LLMChatta con Qwen, DeepSeek e altri grandi modelli linguistici.

Caricamento mediaCarica file locali per la modifica di immagini e flussi di lavoro da immagine a video.

Scopri di più

github.com/AtlasCloudAI/atlas-cloud-skills

Server MCP

Il server MCP di Atlas Cloud collega il tuo IDE con oltre 400 modelli di IA tramite il Model Context Protocol. Funziona con qualsiasi client compatibile MCP.

Client supportati

Cursor

VS Code

Windsurf

Claude Code

OpenAI Codex

Gemini CLI

Cline

Roo Code

100+ client supportati

Installa

npx -y atlascloud-mcp

Configurazione

Aggiungi la seguente configurazione al file delle impostazioni MCP del tuo IDE.

{
  "mcpServers": {
    "atlascloud": {
      "command": "npx",
      "args": [
        "-y",
        "atlascloud-mcp"
      ],
      "env": {
        "ATLASCLOUD_API_KEY": "your-api-key-here"
      }
    }
  }
}

Strumenti disponibili

atlas_generate_imageGenera immagini da prompt testuali.

atlas_generate_videoCrea video da testo o immagini.

atlas_chatChatta con grandi modelli linguistici.

atlas_list_modelsEsplora oltre 400 modelli di IA disponibili.

atlas_quick_generateCreazione di contenuti in un solo passaggio con selezione automatica del modello.

atlas_upload_mediaCarica file locali per i flussi di lavoro API.

Scopri di più

github.com/AtlasCloudAI/mcp-server

API Schema

{
  "components": {
    "schemas": {
      "Input": {
        "properties": {
          "model": {
            "type": "string",
            "description": "model name",
            "default": "kwaivgi/kling-video-o3-std/text-to-video-test"
          },
          "aspect_ratio": {
            "default": "16:9",
            "description": "The aspect ratio of the generated video.",
            "enum": [
              "16:9",
              "9:16",
              "1:1"
            ],
            "type": "string"
          },
          "duration": {
            "default": 5,
            "description": "The duration of the generated media in seconds (3-15).",
            "enum": [
              3,
              4,
              5,
              6,
              7,
              8,
              9,
              10,
              11,
              12,
              13,
              14,
              15
            ],
            "type": "integer",
            "x-ui-component": "select"
          },
          "prompt": {
            "description": "The positive prompt for the generation.",
            "type": "string"
          },
          "sound": {
            "default": true,
            "description": "Whether to generate audio for the video.",
            "type": "boolean"
          },
          "multi_shot": {
            "type": "boolean",
            "description": "Whether to enable multi-shot generation.",
            "default": false
          },
          "shot_type": {
            "type": "string",
            "description": "Multi-shot mode. customize = caller provides per-shot prompts; intelligence = model auto-splits the top-level prompt into shots. Required when multi_shot=true.",
            "enum": [
              "customize",
              "intelligence"
            ],
            "x-ui-component": "select"
          },
          "multi_prompt": {
            "type": "array",
            "description": "Per-shot storyboards. Required when multi_shot=true and shot_type=customize. Sum of each shot's duration must equal the top-level duration; each shot duration must be >= 1.",
            "minItems": 1,
            "maxItems": 6,
            "items": {
              "type": "object",
              "required": [
                "prompt",
                "duration"
              ],
              "properties": {
                "prompt": {
                  "type": "string",
                  "description": "Prompt for this shot. Supports subject mentions like <<<element_1>>>.",
                  "maxLength": 512
                },
                "duration": {
                  "type": "integer",
                  "description": "Duration of this shot in seconds (string, '1'~'15'). Sum of all shots must equal the top-level duration. Each shot duration >= 1.",
                  "minimum": 1
                }
              }
            }
          },
          "elements": {
            "type": "array",
            "description": "Subject references (Atlas naming; maps to Kling 'element_list'). Each item either references an existing subject by element_id, or creates a new one inline with element_name + reference_type + frontal_image / refer_images / refer_videos (Atlas wrapper feature — backend creates the element via Kling's element API, then injects the resulting element_id). Mention subjects in prompt with <<<element_N>>> (1-based, matches array position). For kling-v3-omni: up to 3 elements.",
            "maxItems": 6,
            "items": {
              "type": "object",
              "properties": {
                "reference_type": {
                  "type": "string",
                  "description": "Reference media type for the new subject (Atlas wrapper).",
                  "enum": [
                    "image_refer",
                    "video_refer"
                  ],
                  "default": "image_refer"
                },
                "frontal_image": {
                  "type": "string",
                  "description": "Frontal image URL of the new subject (required when reference_type=image_refer)."
                },
                "refer_images": {
                  "type": "array",
                  "description": "Reference image URLs of the new subject (used with reference_type=image_refer).",
                  "items": {
                    "type": "string"
                  },
                  "maxItems": 4
                },
                "refer_videos": {
                  "type": "array",
                  "description": "Reference video URLs of the new subject (used with reference_type=video_refer).",
                  "items": {
                    "type": "string"
                  },
                  "maxItems": 4
                },
                "element_name": {
                  "type": "string",
                  "description": "Name of the new subject to create inline (Atlas wrapper). Mutually exclusive with element_id."
                },
                "element_description": {
                  "type": "string",
                  "description": "Optional description of the new subject (Atlas wrapper).",
                  "x-hidden": true
                },
                "element_id": {
                  "type": "integer",
                  "description": "ID of an existing subject from the Kling element library (long / 64-bit integer). Mutually exclusive with element_name.",
                  "x-hidden": true
                }
              },
              "allOf": [
                {
                  "if": {
                    "properties": {
                      "reference_type": {
                        "const": "video_refer"
                      }
                    },
                    "required": [
                      "reference_type"
                    ]
                  },
                  "then": {
                    "required": [
                      "refer_videos"
                    ],
                    "properties": {
                      "frontal_image": {
                        "x-hidden": true
                      },
                      "refer_images": {
                        "x-hidden": true
                      },
                      "refer_videos": {
                        "minItems": 1
                      }
                    }
                  },
                  "else": {
                    "required": [
                      "frontal_image",
                      "refer_images"
                    ],
                    "properties": {
                      "refer_videos": {
                        "x-hidden": true
                      },
                      "refer_images": {
                        "minItems": 1
                      }
                    }
                  }
                }
              ]
            }
          }
        },
        "allOf": [
          {
            "if": {
              "properties": {
                "multi_shot": {
                  "const": true
                }
              },
              "required": [
                "multi_shot"
              ]
            },
            "then": {
              "required": [
                "shot_type"
              ]
            },
            "else": {
              "required": [
                "prompt"
              ],
              "properties": {
                "shot_type": {
                  "x-hidden": true
                },
                "multi_prompt": {
                  "x-hidden": true
                }
              }
            }
          },
          {
            "if": {
              "properties": {
                "shot_type": {
                  "const": "customize"
                }
              },
              "required": [
                "shot_type"
              ]
            },
            "then": {
              "required": [
                "multi_prompt"
              ],
              "properties": {
                "prompt": {
                  "x-hidden": true,
                  "x-hidden-hint": "Customize mode uses the per-shot prompts below. Your prompt is saved and reappears when you leave Customize mode."
                }
              }
            }
          },
          {
            "if": {
              "properties": {
                "shot_type": {
                  "const": "intelligence"
                }
              },
              "required": [
                "shot_type"
              ]
            },
            "then": {
              "required": [
                "prompt"
              ],
              "properties": {
                "multi_prompt": {
                  "x-hidden": true
                }
              }
            }
          }
        ],
        "required": [
          "model"
        ],
        "type": "object",
        "x-order-properties": [
          "model",
          "prompt",
          "multi_shot",
          "shot_type",
          "multi_prompt",
          "elements",
          "aspect_ratio",
          "duration",
          "sound"
        ]
      },
      "PredictionResponse": {
        "properties": {
          "created_at": {
            "description": "ISO timestamp of when the request was created.",
            "format": "date-time",
            "type": "string"
          },
          "has_nsfw_contents": {
            "description": "Array of boolean values indicating NSFW detection for each output.",
            "items": {
              "type": "boolean"
            },
            "type": "array"
          },
          "id": {
            "description": "Unique identifier for the prediction, the ID of the prediction to get.",
            "type": "string"
          },
          "model": {
            "description": "Model ID used for the prediction.",
            "type": "string"
          },
          "outputs": {
            "description": "Array of URLs to the generated content (empty when status is not completed).",
            "items": {
              "type": "string"
            },
            "type": "array"
          },
          "status": {
            "description": "Status of the task: created, processing, completed, or failed.",
            "type": "string"
          },
          "urls": {
            "description": "Object containing related API endpoints.",
            "type": "object"
          }
        },
        "type": "object"
      }
    },
    "securitySchemes": {
      "apiKeyAuth": {
        "in": "header",
        "name": "Authorization",
        "type": "apiKey"
      }
    }
  },
  "info": {
    "description": "The AtlasCloud API.",
    "title": "AtlasCloud API",
    "version": "1.0.0"
  },
  "openapi": "3.0.0",
  "paths": {
    "/api/v1/model/generateVideo": {
      "post": {
        "requestBody": {
          "content": {
            "application/json": {
              "schema": {
                "$ref": "#/components/schemas/Input"
              }
            }
          },
          "required": true
        },
        "responses": {
          "200": {
            "content": {
              "application/json": {
                "schema": {
                  "$ref": "#/components/schemas/PredictionResponse"
                }
              }
            },
            "description": "The request status."
          }
        }
      },
      "x-api-name": "model_run"
    },
    "/api/v1/model/result/{request_id}": {
      "get": {
        "parameters": [
          {
            "in": "path",
            "name": "request_id",
            "required": true,
            "schema": {
              "description": "Request ID",
              "type": "string"
            }
          }
        ],
        "responses": {
          "200": {
            "content": {
              "application/json": {
                "schema": {
                  "$ref": "#/components/schemas/PredictionResponse"
                }
              }
            },
            "description": "Result of the request."
          }
        }
      },
      "x-api-name": "model_result"
    }
  },
  "servers": [
    {
      "url": "https://api.atlascloud.ai"
    }
  ],
  "model": "kwaivgi/kling-video-o3-std/text-to-video"
}

Template di prompt ottimizzato per LLM

# kwaivgi/kling-video-o3-std/text-to-video

> Kling Omni Video O3 (Standard) is Kuaishou's advanced unified multi-modal video model with MVL (Multi-modal Visual Language) technology. Generates high-quality videos from text prompts with natural motion and audio generation support.


## Overview

- **Submit endpoint (POST)**: `https://api.atlascloud.ai/api/v1/model/generateVideo` — start an async generation; returns a `prediction_id`
- **Poll endpoint (GET)**: `https://api.atlascloud.ai/api/v1/model/prediction/{prediction_id}` — poll this until the prediction finishes
- **Model ID**: `kwaivgi/kling-video-o3-std/text-to-video`


## API Information

This model can be used via our HTTP API or more conveniently via our client libraries.
See the input and output schema below, as well as the usage examples.


### Input Schema

The API accepts the following input parameters:

- **`model`** (`string`, _required_):
  model name
  - Default: `"kwaivgi/kling-video-o3-std/text-to-video-test"`

- **`prompt`** (`string`, _optional_):
  The positive prompt for the generation.

- **`multi_shot`** (`boolean`, _optional_):
  Whether to enable multi-shot generation.
  - Default: `false`

- **`shot_type`** (`string`, _optional_):
  Multi-shot mode. customize = caller provides per-shot prompts; intelligence = model auto-splits the top-level prompt into shots. Required when multi_shot=true.
  - Options: "customize", "intelligence"

- **`multi_prompt`** (`array[object]`, _optional_):
  Per-shot storyboards. Required when multi_shot=true and shot_type=customize. Sum of each shot's duration must equal the top-level duration; each shot duration must be >= 1.
  - Min items: 1
  - Max items: 6
  - Item properties:
    - **`prompt`** (`string`, _required_):
      Prompt for this shot. Supports subject mentions like <<<element_1>>>.

    - **`duration`** (`integer`, _required_):
      Duration of this shot in seconds (string, '1'~'15'). Sum of all shots must equal the top-level duration. Each shot duration >= 1.
      - Min: 1


- **`elements`** (`array[object]`, _optional_):
  Subject references (Atlas naming; maps to Kling 'element_list'). Each item either references an existing subject by element_id, or creates a new one inline with element_name + reference_type + frontal_image / refer_images / refer_videos (Atlas wrapper feature — backend creates the element via Kling's element API, then injects the resulting element_id). Mention subjects in prompt with <<<element_N>>> (1-based, matches array position). For kling-v3-omni: up to 3 elements.
  - Max items: 6
  - Item properties:
    - **`reference_type`** (`string`, _optional_):
      Reference media type for the new subject (Atlas wrapper).
      - Default: `"image_refer"`
      - Options: "image_refer", "video_refer"

    - **`frontal_image`** (`string`, _optional_):
      Frontal image URL of the new subject (required when reference_type=image_refer).

    - **`refer_images`** (`array[string]`, _optional_):
      Reference image URLs of the new subject (used with reference_type=image_refer).
      - Max items: 4

    - **`refer_videos`** (`array[string]`, _optional_):
      Reference video URLs of the new subject (used with reference_type=video_refer).
      - Max items: 4

    - **`element_name`** (`string`, _optional_):
      Name of the new subject to create inline (Atlas wrapper). Mutually exclusive with element_id.

    - **`element_description`** (`string`, _optional_):
      Optional description of the new subject (Atlas wrapper).

    - **`element_id`** (`integer`, _optional_):
      ID of an existing subject from the Kling element library (long / 64-bit integer). Mutually exclusive with element_name.


- **`aspect_ratio`** (`string`, _optional_):
  The aspect ratio of the generated video.
  - Default: `"16:9"`
  - Options: "16:9", "9:16", "1:1"

- **`duration`** (`integer`, _optional_):
  The duration of the generated media in seconds (3-15).
  - Default: `5`
  - Options: 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15

- **`sound`** (`boolean`, _optional_):
  Whether to generate audio for the video.
  - Default: `true`



**Required Parameters Example**:

```json
{
  "model": "kwaivgi/kling-video-o3-std/text-to-video"
}
```


**Full Example**:

```json
{
  "model": "kwaivgi/kling-video-o3-std/text-to-video",
  "prompt": "",
  "multi_shot": false,
  "shot_type": "customize",
  "multi_prompt": [
    {
      "prompt": "",
      "duration": 1
    }
  ],
  "elements": [
    {
      "reference_type": "image_refer",
      "frontal_image": "",
      "refer_images": [
        ""
      ],
      "refer_videos": [
        ""
      ],
      "element_name": "",
      "element_description": "",
      "element_id": 0
    }
  ],
  "aspect_ratio": "16:9",
  "duration": 5,
  "sound": true
}
```


### Output Schema

The API returns the following output format:


- **`created_at`** (`string`, _optional_):
  ISO timestamp of when the request was created.

- **`has_nsfw_contents`** (`array[boolean]`, _optional_):
  Array of boolean values indicating NSFW detection for each output.

- **`id`** (`string`, _optional_):
  Unique identifier for the prediction, the ID of the prediction to get.

- **`model`** (`string`, _optional_):
  Model ID used for the prediction.

- **`outputs`** (`array[string]`, _optional_):
  Array of URLs to the generated content (empty when status is not completed).

- **`status`** (`string`, _optional_):
  Status of the task: created, processing, completed, or failed.

- **`urls`** (`object`, _optional_):
  Object containing related API endpoints.



**Example Response**:

```json
{
  "created_at": "",
  "has_nsfw_contents": [],
  "id": "",
  "model": "",
  "outputs": [
    ""
  ],
  "status": "",
  "urls": {}
}
```


## Usage Examples

### cURL

```bash
# Step 1: Start generation (async)
curl -X POST "https://api.atlascloud.ai/api/v1/model/generateVideo" \
  -H "Authorization: Bearer $ATLASCLOUD_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{
  "model": "kwaivgi/kling-video-o3-std/text-to-video",
  "prompt": "",
  "multi_shot": false,
  "shot_type": "customize",
  "multi_prompt": [
    {
      "prompt": "",
      "duration": 1
    }
  ],
  "elements": [
    {
      "reference_type": "image_refer",
      "frontal_image": "",
      "refer_images": [
        ""
      ],
      "refer_videos": [
        ""
      ],
      "element_name": "",
      "element_description": "",
      "element_id": 0
    }
  ],
  "aspect_ratio": "16:9",
  "duration": 5,
  "sound": true
}'

# Response will contain: {"code": 200, "data": {"id": "prediction_id", "status": "processing"}}

# Step 2: Poll for result (replace {prediction_id} with the id returned above)
curl -X GET "https://api.atlascloud.ai/api/v1/model/prediction/{prediction_id}" \
  -H "Authorization: Bearer $ATLASCLOUD_API_KEY"

# Keep polling until status is "completed", "succeeded" or "failed"
# When completed, outputs will contain the generated content URL(s)
```

## Additional Resources

### Documentation

- [Model Playground](https://www.atlascloud.ai/models/kwaivgi/kling-video-o3-std/text-to-video)

Nessun esempio disponibile

Caricamento...

Kling Video O3 Standard Text-to-Video

Kling Video O3 Standard is Kuaishou's advanced text-to-video model in the O3 family, delivering high-quality cinematic video from text descriptions. With optional synchronized sound generation, multiple aspect ratios, and flexible duration from 3 to 15 seconds, it offers a strong balance of quality and cost.

Why Choose This?

O3-level quality Advanced visual fidelity and motion realism beyond V3.0 models.

Sound generation Optional synchronized sound effects generated alongside the video.

Flexible duration Generate videos from 3 to 15 seconds — any length you need.

Multiple aspect ratios Support for 16:9, 9:16, and 1:1 to fit any platform.

Prompt Enhancer Built-in tool to automatically improve your video descriptions.

Parameters

Parameter	Required	Description
prompt	Yes	Text description of the video scene and motion
aspect_ratio	No	Output ratio: 16:9 (default), 9:16, 1:1
duration	No	Video length: 3-15 seconds (default: 5)
sound	No	Generate synchronized sound (default: disabled)

How to Use

Run — submit and download your video.
Enable sound (optional) — generate synchronized audio with the video.
Set duration — choose any length from 3 to 15 seconds.
Select aspect ratio — match your target platform.
Write your prompt — describe the scene, characters, motion, and style in detail.

Best Use Cases

Long-Form Scenes — Up to 15 seconds for extended scene development.
Concept Visualization — Bring creative ideas to life from text.
Marketing Videos — Produce promotional content with optional sound.
Social Media — Create engaging videos for TikTok, Reels, and Stories.
Professional Content — High-quality videos at a more accessible price than O3 Pro.

Pro Tips

Use O3 Standard for regular production; upgrade to O3 Pro for maximum quality.
Use shorter durations (3-5s) for testing, longer (10-15s) for final production.
Be specific about camera movements, lighting, and atmosphere for best results.
Enable sound for a complete video experience with synchronized audio.
Match aspect ratio to your platform: 16:9 for YouTube, 9:16 for TikTok/Reels, 1:1 for Instagram.
Use the Prompt Enhancer to refine your descriptions automatically.

Notes

Duration supports any value from 3 to 15 seconds.
Only prompt is required; other parameters have defaults.

Kling V3.0 Pro Text-to-Video — V3.0 Pro quality text-to-video.
Kling V3.0 Standard Text-to-Video — V3.0 Standard at lower cost.
Kling Video O3 Pro Image-to-Video — O3 Pro quality image-to-video.

Esplora Modelli Simili

NEW

testo-in-video

TURBO

Kling V3.0 Turbo Text-to-Video

Kling V3.0 Turbo Text-to-Video generates dynamic cinematic videos from text prompts using MVL technology. Supports first/last frame control and audio generation.

Kling V3.0 Turbo Image-to-Video

Kling V3.0 Turbo Image-to-Video transforms static images into dynamic cinematic videos using MVL technology. Supports first/last frame control and audio generation.

Kling Video O3 4K Text-to-Video

Kling Omni Video O3 (4K) is Kuaishou advanced unified multi-modal video model with MVL (Multi-modal Visual Language) technology. Generates high-quality videos from text prompts with natural motion and audio generation support.

Kling Video O3 4K Image-to-Video

Kling Omni Video O3 (4K) Image-to-Video transforms static images into dynamic cinematic videos using MVL technology. Supports first/last frame control and audio generation.

Kling v3.0 4K Image-to-Video

Kling v3.0 4K Image-to-Video model by Kuaishou. High-quality video generation from images.

Kling v3.0 Std Image-to-Video

Kling v3.0 Standard Image-to-Video model by Kuaishou. High-quality video generation from images.

Kling v3.0 Pro Image-to-Video

Kling v3.0 Professional Image-to-Video model by Kuaishou. Premium quality video generation from images with advanced features.

Kling v3.0 Pro Text-to-Video

Kling v3.0 Professional Text-to-Video model by Kuaishou. Premium quality video generation from text prompts with advanced features.

Kling v3.0 4K Text-to-Video

Kling v3.0 4K Text-to-Video model by Kuaishou. High-quality video generation from text prompts.

Kling v3.0 Std Text-to-Video

Kling v3.0 Standard Text-to-Video model by Kuaishou. High-quality video generation from text prompts.

Kling v2.6 Pro Avatar

Kling V2 AI Avatar Pro generates high-quality AI avatar videos with clean detail, stable motion, and strong identity consistency—ideal for profiles, intros, and social content.

Kling v2.6 Std Avatar

Kling AI Avatar generates high-quality AI avatar videos for profiles, intros, and social content, delivering clean detail and cinematic motion with reliable prompt adherence.

Kling v2.6 Pro Motion Control

Kling 2.6 Pro Motion Control turns reference motion clips (dance, action, gesture) into smooth, realistic animations. Upload a character image (or source video) and a motion video; the model transfers the movement while preserving identity and temporal consistency.

Kling v2.6 Std Motion Control

Kling 2.6 Standard Motion Control transfers motion from reference videos to animate still images. Upload a character image and a motion clip (dance, action, gesture), and the model extracts the movement to generate smooth, realistic video.

Kling Video O3 Pro Video-Edit

Kling Omni Video O3 Video-Edit enables conversational video editing through natural language commands. Professional quality with object removal/replacement, background changes, and effects.

Kling Video O3 Pro Text-to-Video

Kling Omni Video O3 is Kuaishou's advanced unified multi-modal video model with MVL (Multi-modal Visual Language) technology. Professional quality with enhanced motion and detail.

From$0.112/SEC

$0.095/SEC

-15%

Un'unica API per tutta l'IA multimediale.

Esplora tutti i modelli

Kling Video O3 Std Text-to-Video API by Kuaishou

INPUT

OUTPUT

Parametri

Esempio di codice

Installa

Autenticazione

Header HTTP

Invia una richiesta

Invia una richiesta

Corpo della richiesta

Risposta

Controlla lo stato

Esempio di polling

Valori di stato

Risposta completata

Carica file

Esempio di caricamento

Risposta

Schema di input

Esempio di corpo della richiesta

Schema di output

Esempio di risposta

Atlas Cloud Skills

Client supportati

Installa

Configura chiave API

Funzionalità

Server MCP

Client supportati

Installa

Configurazione

Strumenti disponibili

API Schema

Template di prompt ottimizzato per LLM

Kling Video O3 Standard Text-to-Video

Why Choose This?

Parameters

How to Use

Best Use Cases

Pro Tips

Notes

Related Models

Esplora Modelli Simili

Kling V3.0 Turbo Text-to-Video

Kling V3.0 Turbo Image-to-Video

Kling Video O3 4K Text-to-Video

Kling Video O3 4K Image-to-Video

Kling v3.0 4K Image-to-Video

Kling v3.0 Std Image-to-Video

Kling v3.0 Pro Image-to-Video

Kling v3.0 Pro Text-to-Video

Kling v3.0 4K Text-to-Video

Kling v3.0 Std Text-to-Video

Kling v2.6 Pro Avatar

Kling v2.6 Std Avatar

Kling v2.6 Pro Motion Control

Kling v2.6 Std Motion Control

Kling Video O3 Pro Video-Edit

Kling Video O3 Pro Text-to-Video

Un'unica API per tutta l'IA multimediale.

Join our Discord community

INPUT

OUTPUT

Parametri

Esempio di codice

Installa

Autenticazione

Header HTTP

Invia una richiesta

Invia una richiesta

Corpo della richiesta

Risposta

Controlla lo stato

Esempio di polling

Valori di stato

Risposta completata

Carica file

Esempio di caricamento

Risposta