
Veo3.1 Text-to-Video API by Google
Generate high-fidelity videos from text prompts with Google’s most advanced generative video model. Veo 3.1 delivers cinematic quality, dynamic camera motion, and lifelike detail for storytelling and creative production.
輸入
輸出
閒置每次執行將花費 $0.2。$10 可執行約 50 次。
程式碼範例
import requests
import time
# Step 1: Start video generation
generate_url = "https://api.atlascloud.ai/api/v1/model/generateVideo"
headers = {
"Content-Type": "application/json",
"Authorization": "Bearer $ATLASCLOUD_API_KEY"
}
data = {
"model": "google/veo3.1/text-to-video",
"prompt": "A beautiful sunset over the ocean with gentle waves",
"width": 512,
"height": 512,
"duration": 3,
"fps": 24,
}
generate_response = requests.post(generate_url, headers=headers, json=data)
generate_result = generate_response.json()
prediction_id = generate_result["data"]["id"]
# Step 2: Poll for result
poll_url = f"https://api.atlascloud.ai/api/v1/model/prediction/{prediction_id}"
def check_status():
while True:
response = requests.get(poll_url, headers={"Authorization": "Bearer $ATLASCLOUD_API_KEY"})
result = response.json()
if result["data"]["status"] in ["completed", "succeeded"]:
print("Generated video:", result["data"]["outputs"][0])
return result["data"]["outputs"][0]
elif result["data"]["status"] == "failed":
raise Exception(result["data"]["error"] or "Generation failed")
else:
# Still processing, wait 2 seconds
time.sleep(2)
video_url = check_status()安裝
為您的程式語言安裝所需的套件。
pip install requests驗證
所有 API 請求都需要透過 API 金鑰進行驗證。您可以從 Atlas Cloud 儀表板取得 API 金鑰。
export ATLASCLOUD_API_KEY="your-api-key-here"HTTP 標頭
import os
API_KEY = os.environ.get("ATLASCLOUD_API_KEY")
headers = {
"Content-Type": "application/json",
"Authorization": f"Bearer {API_KEY}"
}切勿在客戶端程式碼或公開儲存庫中暴露您的 API 金鑰。請改用環境變數或後端代理。
提交請求
import requests
url = "https://api.atlascloud.ai/api/v1/model/generateVideo"
headers = {
"Content-Type": "application/json",
"Authorization": "Bearer $ATLASCLOUD_API_KEY"
}
data = {
"model": "your-model",
"prompt": "A beautiful landscape"
}
response = requests.post(url, headers=headers, json=data)
print(response.json())提交請求
提交非同步生成請求。API 會傳回一個預測 ID,您可以用它來檢查狀態並取得結果。
/api/v1/model/generateVideo請求主體
import requests
url = "https://api.atlascloud.ai/api/v1/model/generateVideo"
headers = {
"Content-Type": "application/json",
"Authorization": "Bearer $ATLASCLOUD_API_KEY"
}
data = {
"model": "google/veo3.1/text-to-video",
"input": {
"prompt": "A beautiful sunset over the ocean with gentle waves"
}
}
response = requests.post(url, headers=headers, json=data)
result = response.json()
print(f"Prediction ID: {result['id']}")
print(f"Status: {result['status']}")回應
{
"id": "pred_abc123",
"status": "processing",
"model": "model-name",
"created_at": "2025-01-01T00:00:00Z"
}檢查狀態
輪詢預測端點以檢查請求的當前狀態。
/api/v1/model/prediction/{prediction_id}輪詢範例
import requests
import time
prediction_id = "pred_abc123"
url = f"https://api.atlascloud.ai/api/v1/model/prediction/{prediction_id}"
headers = { "Authorization": "Bearer $ATLASCLOUD_API_KEY" }
while True:
response = requests.get(url, headers=headers)
result = response.json()
status = result["data"]["status"]
print(f"Status: {status}")
if status in ["completed", "succeeded"]:
output_url = result["data"]["outputs"][0]
print(f"Output URL: {output_url}")
break
elif status == "failed":
print(f"Error: {result['data'].get('error', 'Unknown')}")
break
time.sleep(3)狀態值
processing請求仍在處理中。completed生成完成。輸出已可取得。succeeded生成成功。輸出已可取得。failed生成失敗。請檢查錯誤欄位。完成回應
{
"data": {
"id": "pred_abc123",
"status": "completed",
"outputs": [
"https://storage.atlascloud.ai/outputs/result.mp4"
],
"metrics": {
"predict_time": 45.2
},
"created_at": "2025-01-01T00:00:00Z",
"completed_at": "2025-01-01T00:00:10Z"
}
}上傳檔案
上傳檔案至 Atlas Cloud 儲存空間並取得 URL,可用於您的 API 請求。使用 multipart/form-data 上傳。
/api/v1/model/uploadMedia上傳範例
import requests
url = "https://api.atlascloud.ai/api/v1/model/uploadMedia"
headers = { "Authorization": "Bearer $ATLASCLOUD_API_KEY" }
with open("image.png", "rb") as f:
files = {"file": ("image.png", f, "image/png")}
response = requests.post(url, headers=headers, files=files)
result = response.json()
download_url = result["data"]["download_url"]
print(f"File URL: {download_url}")回應
{
"data": {
"download_url": "https://storage.atlascloud.ai/uploads/abc123/image.png",
"file_name": "image.png",
"content_type": "image/png",
"size": 1024000
}
}輸入 Schema
以下參數可在請求主體中使用。
無可用參數。
範例請求主體
{
"model": "google/veo3.1/text-to-video"
}輸出 Schema
API 傳回包含生成輸出 URL 的預測回應。
範例回應
{
"id": "pred_abc123",
"status": "completed",
"model": "model-name",
"outputs": [
"https://storage.atlascloud.ai/outputs/result.mp4"
],
"metrics": {
"predict_time": 45.2
},
"created_at": "2025-01-01T00:00:00Z",
"completed_at": "2025-01-01T00:00:10Z"
}Atlas Cloud Skills
Atlas Cloud Skills 將 300 多個 AI 模型直接整合至您的 AI 程式碼助手。一鍵安裝,即可使用自然語言生成圖片、影片,以及與 LLM 對話。
支援的客戶端
安裝
npx skills add AtlasCloudAI/atlas-cloud-skills設定 API 金鑰
從 Atlas Cloud 儀表板取得 API 金鑰,並設為環境變數。
export ATLASCLOUD_API_KEY="your-api-key-here"功能
安裝完成後,您可以在 AI 助手中使用自然語言存取所有 Atlas Cloud 模型。
MCP Server
Atlas Cloud MCP Server 透過 Model Context Protocol 將您的 IDE 與 300 多個 AI 模型連接。支援任何 MCP 相容的客戶端。
支援的客戶端
安裝
npx -y atlascloud-mcp設定
將以下設定新增至您 IDE 的 MCP 設定檔中。
{
"mcpServers": {
"atlascloud": {
"command": "npx",
"args": [
"-y",
"atlascloud-mcp"
],
"env": {
"ATLASCLOUD_API_KEY": "your-api-key-here"
}
}
}
}可用工具
API Schema
Schema 不可用Google Veo 3.1 — Text-to-Video (T2V) Model
Veo 3.1 T2V is the latest text-to-video model from Google DeepMind, designed to bring cinematic storytelling to life through text. It generates high-fidelity 1080p videos with synchronized, context-aware audio, realistic motion, and narrative consistency — making it one of the most advanced generative video systems ever released.
Why it stands out
-
Cinematic Realism
Produces natural lighting, smooth camera transitions, and accurate perspective for film-like motion.
-
Native Audio Generation
Generates synchronized ambient sound, dialogue, and music directly aligned with the visuals.
-
Dialogue & Lip-Sync
Supports speaking characters and realistic facial expressions — perfect for storytelling, marketing, or short-form content.
-
Subject Consistency (R2V)
Maintains a character’s or object’s identity across frames using 1–3 reference images.
-
Video Interpolation
Seamlessly animates transitions between two given frames — ideal for smooth start-to-end storytelling.
-
Flexible Output
Supports both 720p and 1080p, at 24 FPS, duration for 4s, 6s, 8s, and in both 16:9 (landscape) and 9:16 (portrait) formats.
Key Parameters
-
prompt — Describe your scene or story (e.g., “A drone shot flying over Las Vegas, transitioning from day to night with soft jazz in the background”).
-
durationSeconds — Choose video length (4s, 6s, or 8s).
-
resolution — 720p or 1080p.
-
aspectRatio — Landscape (16:9) or Portrait (9:16).
Pricing (Preview Stage)
| Model | Description | Input Type | Output | Price |
|---|---|---|---|---|
| Veo 3.1 (Video + Audio) | Generate videos with synchronized sound | Text / Image | Video + Audio | $0.40 / sec |
| Veo 3.1 (Video only) | Generate high-quality silent videos | Text / Image | Video | $0.20 / sec |
Minimum cost: ~$3.20 per clip (based on 8s @ 1080p).
How to Use
-
Write a Prompt
Describe the desired motion, camera style, lighting, and sound.
Example: “A cinematic sunset over the ocean, waves glimmering as seagulls fly across the horizon.”
-
Adjust Parameters
Select duration, resolution (720p/1080p), and aspect ratio.
-
Generate
Submit your request — Veo 3.1 will render motion, lighting, and synchronized audio.
-
Preview & Download
Review your video, refine your prompt if needed, then download the final MP4.
Pro Tips
-
Keep prompts focused on one main action or subject for better coherence.
-
Use camera verbs like “tracking,” “zoom out,” or “handheld” for cinematic control.
-
Mention lighting and mood cues (e.g., “under soft moonlight,” “golden-hour glow”).
-
Use R2V for character-based storytelling; Interpolation for smooth transitions.
-
Avoid conflicting instructions (e.g., “fast zoom” and “slow motion” together).
Notes & Limitations
-
Generation time: ~2–3 minutes for an 8-second 1080p clip.
-
Frame rate fixed at 24 FPS.
-
Advanced controls (R2V, I2V, Interpolation) are mutually exclusive — only one per generation.
-
If your prompt is blocked, rewrite it and resubmit (safety thresholds may adjust during preview).






