vidu/q2/reference-to-video

Vidu Q2 Reference-to-Video API by Vidu

Vidu Q2 Reference-to-Video is an advanced AI video generation model that brings static images to life. Upload a reference image and describe the motion you want — the model generates high-quality video with smooth animation, optional audio, and cinematic quality up to 1080p.

Each run costs $0.064. With $10 you can run the model about 156 times.
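The run estimate is the budget divided by the per-run price, rounded down. A minimal sketch of that arithmetic, assuming the quoted $0.064 applies to every run (the page does not say whether duration or resolution changes the price); `runs_for_budget` is a hypothetical helper name.

```python
from decimal import Decimal

PRICE_PER_RUN = Decimal("0.064")  # USD per run, as quoted on this page


def runs_for_budget(budget_usd: str) -> int:
    """Return how many whole runs fit in the given budget."""
    return int(Decimal(budget_usd) // PRICE_PER_RUN)


print(runs_for_budget("10"))  # 156
```

`Decimal` is used instead of `float` so that the division is not affected by binary rounding of 0.064.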

Code example
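import os
import time

import requests

# Step 1: Start video generation
generate_url = "https://api.atlascloud.ai/api/v1/model/generateVideo"
# Read the key from the environment: Python does not expand "$ATLASCLOUD_API_KEY" inside a string
API_KEY = os.environ["ATLASCLOUD_API_KEY"]
headers = {
    "Content-Type": "application/json",
    "Authorization": f"Bearer {API_KEY}"
}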
data = {
    "model": "vidu/q2/reference-to-video",  # Required. model name
    "subjects": [
        {
            "id": "example_id",
            "images": [
                "https://example.com/image1.jpg"
            ]
        }
    ],  # Required. Information about the subjects in the images
    "prompt": "Santa Claus and the bear hug by the lakeside.",  # Required. A textual description for video generation
    "duration": 5,  # The duration of the generated media in seconds. (min: 1, max: 10)
    "resolution": "720p",  # The resolution of the generated media. options: 540p | 720p | 1080p
    "audio_type": "all",  # Audio type, required when audio is true, defaults to all. options: all | speech_only | sound_effect_only
    "aspect_ratio": "16:9",  # The aspect ratio of the generated media. options: 16:9 | 9:16 | 1:1 | 4:3 | 3:4
    "movement_amplitude": "auto",  # The movement amplitude of objects in the frame. options: auto | small | medium | large
    "generate_audio": True,  # Whether to generate audio for the video
    "seed": 0,  # The random seed to use for the generation
}

generate_response = requests.post(generate_url, headers=headers, json=data)
generate_result = generate_response.json()
prediction_id = generate_result["data"]["id"]

# Step 2: Poll for result
poll_url = f"https://api.atlascloud.ai/api/v1/model/prediction/{prediction_id}"

def check_status():
    while True:
        response = requests.get(poll_url, headers=headers)
        result = response.json()

        if result["data"]["status"] in ["completed", "succeeded"]:
            print("Generated video:", result["data"]["outputs"][0])
            return result["data"]["outputs"][0]
        elif result["data"]["status"] == "failed":
            raise Exception(result["data"].get("error") or "Generation failed")
        else:
            # Still processing, wait 2 seconds
            time.sleep(2)

video_url = check_status()

Install

Install the required package for your programming language.

pip install requests

Authentication

All API requests require authentication with an API key. You can get your API key from the Atlas Cloud dashboard.

export ATLASCLOUD_API_KEY="your-api-key-here"

HTTP headers

import os

API_KEY = os.environ.get("ATLASCLOUD_API_KEY")
headers = {
    "Content-Type": "application/json",
    "Authorization": f"Bearer {API_KEY}"
}

Keep your API key secure

Never expose your API key in client-side code or public repositories. Use environment variables or a backend proxy instead.
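One practical detail when reading the key from the environment: if the variable is unset, `os.environ.get` returns `None` and the header silently becomes `Bearer None`. The sketch below fails fast instead. `auth_headers` is a hypothetical helper name, not part of any Atlas Cloud library.

```python
import os


def auth_headers(env_var: str = "ATLASCLOUD_API_KEY") -> dict:
    """Build request headers from an environment variable, failing fast if it is unset."""
    api_key = os.environ.get(env_var, "").strip()
    if not api_key:
        raise RuntimeError(f"Set the {env_var} environment variable first")
    return {
        "Content-Type": "application/json",
        "Authorization": f"Bearer {api_key}",
    }
```

The returned dictionary can be passed as `headers=` to `requests.post` or `requests.get`.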

Submit a request

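import os

import requests

url = "https://api.atlascloud.ai/api/v1/model/generateVideo"
headers = {
    "Content-Type": "application/json",
    # Read the key from the environment; "$ATLASCLOUD_API_KEY" is not expanded inside a Python string
    "Authorization": f"Bearer {os.environ['ATLASCLOUD_API_KEY']}"
}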
data = {
    "model": "your-model",
    "prompt": "A beautiful landscape"
}

response = requests.post(url, headers=headers, json=data)
print(response.json())

Submit a request

Submit an asynchronous generation request. The API returns a prediction ID that you can use to check the status and retrieve the result.

POST /api/v1/model/generateVideo

Request body

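import os

import requests

url = "https://api.atlascloud.ai/api/v1/model/generateVideo"
headers = {
    "Content-Type": "application/json",
    # Read the key from the environment; "$ATLASCLOUD_API_KEY" is not expanded inside a Python string
    "Authorization": f"Bearer {os.environ['ATLASCLOUD_API_KEY']}"
}

data = {
    "model": "vidu/q2/reference-to-video",
    # "subjects" is a required parameter for this model (see Input Schema)
    "subjects": [
        {"id": "example_id", "images": ["https://example.com/image1.jpg"]}
    ],
    "prompt": "A beautiful sunset over the ocean with gentle waves"
}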

response = requests.post(url, headers=headers, json=data)
result = response.json()

print(f"Prediction ID: {result['data']['id']}")
print(f"Status: {result['data']['status']}")

Response

{
  "code": 200,
  "data": {
    "id": "pred_abc123",
    "status": "processing",
    "model": "model-name",
    "created_at": "2025-01-01T00:00:00Z"
  }
}

Check status

Poll the prediction endpoint to check the current status of your request.

GET /api/v1/model/prediction/{prediction_id}

Polling example

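import os
import time

import requests

prediction_id = "pred_abc123"
url = f"https://api.atlascloud.ai/api/v1/model/prediction/{prediction_id}"
# Read the key from the environment; "$ATLASCLOUD_API_KEY" is not expanded inside a Python string
headers = {"Authorization": f"Bearer {os.environ['ATLASCLOUD_API_KEY']}"}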

while True:
    response = requests.get(url, headers=headers)
    result = response.json()
    status = result["data"]["status"]
    print(f"Status: {status}")

    if status in ["completed", "succeeded"]:
        output_url = result["data"]["outputs"][0]
        print(f"Output URL: {output_url}")
        break
    elif status == "failed":
        print(f"Error: {result['data'].get('error', 'Unknown')}")
        break

    time.sleep(3)

Status values

processing: The request is still being processed.

completed: Generation is complete. Outputs are available.

succeeded: Generation succeeded. Outputs are available.

failed: Generation failed. Check the error field.
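The status values map naturally onto a polling loop with a deadline, so that a stuck prediction does not block forever. The following is a sketch under assumptions: `wait_for_prediction` is a hypothetical helper, the 600-second timeout and 3-second interval are arbitrary choices, and the response is assumed to carry the prediction under a `data` key as in the status-check examples on this page.

```python
import time
from typing import Callable

TERMINAL_OK = {"completed", "succeeded"}


def wait_for_prediction(fetch: Callable[[], dict], timeout_s: float = 600.0,
                        interval_s: float = 3.0, sleep=time.sleep) -> dict:
    """Call fetch() until the prediction reaches a terminal status or the timeout passes."""
    deadline = time.monotonic() + timeout_s
    while True:
        data = fetch()["data"]
        status = data["status"]
        if status in TERMINAL_OK:
            return data
        if status == "failed":
            raise RuntimeError(data.get("error") or "Generation failed")
        if time.monotonic() >= deadline:
            raise TimeoutError(f"Still '{status}' after {timeout_s} seconds")
        sleep(interval_s)
```

With `requests`, the `fetch` argument could be `lambda: requests.get(url, headers=headers).json()`. Passing the fetcher in keeps the loop testable without network access.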

Completed response

{
  "data": {
    "id": "pred_abc123",
    "status": "completed",
    "outputs": [
      "https://storage.atlascloud.ai/outputs/result.mp4"
    ],
    "metrics": {
      "predict_time": 45.2
    },
    "created_at": "2025-01-01T00:00:00Z",
    "completed_at": "2025-01-01T00:00:10Z"
  }
}
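Once `outputs` is populated, the video can be saved locally. This sketch uses only the standard library and assumes the output URL can be fetched without an Authorization header, which the page does not state. `filename_from_url` and `download_output` are hypothetical helper names.

```python
import os
import shutil
import urllib.request
from urllib.parse import urlparse


def filename_from_url(url: str, default: str = "output.mp4") -> str:
    """Use the last path segment of the URL as the local file name."""
    name = os.path.basename(urlparse(url).path)
    return name or default


def download_output(url: str, directory: str = ".") -> str:
    """Stream the generated file to disk and return the local path."""
    path = os.path.join(directory, filename_from_url(url))
    with urllib.request.urlopen(url) as response, open(path, "wb") as f:
        shutil.copyfileobj(response, f)
    return path
```

For the response above, `download_output(result["data"]["outputs"][0])` would write `result.mp4` to the current directory.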

Upload files

Upload files to Atlas Cloud storage and get a URL that you can use in your API requests. Uploads use multipart/form-data.

POST /api/v1/model/uploadMedia

Upload example

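import os

import requests

url = "https://api.atlascloud.ai/api/v1/model/uploadMedia"
# Read the key from the environment; "$ATLASCLOUD_API_KEY" is not expanded inside a Python string
headers = {"Authorization": f"Bearer {os.environ['ATLASCLOUD_API_KEY']}"}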

with open("image.png", "rb") as f:
    files = {"file": ("image.png", f, "image/png")}
    response = requests.post(url, headers=headers, files=files)

result = response.json()
download_url = result["data"]["download_url"]
print(f"File URL: {download_url}")

Response

{
  "data": {
    "download_url": "https://storage.atlascloud.ai/uploads/abc123/image.png",
    "file_name": "image.png",
    "content_type": "image/png",
    "size": 1024000
  }
}
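The `download_url` from an upload can then be placed in the `subjects` array of a generation request. A minimal sketch, assuming the upload response has the shape shown above; `subject_from_uploads` is a hypothetical helper, and the 1 to 3 image limit comes from the Input Schema on this page.

```python
def subject_from_uploads(upload_responses: list, subject_id: str) -> dict:
    """Turn one to three uploadMedia responses into a single `subjects` entry."""
    urls = [r["data"]["download_url"] for r in upload_responses]
    if not 1 <= len(urls) <= 3:
        raise ValueError("A subject takes 1 to 3 images")
    return {"id": subject_id, "images": urls}


upload = {"data": {"download_url": "https://storage.atlascloud.ai/uploads/abc123/image.png"}}
print(subject_from_uploads([upload], "santa"))
```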

Input Schema

The following parameters are accepted in the request body.

Total: 10 | Required: 3 | Optional: 7

model (string, required)

model name

Default: "vidu/q2/reference-to-video"

subjects (array[object], required)

Information about the subjects in the images. Supports 1–7 subjects, total 1–7 images

Min items: 1 | Max items: 7

id (string, required)

Subject ID. Usable in prompts via @subjectId

images (array[string], required)

URLs of images corresponding to the subject. Each subject supports up to 3 images.
1. Assets can be provided via URLs or Base64 encode.
2. You must use one of the following codecs: PNG, JPEG, JPG, WebP.
3. The dimensions of the images must be at least 128x128 pixels.
4. The aspect ratio of the images must be less than 1:4 or 4:1.
5. The post body of the HTTP request should not exceed 20MB, and it must include an appropriate content type string.

Min items: 1 | Max items: 3

prompt (string, required)

A textual description for video generation.

Default: "Santa Claus and the bear hug by the lakeside."

duration (number)

The duration of the generated media in seconds.

Default: 5 | Min: 1 | Max: 10

resolution (string)

The resolution of the generated media.

Default: "720p"

Options: 540p | 720p | 1080p

audio_type (string)

Audio type, required when audio is true, defaults to all.

Default: "all"

Options: all | speech_only | sound_effect_only

aspect_ratio (string)

The aspect ratio of the generated media.

Default: "16:9"

Options: 16:9 | 9:16 | 1:1 | 4:3 | 3:4

movement_amplitude (string)

The movement amplitude of objects in the frame.

Default: "auto"

Options: auto | small | medium | large

generate_audio (boolean)

Whether to generate audio for the video.

Default: true

seed (integer)

The random seed to use for the generation. -1 means a random seed will be used.

Default: 0
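The count limits on `subjects` can be checked client-side before a request is sent. The sketch below covers only the limits stated in the schema (1 to 7 subjects, 1 to 3 images per subject, at most 7 images in total); `subject_problems` is a hypothetical helper, and treating an empty `id` as a problem is an assumption, since the schema only says the field is required.

```python
def subject_problems(subjects: list) -> list:
    """Return human-readable problems; an empty list means the count limits are met."""
    problems = []
    if not 1 <= len(subjects) <= 7:
        problems.append("subjects must contain 1 to 7 items")
    total_images = 0
    for index, subject in enumerate(subjects):
        images = subject.get("images") or []
        if not subject.get("id"):
            problems.append(f"subject {index} has no id")
        if not 1 <= len(images) <= 3:
            problems.append(f"subject {index} must have 1 to 3 images")
        total_images += len(images)
    if total_images > 7:
        problems.append("at most 7 images are allowed in total")
    return problems
```

Image format, dimension, and body-size rules are not checked here, because they require inspecting the image data itself.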

Example request body

{
  "model": "vidu/q2/reference-to-video",
  "subjects": [
    {
      "id": "example_id",
      "images": [
        "https://example.com/file.jpg"
      ]
    }
  ],
  "prompt": "Santa Claus and the bear hug by the lakeside.",
  "duration": 5,
  "resolution": "720p",
  "audio_type": "all",
  "aspect_ratio": "16:9",
  "movement_amplitude": "auto",
  "generate_audio": true,
  "seed": 0
}
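Because subject IDs can be referenced in the prompt via `@subjectId`, it may help to check that every reference matches a declared subject. This sketch assumes the reference syntax is `@` immediately followed by the ID and that IDs consist of letters, digits, and underscores; the page does not specify either detail. `unknown_subject_refs` is a hypothetical helper.

```python
import re


def unknown_subject_refs(prompt: str, subjects: list) -> list:
    """Return @references in the prompt that match no subject id."""
    known = {s["id"] for s in subjects}
    refs = re.findall(r"@(\w+)", prompt)
    return sorted(set(refs) - known)


subjects = [
    {"id": "santa", "images": ["https://example.com/santa.jpg"]},
    {"id": "bear", "images": ["https://example.com/bear.jpg"]},
]
print(unknown_subject_refs("@santa and @bear hug by the lakeside.", subjects))  # []
```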

Output Schema

The API returns a prediction response containing the generated output URLs.

created_at (string)

ISO timestamp of when the request was created.

id (string)

Unique identifier for the prediction.

model (string)

Model ID used for the prediction.

outputs (array)

Array of URLs to the generated content (empty when status is not completed).

status (string)

Status of the task: created, processing, completed, or failed.

Example response

{
  "id": "pred_abc123",
  "status": "completed",
  "model": "model-name",
  "outputs": [
    "https://storage.atlascloud.ai/outputs/result.mp4"
  ],
  "metrics": {
    "predict_time": 45.2
  },
  "created_at": "2025-01-01T00:00:00Z",
  "completed_at": "2025-01-01T00:00:10Z"
}
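The examples on this page show the prediction both wrapped in a `data` envelope (in the status-check section) and as a flat object (in the example response above). Which shape the live API returns is not confirmed here, so a defensive client might accept both. `unwrap_prediction` is a hypothetical helper.

```python
def unwrap_prediction(body: dict) -> dict:
    """Return the prediction object whether or not it is wrapped in `data`."""
    inner = body.get("data")
    return inner if isinstance(inner, dict) else body


wrapped = {"code": 200, "data": {"id": "pred_abc123", "status": "processing"}}
flat = {"id": "pred_abc123", "status": "completed", "outputs": []}
print(unwrap_prediction(wrapped)["status"], unwrap_prediction(flat)["status"])
```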

Atlas Cloud Skills

Atlas Cloud Skills integrates 400+ AI models directly into your AI coding assistant. Install it with one command, then use natural language to generate images and videos and to chat with LLMs.

Supported clients

Claude Code

OpenAI Codex

Gemini CLI

Cursor

Windsurf

VS Code

Trae

GitHub Copilot

Cline

Roo Code

Amp

Goose

Replit

40+ supported clients

Install

npx skills add AtlasCloudAI/atlas-cloud-skills

Configure API key

Get your API key from the Atlas Cloud dashboard and set it as an environment variable.

export ATLASCLOUD_API_KEY="your-api-key-here"

Features

Once installed, you can use natural language in your AI assistant to access all Atlas Cloud models.

Image generation: Generate images with models such as Nano Banana 2, Z-Image, and more.

Video creation: Create videos from text or images with Kling, Vidu, Veo, and others.

LLM chat: Chat with Qwen, DeepSeek, and other large language models.

Media upload: Upload local files for image-editing and image-to-video workflows.

Learn more

github.com/AtlasCloudAI/atlas-cloud-skills

MCP Server

Atlas Cloud MCP Server connects your IDE to 400+ AI models via the Model Context Protocol. It works with any MCP-compatible client.

Supported clients

Cursor

VS Code

Windsurf

Claude Code

OpenAI Codex

Gemini CLI

Cline

Roo Code

100+ supported clients

Install

npx -y atlascloud-mcp

Configuration

Add the following configuration to your IDE's MCP settings file.

{
  "mcpServers": {
    "atlascloud": {
      "command": "npx",
      "args": [
        "-y",
        "atlascloud-mcp"
      ],
      "env": {
        "ATLASCLOUD_API_KEY": "your-api-key-here"
      }
    }
  }
}

Available tools

atlas_generate_image: Generate images from text prompts.

atlas_generate_video: Create videos from text or images.

atlas_chat: Chat with large language models.

atlas_list_models: Browse 400+ available AI models.

atlas_quick_generate: One-step content creation with automatic model selection.

atlas_upload_media: Upload local files for API workflows.

Learn more

github.com/AtlasCloudAI/mcp-server

API Schema

{
  "components": {
    "schemas": {
      "Input": {
        "properties": {
          "model": {
            "type": "string",
            "description": "model name",
            "default": "vidu/q2/reference-to-video"
          },
          "subjects": {
            "type": "array",
            "items": {
              "type": "object",
              "required": [
                "id",
                "images"
              ],
              "properties": {
                "id": {
                  "type": "string",
                  "description": "Subject ID. Usable in prompts via @subjectId"
                },
                "images": {
                  "type": "array",
                  "items": {
                    "type": "string"
                  },
                  "minItems": 1,
                  "maxItems": 3,
                  "description": "URLs of images corresponding to the subject. Each subject supports up to 3 images. 1.Assets can be provided via URLs or Base64 encode. 2.You must use one of the following codecs: PNG, JPEG, JPG, WebP. 3.The dimensions of the images must be at least 128x128 pixels. 4.The aspect ratio of the images must be less than 1:4 or 4:1. 5.The post body of the HTTP request should not exceed 20MB, and it must include an appropriate content type string."
                }
              }
            },
            "minItems": 1,
            "maxItems": 7,
            "description": "Information about the subjects in the images. Supports 1–7 subjects, total 1–7 images"
          },
          "prompt": {
            "type": "string",
            "default": "Santa Claus and the bear hug by the lakeside.",
            "description": "A textual description for video generation."
          },
          "duration": {
            "default": 5,
            "description": "The duration of the generated media in seconds.",
            "maximum": 10,
            "minimum": 1,
            "step": 1,
            "type": "number",
            "x-ui-component": "slider"
          },
          "resolution": {
            "default": "720p",
            "description": "The resolution of the generated media.",
            "enum": [
              "540p",
              "720p",
              "1080p"
            ],
            "type": "string"
          },
          "audio_type": {
            "enum": [
              "all",
              "speech_only",
              "sound_effect_only"
            ],
            "type": "string",
            "default": "all",
            "description": "Audio type, required when audio is true, defaults to all."
          },
          "aspect_ratio": {
            "default": "16:9",
            "description": "The aspect ratio of the generated media.",
            "enum": [
              "16:9",
              "9:16",
              "1:1",
              "4:3",
              "3:4"
            ],
            "type": "string"
          },
          "movement_amplitude": {
            "default": "auto",
            "description": "The movement amplitude of objects in the frame.",
            "enum": [
              "auto",
              "small",
              "medium",
              "large"
            ],
            "type": "string"
          },
          "generate_audio": {
            "default": true,
            "description": "Whether to generate audio for the video.",
            "type": "boolean"
          },
          "seed": {
            "description": "The random seed to use for the generation. -1 means a random seed will be used.",
            "default": 0,
            "type": "integer"
          }
        },
        "required": [
          "model",
          "prompt",
          "subjects"
        ],
        "type": "object",
        "x-order-properties": [
          "model",
          "subjects",
          "prompt",
          "duration",
          "resolution",
          "generate_audio",
          "audio_type",
          "aspect_ratio",
          "movement_amplitude",
          "seed"
        ]
      },
      "PredictionResponse": {
        "properties": {
          "created_at": {
            "description": "ISO timestamp of when the request was created.",
            "format": "date-time",
            "type": "string"
          },
          "has_nsfw_contents": {
            "description": "Array of boolean values indicating NSFW detection for each output.",
            "items": {
              "type": "boolean"
            },
            "type": "array"
          },
          "id": {
            "description": "Unique identifier for the prediction.",
            "type": "string"
          },
          "model": {
            "description": "Model ID used for the prediction.",
            "type": "string"
          },
          "outputs": {
            "description": "Array of URLs to the generated content (empty when status is not completed).",
            "items": {
              "type": "string"
            },
            "type": "array"
          },
          "status": {
            "description": "Status of the task: created, processing, completed, or failed.",
            "type": "string"
          },
          "urls": {
            "description": "Object containing related API endpoints.",
            "type": "object"
          }
        },
        "type": "object"
      }
    },
    "securitySchemes": {
      "apiKeyAuth": {
        "in": "header",
        "name": "Authorization",
        "type": "apiKey"
      }
    }
  },
  "info": {
    "description": "The AtlasCloud API.",
    "title": "AtlasCloud API",
    "version": "1.0.0"
  },
  "openapi": "3.0.0",
  "paths": {
    "/api/v1/model/prediction/{request_id}": {
      "get": {
        "parameters": [
          {
            "in": "path",
            "name": "request_id",
            "required": true,
            "schema": {
              "description": "Request ID",
              "type": "string"
            }
          }
        ],
        "responses": {
          "200": {
            "content": {
              "application/json": {
                "schema": {
                  "$ref": "#/components/schemas/PredictionResponse"
                }
              }
            },
            "description": "Result of the request."
          }
        }
      },
      "x-api-name": "model_result"
    },
    "/api/v1/model/generateVideo": {
      "post": {
        "requestBody": {
          "content": {
            "application/json": {
              "schema": {
                "$ref": "#/components/schemas/Input"
              }
            }
          },
          "required": true
        },
        "responses": {
          "200": {
            "content": {
              "application/json": {
                "schema": {
                  "$ref": "#/components/schemas/PredictionResponse"
                }
              }
            },
            "description": "The request status."
          }
        }
      },
      "x-api-name": "model_run"
    }
  },
  "servers": [
    {
      "url": "https://api.atlascloud.ai"
    }
  ]
}

LLM-friendly prompt template

# vidu/q2/reference-to-video

> Vidu Q2 Reference-to-Video is an advanced AI video generation model that brings static images to life. Upload a reference image and describe the motion you want — the model generates high-quality video with smooth animation, optional audio, and cinematic quality up to 1080p.


## Overview

- **Submit endpoint (POST)**: `https://api.atlascloud.ai/api/v1/model/generateVideo` — start an async generation; returns a `prediction_id`
- **Poll endpoint (GET)**: `https://api.atlascloud.ai/api/v1/model/prediction/{prediction_id}` — poll this until the prediction finishes
- **Model ID**: `vidu/q2/reference-to-video`


## API Information

This model can be used via our HTTP API or more conveniently via our client libraries.
See the input and output schema below, as well as the usage examples.


### Input Schema

The API accepts the following input parameters:

- **`model`** (`string`, _required_):
  model name
  - Default: `"vidu/q2/reference-to-video"`

- **`subjects`** (`array[object]`, _required_):
  Information about the subjects in the images. Supports 1–7 subjects, total 1–7 images
  - Min items: 1
  - Max items: 7
  - Item properties:
    - **`id`** (`string`, _required_):
      Subject ID. Usable in prompts via @subjectId

    - **`images`** (`array[string]`, _required_):
      URLs of images corresponding to the subject. Each subject supports up to 3 images. 1.Assets can be provided via URLs or Base64 encode. 2.You must use one of the following codecs: PNG, JPEG, JPG, WebP. 3.The dimensions of the images must be at least 128x128 pixels. 4.The aspect ratio of the images must be less than 1:4 or 4:1. 5.The post body of the HTTP request should not exceed 20MB, and it must include an appropriate content type string.
      - Min items: 1
      - Max items: 3


- **`prompt`** (`string`, _required_):
  A textual description for video generation.
  - Default: `"Santa Claus and the bear hug by the lakeside."`

- **`duration`** (`number`, _optional_):
  The duration of the generated media in seconds.
  - Default: `5`
  - Min: 1
  - Max: 10

- **`resolution`** (`string`, _optional_):
  The resolution of the generated media.
  - Default: `"720p"`
  - Options: "540p", "720p", "1080p"

- **`generate_audio`** (`boolean`, _optional_):
  Whether to generate audio for the video.
  - Default: `true`

- **`audio_type`** (`string`, _optional_):
  Audio type, required when audio is true, defaults to all.
  - Default: `"all"`
  - Options: "all", "speech_only", "sound_effect_only"

- **`aspect_ratio`** (`string`, _optional_):
  The aspect ratio of the generated media.
  - Default: `"16:9"`
  - Options: "16:9", "9:16", "1:1", "4:3", "3:4"

- **`movement_amplitude`** (`string`, _optional_):
  The movement amplitude of objects in the frame.
  - Default: `"auto"`
  - Options: "auto", "small", "medium", "large"

- **`seed`** (`integer`, _optional_):
  The random seed to use for the generation. -1 means a random seed will be used.
  - Default: `0`



**Required Parameters Example**:

```json
{
  "model": "vidu/q2/reference-to-video",
  "prompt": "Santa Claus and the bear hug by the lakeside.",
  "subjects": [
    {
      "id": "",
      "images": [
        ""
      ]
    }
  ]
}
```


**Full Example**:

```json
{
  "model": "vidu/q2/reference-to-video",
  "subjects": [
    {
      "id": "",
      "images": [
        ""
      ]
    }
  ],
  "prompt": "Santa Claus and the bear hug by the lakeside.",
  "duration": 5,
  "resolution": "720p",
  "generate_audio": true,
  "audio_type": "all",
  "aspect_ratio": "16:9",
  "movement_amplitude": "auto",
  "seed": 0
}
```


### Output Schema

The API returns the following output format:


- **`created_at`** (`string`, _optional_):
  ISO timestamp of when the request was created.

- **`has_nsfw_contents`** (`array[boolean]`, _optional_):
  Array of boolean values indicating NSFW detection for each output.

- **`id`** (`string`, _optional_):
  Unique identifier for the prediction.

- **`model`** (`string`, _optional_):
  Model ID used for the prediction.

- **`outputs`** (`array[string]`, _optional_):
  Array of URLs to the generated content (empty when status is not completed).

- **`status`** (`string`, _optional_):
  Status of the task: created, processing, completed, or failed.

- **`urls`** (`object`, _optional_):
  Object containing related API endpoints.



**Example Response**:

```json
{
  "created_at": "",
  "has_nsfw_contents": [],
  "id": "",
  "model": "",
  "outputs": [
    ""
  ],
  "status": "",
  "urls": {}
}
```
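The `has_nsfw_contents` array is described as holding one flag per output, so a client can pair it with `outputs` to skip flagged results. This is a sketch that assumes the two arrays align by index, which the schema implies but does not state outright; `safe_outputs` is a hypothetical helper.

```python
def safe_outputs(prediction: dict) -> list:
    """Return output URLs whose matching NSFW flag is not set."""
    outputs = prediction.get("outputs") or []
    flags = prediction.get("has_nsfw_contents") or []
    # Pad missing flags with False so outputs without a flag are kept.
    flags = list(flags) + [False] * (len(outputs) - len(flags))
    return [url for url, flagged in zip(outputs, flags) if not flagged]
```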


## Usage Examples

### cURL

```bash
# Step 1: Start generation (async)
curl -X POST "https://api.atlascloud.ai/api/v1/model/generateVideo" \
  -H "Authorization: Bearer $ATLASCLOUD_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{
  "model": "vidu/q2/reference-to-video",
  "subjects": [
    {
      "id": "",
      "images": [
        ""
      ]
    }
  ],
  "prompt": "Santa Claus and the bear hug by the lakeside.",
  "duration": 5,
  "resolution": "720p",
  "generate_audio": true,
  "audio_type": "all",
  "aspect_ratio": "16:9",
  "movement_amplitude": "auto",
  "seed": 0
}'

# Response will contain: {"code": 200, "data": {"id": "prediction_id", "status": "processing"}}

# Step 2: Poll for result (replace {prediction_id} with the id returned above)
curl -X GET "https://api.atlascloud.ai/api/v1/model/prediction/{prediction_id}" \
  -H "Authorization: Bearer $ATLASCLOUD_API_KEY"

# Keep polling until status is "completed", "succeeded" or "failed"
# When completed, outputs will contain the generated content URL(s)
```

## Additional Resources

### Documentation

- [Model Playground](https://www.atlascloud.ai/models/vidu/q2/reference-to-video)
