Gemini Omni Flash Image-to-Video API by Google

google/gemini-omni-flash/image-to-video

Image-to-video

A natively multimodal Google DeepMind model that animates a still image into a cinematic, sound-enabled video guided by a text prompt while preserving the source subject and composition.

Each run costs $0.13; $10 covers about 76 runs.

Code Example

import os
import time

import requests

# Python does not expand "$ATLASCLOUD_API_KEY" inside a string,
# so read the key from the environment explicitly.
API_KEY = os.environ["ATLASCLOUD_API_KEY"]

# Step 1: Start video generation
generate_url = "https://api.atlascloud.ai/api/v1/model/generateVideo"
headers = {
    "Content-Type": "application/json",
    "Authorization": f"Bearer {API_KEY}",
}
data = {
    "model": "google/gemini-omni-flash/image-to-video",  # Required. Model name
    "image": "example_value",  # Required. Placeholder: replace with a public image URL or a base64-encoded image
    "prompt": "A beautiful sunset over the ocean with gentle waves",  # Required. Text prompt for generation
    "duration": 10,  # Duration of the generated video in seconds (min: 3, max: 10)
    "aspect_ratio": "16:9",  # Aspect ratio of the generated video. Options: 16:9 | 9:16
    "resolution": "720p",  # Resolution of the generated video. Options: 720p
    "thinking_level": "default",  # Amount of internal reasoning before generation. Options: default | high | low
    "seed": -1,  # Random seed; -1 means a random seed is used
}

generate_response = requests.post(generate_url, headers=headers, json=data)
generate_result = generate_response.json()
prediction_id = generate_result["data"]["id"]

# Step 2: Poll for result
poll_url = f"https://api.atlascloud.ai/api/v1/model/prediction/{prediction_id}"

def check_status():
    while True:
        response = requests.get(poll_url, headers={"Authorization": f"Bearer {API_KEY}"})
        result = response.json()
        status = result["data"]["status"]

        if status in ["completed", "succeeded"]:
            print("Generated video:", result["data"]["outputs"][0])
            return result["data"]["outputs"][0]
        elif status in ["failed", "timeout"]:
            # "timeout" is listed in the API schema; stop polling instead of looping forever
            raise Exception(result["data"].get("error") or f"Generation {status}")
        else:
            # Still processing, wait 2 seconds
            time.sleep(2)

video_url = check_status()

Installation

Install the required package for your language.

pip install requests

Authentication

All API requests require authentication with an API key. You can get your API key from the Atlas Cloud dashboard.

export ATLASCLOUD_API_KEY="your-api-key-here"

HTTP Headers

import os

API_KEY = os.environ.get("ATLASCLOUD_API_KEY")
headers = {
    "Content-Type": "application/json",
    "Authorization": f"Bearer {API_KEY}"
}

Keep your API key secure

Do not expose your API key in client-side code or public repositories. Use environment variables or a backend proxy instead.
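One way to follow this advice in a script is to build the headers in a single place and refuse to run when the key is absent. The sketch below is illustrative only: `build_auth_headers` is a hypothetical helper name, not part of any Atlas Cloud library.

```python
import os


def build_auth_headers(env=None):
    """Return request headers using the key stored in ATLASCLOUD_API_KEY.

    `env` defaults to os.environ; passing a plain dict makes the helper easy to test.
    """
    env = os.environ if env is None else env
    api_key = (env.get("ATLASCLOUD_API_KEY") or "").strip()
    if not api_key:
        # Fail early instead of sending an empty or "None" bearer token.
        raise RuntimeError("ATLASCLOUD_API_KEY is not set")
    return {
        "Content-Type": "application/json",
        "Authorization": f"Bearer {api_key}",
    }
```

Calling `build_auth_headers()` with no arguments reads the real environment, so the key never has to appear in source code.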

Send a Request

import os

import requests

url = "https://api.atlascloud.ai/api/v1/model/generateVideo"
headers = {
    "Content-Type": "application/json",
    # Read the key from the environment; "$ATLASCLOUD_API_KEY" is not expanded inside a Python string
    "Authorization": f"Bearer {os.environ['ATLASCLOUD_API_KEY']}"
}
data = {
    "model": "your-model",
    "prompt": "A beautiful landscape"
}

response = requests.post(url, headers=headers, json=data)
print(response.json())

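Submit a Request

Submit an asynchronous generation request. The API returns a prediction ID that you can use to check status and retrieve the result.

POST /api/v1/model/generateVideo

Request Body

import os

import requests

url = "https://api.atlascloud.ai/api/v1/model/generateVideo"
headers = {
    "Content-Type": "application/json",
    # Read the key from the environment; "$ATLASCLOUD_API_KEY" is not expanded inside a Python string
    "Authorization": f"Bearer {os.environ['ATLASCLOUD_API_KEY']}"
}

data = {
    "model": "google/gemini-omni-flash/image-to-video",
    # `image` is a required parameter; this URL is a placeholder, replace it with your own image
    "image": "https://example.com/input.png",
    "prompt": "A beautiful sunset over the ocean with gentle waves"
}

response = requests.post(url, headers=headers, json=data)
result = response.json()

print(f"Prediction ID: {result['data']['id']}")
print(f"Status: {result['data']['status']}")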

Response

{
  "code": 200,
  "data": {
    "id": "pred_abc123",
    "status": "processing",
    "model": "model-name",
    "created_at": "2025-01-01T00:00:00Z"
  }
}
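A small sketch of how a client might pull the prediction ID out of this response before polling. `extract_prediction_id` is a hypothetical helper; the shape of an error response is an assumption here (the Output Schema only says `message` is non-empty on failure).

```python
def extract_prediction_id(submit_response):
    """Return data.id from a submit response, or raise if it is missing."""
    data = submit_response.get("data") or {}
    prediction_id = data.get("id")
    if not prediction_id:
        # Assumption: on failure the body carries a human-readable `message`.
        message = submit_response.get("message") or "no prediction ID in response"
        raise ValueError(f"Submit failed: {message}")
    return prediction_id


sample = {
    "code": 200,
    "data": {
        "id": "pred_abc123",
        "status": "processing",
        "model": "model-name",
        "created_at": "2025-01-01T00:00:00Z",
    },
}
print(extract_prediction_id(sample))  # pred_abc123
```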

Check Status

Poll the prediction endpoint to check the current status of a request.

GET /api/v1/model/prediction/{prediction_id}

Polling Example

import os
import time

import requests

prediction_id = "pred_abc123"
url = f"https://api.atlascloud.ai/api/v1/model/prediction/{prediction_id}"
# Read the key from the environment; "$ATLASCLOUD_API_KEY" is not expanded inside a Python string
headers = { "Authorization": f"Bearer {os.environ['ATLASCLOUD_API_KEY']}" }

while True:
    response = requests.get(url, headers=headers)
    result = response.json()
    status = result["data"]["status"]
    print(f"Status: {status}")

    if status in ["completed", "succeeded"]:
        output_url = result["data"]["outputs"][0]
        print(f"Output URL: {output_url}")
        break
    elif status in ["failed", "timeout"]:
        # "timeout" is listed in the API schema; stop polling instead of looping forever
        print(f"Error: {result['data'].get('error', 'Unknown')}")
        break

    time.sleep(3)

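Status Values

processing: The request is still being processed.

completed: Generation is complete. Outputs are available.

succeeded: Generation succeeded. Outputs are available.

failed: Generation failed. Check the error field.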
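The status values above can be folded into a bounded polling helper. This is a sketch under stated assumptions: `wait_for_outputs` is a hypothetical name, the default interval and attempt limit are arbitrary, and treating `timeout` (a status mentioned only in the API schema) as a failure is an assumption. The HTTP call is injected as `fetch` so the loop can be exercised without network access.

```python
import time

SUCCESS_STATUSES = {"completed", "succeeded"}
# Assumption: "timeout" (from the API schema) is terminal and counts as a failure.
FAILURE_STATUSES = {"failed", "timeout"}


def wait_for_outputs(fetch, interval=3.0, max_attempts=100, sleep=time.sleep):
    """Poll `fetch()` until the prediction reaches a terminal status.

    `fetch` is any zero-argument callable returning the decoded JSON body of
    GET /api/v1/model/prediction/{prediction_id}.
    """
    for _ in range(max_attempts):
        data = fetch().get("data") or {}
        status = data.get("status")
        if status in SUCCESS_STATUSES:
            return data.get("outputs") or []
        if status in FAILURE_STATUSES:
            raise RuntimeError(data.get("error") or f"Generation ended with status {status!r}")
        sleep(interval)  # not finished yet; wait before the next poll
    raise TimeoutError(f"No terminal status after {max_attempts} polls")
```

With `requests`, `fetch` could be `lambda: requests.get(url, headers=headers).json()`, where `url` and `headers` are built as in the polling example.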

Completed Response

{
  "data": {
    "id": "pred_abc123",
    "status": "completed",
    "outputs": [
      "https://storage.atlascloud.ai/outputs/result.mp4"
    ],
    "metrics": {
      "predict_time": 45.2
    },
    "created_at": "2025-01-01T00:00:00Z",
    "completed_at": "2025-01-01T00:00:10Z"
  }
}

Upload Files

Upload a file to Atlas Cloud storage and get a URL you can use in API requests. Uploads use multipart/form-data.

POST /api/v1/model/uploadMedia

Upload Example

import os

import requests

url = "https://api.atlascloud.ai/api/v1/model/uploadMedia"
# Read the key from the environment; "$ATLASCLOUD_API_KEY" is not expanded inside a Python string
headers = { "Authorization": f"Bearer {os.environ['ATLASCLOUD_API_KEY']}" }

with open("image.png", "rb") as f:
    files = {"file": ("image.png", f, "image/png")}
    response = requests.post(url, headers=headers, files=files)

result = response.json()
download_url = result["data"]["download_url"]
print(f"File URL: {download_url}")

Response

{
  "data": {
    "download_url": "https://storage.atlascloud.ai/uploads/abc123/image.png",
    "file_name": "image.png",
    "content_type": "image/png",
    "size": 1024000
  }
}
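Since the upload returns a URL that can be used in API requests, the `download_url` can be passed as the `image` value of a generation request. A minimal sketch, assuming a hypothetical helper `payload_from_upload`; the prompt text is just an example.

```python
MODEL_ID = "google/gemini-omni-flash/image-to-video"


def payload_from_upload(upload_response, prompt, **options):
    """Build a generateVideo request body that points `image` at an uploaded file."""
    download_url = upload_response["data"]["download_url"]
    payload = {"model": MODEL_ID, "image": download_url, "prompt": prompt}
    payload.update(options)  # optional parameters, e.g. duration=5
    return payload


upload_response = {
    "data": {
        "download_url": "https://storage.atlascloud.ai/uploads/abc123/image.png",
        "file_name": "image.png",
        "content_type": "image/png",
        "size": 1024000,
    }
}
payload = payload_from_upload(upload_response, "Slow push-in on the subject", duration=5)
print(payload["image"])
```

The resulting dictionary can be sent as the JSON body of `POST /api/v1/model/generateVideo`.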

Input Schema

The following parameters can be used in the request body.

Total: 8, Required: 3, Optional: 5

model (string, required)

model name

Default: "google/gemini-omni-flash/image-to-video"

image (string, required)

The image to animate into a video, used as the starting frame or motion guide. Supported formats: PNG, JPEG, JPG, WebP. Limited to 20MB. Supports both a public URL and a base64-encoded image.

Format: uri

prompt (string, required)

Text prompt for generation. Describes the target content, style, camera language, or character actions. Maximum 20,000 characters.

duration (integer)

The duration of the generated video in seconds.

Default: 10, Min: 3, Max: 10

aspect_ratio (string)

The aspect ratio of the generated video.

Default: "16:9"

Options: 16:9, 9:16

resolution (string)

The resolution of the generated video.

Default: "720p"

Options: 720p

thinking_level (string)

Controls the amount of internal reasoning the model performs before generating a response. Higher levels may improve quality on complex tasks but increase latency.

Default: "default"

Options: default, high, low

seed (integer)

The random seed to use for the generation. -1 means a random seed will be used.

Default: -1
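The constraints listed above can be checked client-side before a request is sent. The following is a sketch, not an official validator: `validate_request` is a hypothetical helper, and how the server itself reports invalid input is not documented here.

```python
ASPECT_RATIOS = {"16:9", "9:16"}
RESOLUTIONS = {"720p"}
THINKING_LEVELS = {"default", "high", "low"}


def validate_request(body):
    """Return a list of problems found in a request body; empty means it looks valid."""
    problems = []
    for field in ("model", "image", "prompt"):
        if not body.get(field):
            problems.append(f"{field} is required")
    if len(body.get("prompt") or "") > 20000:
        problems.append("prompt exceeds 20,000 characters")
    duration = body.get("duration", 10)
    if isinstance(duration, bool) or not isinstance(duration, int) or not 3 <= duration <= 10:
        problems.append("duration must be an integer from 3 to 10")
    if body.get("aspect_ratio", "16:9") not in ASPECT_RATIOS:
        problems.append("aspect_ratio must be 16:9 or 9:16")
    if body.get("resolution", "720p") not in RESOLUTIONS:
        problems.append("resolution must be 720p")
    if body.get("thinking_level", "default") not in THINKING_LEVELS:
        problems.append("thinking_level must be default, high, or low")
    if not isinstance(body.get("seed", -1), int):
        problems.append("seed must be an integer")
    return problems
```

Omitted optional fields are treated as their documented defaults, so a body containing only `model`, `image`, and `prompt` passes.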

Example Request Body

{
  "model": "google/gemini-omni-flash/image-to-video",
  "image": "example_image",
  "prompt": "A beautiful landscape",
  "duration": 10,
  "aspect_ratio": "16:9",
  "resolution": "720p",
  "thinking_level": "default",
  "seed": -1
}

Output Schema

The API returns a prediction response containing the generated output URLs.

code (integer)

HTTP status code of the response.

message (string)

Human-readable message; non-empty on failure.

data (object)

Example Response

{
  "id": "pred_abc123",
  "status": "completed",
  "model": "model-name",
  "outputs": [
    "https://storage.atlascloud.ai/outputs/result.mp4"
  ],
  "metrics": {
    "predict_time": 45.2
  },
  "created_at": "2025-01-01T00:00:00Z",
  "completed_at": "2025-01-01T00:00:10Z"
}

Atlas Cloud Skills

Atlas Cloud Skills integrates 400+ AI models directly into your AI coding assistant. Install it with one command, then generate images and videos or chat with LLMs in natural language.

Supported Clients

Claude Code

OpenAI Codex

Gemini CLI

Cursor

Windsurf

VS Code

Trae

GitHub Copilot

Cline

Roo Code

Amp

Goose

Replit

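40+ supported clients

Installation

npx skills add AtlasCloudAI/atlas-cloud-skills

Configure the API Key

Get your API key from the Atlas Cloud dashboard and set it as an environment variable.

export ATLASCLOUD_API_KEY="your-api-key-here"

Features

Once installed, you can access all Atlas Cloud models from your AI assistant using natural language.

Image generation: Generate images with models such as Nano Banana 2 and Z-Image.

Video creation: Create videos from text or images with Kling, Vidu, Veo, and others.

LLM chat: Chat with large language models such as Qwen and DeepSeek.

Media upload: Upload local files for image-editing and image-to-video workflows.

Learn more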

github.com/AtlasCloudAI/atlas-cloud-skills

MCP Server

Atlas Cloud MCP Server connects your IDE to 400+ AI models through the Model Context Protocol. It works with any MCP-compatible client.

Supported Clients

Cursor

VS Code

Windsurf

Claude Code

OpenAI Codex

Gemini CLI

Cline

Roo Code

100+ supported clients

Installation

npx -y atlascloud-mcp

Configuration

Add the following configuration to your IDE's MCP settings file.

{
  "mcpServers": {
    "atlascloud": {
      "command": "npx",
      "args": [
        "-y",
        "atlascloud-mcp"
      ],
      "env": {
        "ATLASCLOUD_API_KEY": "your-api-key-here"
      }
    }
  }
}

Available Tools

atlas_generate_image: Generate images from text prompts.

atlas_generate_video: Create videos from text or images.

atlas_chat: Chat with large language models.

atlas_list_models: Browse 400+ available AI models.

atlas_quick_generate: Automatically select the best model and create content in one step.

atlas_upload_media: Upload local files for API workflows.

Learn more

github.com/AtlasCloudAI/mcp-server

API Schema

{
  "info": {
    "title": "AtlasCloud API",
    "version": "1.0.0",
    "description": "The AtlasCloud API."
  },
  "openapi": "3.0.0",
  "paths": {
    "/api/v1/model/generateVideo": {
      "post": {
        "requestBody": {
          "content": {
            "application/json": {
              "schema": {
                "$ref": "#/components/schemas/Input"
              }
            }
          },
          "required": true
        },
        "responses": {
          "200": {
            "content": {
              "application/json": {
                "schema": {
                  "$ref": "#/components/schemas/PredictionResponse"
                }
              }
            },
            "description": "The request status."
          }
        }
      },
      "x-api-name": "model_run"
    },
    "/api/v1/model/prediction/{request_id}": {
      "get": {
        "parameters": [
          {
            "in": "path",
            "name": "request_id",
            "required": true,
            "schema": {
              "description": "Request ID",
              "type": "string"
            }
          }
        ],
        "responses": {
          "200": {
            "content": {
              "application/json": {
                "schema": {
                  "$ref": "#/components/schemas/PredictionResponse"
                }
              }
            },
            "description": "Result of the request."
          }
        }
      },
      "x-api-name": "model_result"
    }
  },
  "components": {
    "schemas": {
      "Input": {
        "properties": {
          "model": {
            "type": "string",
            "description": "model name",
            "default": "google/gemini-omni-flash/image-to-video"
          },
          "image": {
            "description": "The image to animate into a video, used as the starting frame or motion guide. Supported formats: PNG, JPEG, JPG, WebP. Limited to 20MB. Supports both a public URL and a base64-encoded image.",
            "type": "string",
            "format": "uri",
            "x-ui-component": "uploader"
          },
          "prompt": {
            "description": "Text prompt for generation. Describes the target content, style, camera language, or character actions. Maximum 20,000 characters.",
            "type": "string"
          },
          "duration": {
            "default": 10,
            "description": "The duration of the generated video in seconds.",
            "maximum": 10,
            "minimum": 3,
            "type": "integer",
            "x-ui-component": "select"
          },
          "aspect_ratio": {
            "default": "16:9",
            "description": "The aspect ratio of the generated video.",
            "enum": [
              "16:9",
              "9:16"
            ],
            "type": "string",
            "x-placeholder": "Select one",
            "x-ui-component": "select"
          },
          "resolution": {
            "default": "720p",
            "description": "The resolution of the generated video.",
            "enum": [
              "720p"
            ],
            "type": "string",
            "x-placeholder": "Select one",
            "x-ui-component": "select"
          },
          "thinking_level": {
            "description": "Controls the amount of internal reasoning the model performs before generating a response. Higher levels may improve quality on complex tasks but increase latency.",
            "default": "default",
            "enum": [
              "default",
              "high",
              "low"
            ],
            "type": "string"
          },
          "seed": {
            "default": -1,
            "description": "The random seed to use for the generation. -1 means a random seed will be used.",
            "type": "integer"
          }
        },
        "required": [
          "model",
          "image",
          "prompt"
        ],
        "type": "object",
        "x-order-properties": [
          "model",
          "prompt",
          "image",
          "duration",
          "aspect_ratio",
          "thinking_level",
          "resolution",
          "seed"
        ]
      },
      "PredictionResponse": {
        "type": "object",
        "properties": {
          "code": {
            "description": "HTTP status code of the response.",
            "type": "integer"
          },
          "message": {
            "description": "Human-readable message; non-empty on failure.",
            "type": "string"
          },
          "data": {
            "type": "object",
            "properties": {
              "id": {
                "description": "Unique identifier for the prediction.",
                "type": "string"
              },
              "model": {
                "description": "Model ID used for the prediction.",
                "type": "string"
              },
              "outputs": {
                "description": "Array of URLs to the generated content. Null when status is not completed.",
                "type": "array",
                "items": {
                  "type": "string"
                },
                "nullable": true
              },
              "urls": {
                "description": "Object containing related API endpoints.",
                "type": "object",
                "properties": {
                  "get": {
                    "description": "URL to poll for the prediction result.",
                    "type": "string",
                    "format": "uri"
                  }
                }
              },
              "has_nsfw_contents": {
                "description": "Array of boolean values indicating NSFW detection for each output. Null if not applicable.",
                "type": "array",
                "items": {
                  "type": "boolean"
                },
                "nullable": true
              },
              "status": {
                "description": "Status of the task: created, processing, completed, timeout, or failed.",
                "type": "string"
              },
              "created_at": {
                "description": "ISO timestamp of when the request was created (e.g., \"2023-04-01T12:34:56.789Z\").",
                "format": "date-time",
                "type": "string"
              },
              "error": {
                "description": "Error message if the task failed, empty string otherwise.",
                "type": "string"
              },
              "error_code": {
                "description": "Error code if the task failed.",
                "type": "integer"
              },
              "executionTime": {
                "description": "Total execution time in milliseconds.",
                "type": "number"
              },
              "timings": {
                "description": "Detailed timing breakdown.",
                "type": "object",
                "properties": {
                  "inference": {
                    "description": "Inference time in milliseconds.",
                    "type": "number"
                  }
                }
              }
            }
          }
        }
      }
    },
    "securitySchemes": {
      "apiKeyAuth": {
        "in": "header",
        "name": "Authorization",
        "type": "apiKey"
      }
    }
  },
  "servers": [
    {
      "url": "https://api.atlascloud.ai"
    }
  ]
}

LLM-Friendly Prompt Template

# google/gemini-omni-flash/image-to-video

> A natively multimodal Google DeepMind model that animates a still image into a cinematic, sound-enabled video guided by a text prompt while preserving the source subject and composition.


## Overview

- **Submit endpoint (POST)**: `https://api.atlascloud.ai/api/v1/model/generateVideo` — start an async generation; returns a `prediction_id`
- **Poll endpoint (GET)**: `https://api.atlascloud.ai/api/v1/model/prediction/{prediction_id}` — poll this until the prediction finishes
- **Model ID**: `google/gemini-omni-flash/image-to-video`


## API Information

This model can be used via our HTTP API or more conveniently via our client libraries.
See the input and output schema below, as well as the usage examples.


### Input Schema

The API accepts the following input parameters:

- **`model`** (`string`, _required_):
  model name
  - Default: `"google/gemini-omni-flash/image-to-video"`

- **`prompt`** (`string`, _required_):
  Text prompt for generation. Describes the target content, style, camera language, or character actions. Maximum 20,000 characters.

- **`image`** (`string`, _required_):
  The image to animate into a video, used as the starting frame or motion guide. Supported formats: PNG, JPEG, JPG, WebP. Limited to 20MB. Supports both a public URL and a base64-encoded image.

- **`duration`** (`integer`, _optional_):
  The duration of the generated video in seconds.
  - Default: `10`
  - Min: 3
  - Max: 10

- **`aspect_ratio`** (`string`, _optional_):
  The aspect ratio of the generated video.
  - Default: `"16:9"`
  - Options: "16:9", "9:16"

- **`thinking_level`** (`string`, _optional_):
  Controls the amount of internal reasoning the model performs before generating a response. Higher levels may improve quality on complex tasks but increase latency.
  - Default: `"default"`
  - Options: "default", "high", "low"

- **`resolution`** (`string`, _optional_):
  The resolution of the generated video.
  - Default: `"720p"`
  - Options: "720p"

- **`seed`** (`integer`, _optional_):
  The random seed to use for the generation. -1 means a random seed will be used.
  - Default: `-1`



**Required Parameters Example**:

```json
{
  "model": "google/gemini-omni-flash/image-to-video",
  "image": "",
  "prompt": ""
}
```


**Full Example**:

```json
{
  "model": "google/gemini-omni-flash/image-to-video",
  "prompt": "",
  "image": "",
  "duration": 10,
  "aspect_ratio": "16:9",
  "thinking_level": "default",
  "resolution": "720p",
  "seed": -1
}
```


### Output Schema

The API returns the following output format:


- **`code`** (`integer`, _optional_):
  HTTP status code of the response.

- **`message`** (`string`, _optional_):
  Human-readable message; non-empty on failure.

- **`data`** (`object`, _optional_):
  - Properties:
    - **`id`** (`string`, _optional_):
      Unique identifier for the prediction.

    - **`model`** (`string`, _optional_):
      Model ID used for the prediction.

    - **`outputs`** (`array[string]`, _optional_):
      Array of URLs to the generated content. Null when status is not completed.

    - **`urls`** (`object`, _optional_):
      Object containing related API endpoints.
      - Properties:
        - **`get`** (`string`, _optional_):
          URL to poll for the prediction result.


    - **`has_nsfw_contents`** (`array[boolean]`, _optional_):
      Array of boolean values indicating NSFW detection for each output. Null if not applicable.

    - **`status`** (`string`, _optional_):
      Status of the task: created, processing, completed, timeout, or failed.

    - **`created_at`** (`string`, _optional_):
      ISO timestamp of when the request was created (e.g., "2023-04-01T12:34:56.789Z").

    - **`error`** (`string`, _optional_):
      Error message if the task failed, empty string otherwise.

    - **`error_code`** (`integer`, _optional_):
      Error code if the task failed.

    - **`executionTime`** (`number`, _optional_):
      Total execution time in milliseconds.

    - **`timings`** (`object`, _optional_):
      Detailed timing breakdown.
      - Properties:
        - **`inference`** (`number`, _optional_):
          Inference time in milliseconds.





**Example Response**:

```json
{
  "code": 0,
  "message": "",
  "data": {
    "id": "",
    "model": "",
    "outputs": [
      ""
    ],
    "urls": {
      "get": ""
    },
    "has_nsfw_contents": [],
    "status": "",
    "created_at": "",
    "error": "",
    "error_code": 0,
    "executionTime": 0,
    "timings": {
      "inference": 0
    }
  }
}
```


## Usage Examples

### cURL

```bash
# Step 1: Start generation (async)
curl -X POST "https://api.atlascloud.ai/api/v1/model/generateVideo" \
  -H "Authorization: Bearer $ATLASCLOUD_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{
  "model": "google/gemini-omni-flash/image-to-video",
  "prompt": "",
  "image": "",
  "duration": 10,
  "aspect_ratio": "16:9",
  "thinking_level": "default",
  "resolution": "720p",
  "seed": -1
}'

# Response will contain: {"code": 200, "data": {"id": "prediction_id", "status": "processing"}}

# Step 2: Poll for result (replace {prediction_id} with the id returned above)
curl -X GET "https://api.atlascloud.ai/api/v1/model/prediction/{prediction_id}" \
  -H "Authorization: Bearer $ATLASCLOUD_API_KEY"

# Keep polling until status is "completed", "succeeded" or "failed"
# When completed, outputs will contain the generated content URL(s)
```

## Additional Resources

### Documentation

- [Model Playground](https://www.atlascloud.ai/models/google/gemini-omni-flash/image-to-video)

Gemini Omni Flash — Image to Video

Model ID: google/gemini-omni-flash/image-to-video

Gemini Omni Flash is Google DeepMind's high-performance, natively multimodal model built for high-speed video generation, editing, and cinematic control. This variant accepts an image plus a text prompt, animating a still image into a coherent, sound-enabled video guided by your instructions.

Overview

Gemini Omni Flash (gemini-omni-flash-preview) was introduced by Google alongside Nano Banana 2 Lite as a new generation of multimodal media models. Unlike traditional pipelines that stitch modalities together, Omni Flash is a single transformer that processes text, images, audio, and video simultaneously, producing output that is more cohesive, consistent, and controllable.

What sets it apart from earlier video models (such as the Veo family) is that it natively generates audio with every video — dialogue, ambience, music, and sound design are produced together with the picture rather than added afterward. The model is grounded in Gemini's real-world knowledge, so it reasons about physics, narrative logic, culture, and visual composition to produce results that feel intentional and cinematic. Generated media carries an invisible SynthID watermark.

AtlasCloud exposes Gemini Omni Flash through four endpoints — text-to-video, image-to-video, reference-to-video, and video-edit. All four route to the same gemini-omni-flash-preview model and differ only by the input modality they accept, corresponding to the model's task parameter (text_to_video, image_to_video, reference_to_video, edit). This endpoint maps to image_to_video.

Inputs

This variant takes an image and a text prompt. The image is used as the starting frame or motion guide, while the prompt describes how the scene should move, evolve, and sound. This is well suited to bringing a specific photo, illustration, or design to life while preserving its subject and composition.

Image — PNG, JPEG, JPG, or WebP, up to 20 MB. Supplied as a public URL or a base64-encoded image.
Prompt — Natural-language description of motion, camera language, mood, and audio (up to 20,000 characters).
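For local files, the base64 route can be sketched as follows. Several details are assumptions because the page does not specify them: whether the API expects a bare base64 string or a data URI, and whether "20 MB" means 20 × 1024 × 1024 bytes. `encode_image` is a hypothetical helper, and it checks only the file extension, not the actual file contents.

```python
import base64
import os

MAX_IMAGE_BYTES = 20 * 1024 * 1024  # assumption: "20 MB" read as 20 MiB
ALLOWED_EXTENSIONS = {".png", ".jpeg", ".jpg", ".webp"}


def encode_image(filename, raw_bytes):
    """Check extension and size, then return the image as a bare base64 string."""
    extension = os.path.splitext(filename)[1].lower()
    if extension not in ALLOWED_EXTENSIONS:
        raise ValueError(f"Unsupported image format: {extension or 'none'}")
    if len(raw_bytes) > MAX_IMAGE_BYTES:
        raise ValueError("Image is larger than 20 MB")
    return base64.b64encode(raw_bytes).decode("ascii")
```

If the bare string is rejected, uploading the file through the upload endpoint and passing the returned URL is the documented alternative.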

Key Capabilities

Image-grounded animation — Preserves the subject, style, and composition of the source image while adding motion.
Rich prompt understanding — Direct camera movement, action, mood, style, and audio in a single prompt of up to 20,000 characters.
Native audio generation — Every clip is rendered with a synchronized soundtrack (speech, music, effects) driven by your description.
World-grounded realism — Physics, motion, and scene dynamics informed by Gemini's real-world knowledge.
Adjustable reasoning — The thinking_level control trades latency for quality on complex prompts.
Reproducible results — Set a fixed seed to reproduce or iterate on a specific generation.
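To iterate on a specific generation, the seed has to be known up front, which the default of `-1` does not allow. One possible approach is to choose the seed client-side and record it. In this sketch `with_fixed_seed` is a hypothetical helper, and the seed range of 0 to 2^31 - 1 is an assumption; the accepted range is not documented on this page.

```python
import random


def with_fixed_seed(base_body, seed=None):
    """Return a copy of `base_body` with a concrete seed instead of -1."""
    body = dict(base_body)
    if seed is None:
        # Pick the seed locally so it can be logged and reused later.
        seed = random.randint(0, 2**31 - 1)
    body["seed"] = seed
    return body


base = {
    "model": "google/gemini-omni-flash/image-to-video",
    "image": "https://example.com/input.png",  # placeholder URL
    "prompt": "The camera slowly orbits the subject",
    "seed": -1,
}
first = with_fixed_seed(base)
# Reuse the same seed while changing only the prompt.
variant = with_fixed_seed(dict(base, prompt="The camera slowly pulls back"), seed=first["seed"])
```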

Input Parameters

Parameter	Type	Required	Default	Description
`model`	string	Yes	`google/gemini-omni-flash/image-to-video`	Model identifier
`prompt`	string	Yes	—	Text description of the motion and scene. Max 20,000 characters.
`image`	string (uri)	Yes	—	Image to animate, used as the starting frame or motion guide. PNG/JPEG/JPG/WebP, ≤20 MB. URL or base64.
`duration`	integer	No	`10`	Video length in seconds. Range: `3`–`10`.
`aspect_ratio`	string	No	`16:9`	Output aspect ratio. Enum: `16:9`, `9:16`.
`thinking_level`	string	No	`default`	Internal reasoning effort. Enum: `default`, `high`, `low`.
`resolution`	string	No	`720p`	Output resolution. Enum: `720p`.
`seed`	integer	No	`-1`	Random seed for reproducibility. `-1` uses a random seed.

Use Cases

Photo-to-motion — Animate product shots, portraits, or artwork into short living scenes.
Concept-to-video — Turn a single key frame or design mockup into a moving preview.
Social content — Produce eye-catching short-form clips from a single hero image.
Marketing assets — Bring campaign visuals to life with motion and sound.
Prototyping — Test how a static composition reads once it moves, before full production.

Pricing

Billing is based on the duration of the generated video, charged at a flat per-second rate.

SKU	Rate
Per second of output	$0.13

Formula: max(3, duration) × $0.13

Billing is per second, with a 3-second minimum — durations below 3s are billed as 3s.
Example: a 10-second video costs 10 × $0.13 = $1.30.
Example: a 3-second video costs 3 × $0.13 = $0.39.
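The formula can be written as a small estimator. This is a sketch that uses the rate and minimum stated above; `estimate_cost_usd` is a hypothetical helper, and the arithmetic is done in cents to avoid accumulating floating-point error.

```python
RATE_CENTS_PER_SECOND = 13  # $0.13 per second of output
MIN_BILLED_SECONDS = 3


def estimate_cost_usd(duration_seconds):
    """Return the estimated charge in dollars for one generated video."""
    billed_seconds = max(MIN_BILLED_SECONDS, duration_seconds)
    return billed_seconds * RATE_CENTS_PER_SECOND / 100


print(estimate_cost_usd(10))  # 1.3
print(estimate_cost_usd(3))   # 0.39
```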
