Grok Imagine Video v1.5 Image-to-Video API by xAI

xai/grok-imagine-video-v1.5/image-to-video

Image-to-video

xAI Grok Imagine Video v1.5 animates a starting-frame image using natural-language motion prompts at 480p, 720p, or 1080p.

Input

If a request is blocked due to a violation of the xAI Terms of Service, the associated charges will still be billed to your account.

Each run costs $0.08. With $10 you can run the model approximately 125 times.
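The budget arithmetic above can be checked with a short sketch. It assumes a flat $0.08 per run, as quoted on this page; `runs_for_budget` is a hypothetical helper name, not part of any Atlas Cloud SDK, and actual billing may vary with the chosen settings.

```python
from decimal import Decimal

# Per-run price quoted on this page (assumed flat for this sketch).
PRICE_PER_RUN = Decimal("0.08")


def runs_for_budget(budget_usd: str, price: Decimal = PRICE_PER_RUN) -> int:
    """Return how many whole runs a budget covers at a flat per-run price."""
    # Decimal avoids binary floating-point rounding on currency values.
    return int(Decimal(budget_usd) // price)


print(runs_for_budget("10"))  # 125
```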

Code example
import os
import time

import requests

# Read the API key from the environment. A literal "$ATLASCLOUD_API_KEY"
# is not expanded inside a Python string.
API_KEY = os.environ["ATLASCLOUD_API_KEY"]
headers = {
    "Content-Type": "application/json",
    "Authorization": f"Bearer {API_KEY}"
}

# Step 1: Start video generation
generate_url = "https://api.atlascloud.ai/api/v1/model/generateVideo"
data = {
    "model": "xai/grok-imagine-video-v1.5/image-to-video",  # Required. Model name
    "prompt": "A beautiful sunset over the ocean with gentle waves",  # Required. Natural-language motion prompt
    "image_url": "https://example.com/start-frame.png",  # Required. Placeholder; use a public HTTPS URL or base64 data URI of the starting-frame image (JPEG, PNG, or WebP)
    "duration": 8,  # Length of generated video in seconds. (min: 1, max: 15)
    "resolution": "720p",  # Output resolution. options: 480p | 720p | 1080p
    "aspect_ratio": "16:9",  # Output aspect ratio
}

generate_response = requests.post(generate_url, headers=headers, json=data, timeout=30)
generate_response.raise_for_status()  # Surface HTTP errors before parsing the body
generate_result = generate_response.json()
prediction_id = generate_result["data"]["id"]

# Step 2: Poll for result
poll_url = f"https://api.atlascloud.ai/api/v1/model/prediction/{prediction_id}"

def check_status():
    while True:
        response = requests.get(poll_url, headers=headers, timeout=30)
        result = response.json()
        status = result["data"]["status"]

        if status in ["completed", "succeeded"]:
            print("Generated video:", result["data"]["outputs"][0])
            return result["data"]["outputs"][0]
        elif status == "failed":
            # .get() avoids a KeyError when the error field is absent
            raise Exception(result["data"].get("error") or "Generation failed")
        else:
            # Still processing, wait 2 seconds
            time.sleep(2)

video_url = check_status()

Install

Install the required package for your programming language.

pip install requests

Authentication

All API requests require authentication with an API key. You can get your API key from the Atlas Cloud dashboard.

export ATLASCLOUD_API_KEY="your-api-key-here"

HTTP headers

import os

API_KEY = os.environ.get("ATLASCLOUD_API_KEY")
headers = {
    "Content-Type": "application/json",
    "Authorization": f"Bearer {API_KEY}"
}

Keep your API key secure

Never expose your API key in client-side code or public repositories. Use environment variables or a backend proxy.
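As a minimal sketch of the environment-variable approach, the helper below builds the request headers and fails fast when the key is missing, so a request is never sent with an empty token. `build_headers` is a hypothetical name introduced here for illustration.

```python
import os


def build_headers(env_var: str = "ATLASCLOUD_API_KEY") -> dict:
    """Build request headers from an environment variable, failing fast if unset."""
    api_key = os.environ.get(env_var)
    if not api_key:
        raise RuntimeError(f"Set the {env_var} environment variable first")
    return {
        "Content-Type": "application/json",
        "Authorization": f"Bearer {api_key}",
    }
```

The returned dictionary can be passed as `headers=` to `requests.post` or `requests.get`.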

Submit a request

import os

import requests

url = "https://api.atlascloud.ai/api/v1/model/generateVideo"
headers = {
    "Content-Type": "application/json",
    # Read the key from the environment; "$ATLASCLOUD_API_KEY" is not expanded inside a Python string
    "Authorization": f"Bearer {os.environ['ATLASCLOUD_API_KEY']}"
}
data = {
    "model": "your-model",
    "prompt": "A beautiful landscape"
}

response = requests.post(url, headers=headers, json=data)
print(response.json())

Submit a request

Submit an asynchronous generation request. The API returns a prediction ID that you can use to check the status and retrieve the result.

POST /api/v1/model/generateVideo

Request body

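import os

import requests

url = "https://api.atlascloud.ai/api/v1/model/generateVideo"
headers = {
    "Content-Type": "application/json",
    # Read the key from the environment; "$ATLASCLOUD_API_KEY" is not expanded inside a Python string
    "Authorization": f"Bearer {os.environ['ATLASCLOUD_API_KEY']}"
}

data = {
    "model": "xai/grok-imagine-video-v1.5/image-to-video",
    "prompt": "A beautiful sunset over the ocean with gentle waves",
    # image_url is required by the input schema; this URL is a placeholder
    "image_url": "https://example.com/start-frame.png"
}

response = requests.post(url, headers=headers, json=data)
result = response.json()

print(f"Prediction ID: {result['data']['id']}")
print(f"Status: {result['data']['status']}")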

Response

{
  "code": 200,
  "data": {
    "id": "pred_abc123",
    "status": "processing",
    "model": "model-name",
    "created_at": "2025-01-01T00:00:00Z"
  }
}
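A small sketch of how the prediction ID might be pulled out of a submit response like the one above, with a clear error when the body has a different shape. `prediction_id_from` is a hypothetical helper, and the shape of error responses is an assumption, since this page only documents the success case.

```python
def prediction_id_from(body: dict) -> str:
    """Extract data.id from a submit response, or raise with the full body."""
    data = body.get("data") or {}
    prediction_id = data.get("id")
    if not prediction_id:
        # Assumption: a body without data.id means the request was not accepted.
        raise RuntimeError(f"Unexpected submit response: {body}")
    return prediction_id


sample = {"code": 200, "data": {"id": "pred_abc123", "status": "processing"}}
print(prediction_id_from(sample))  # pred_abc123
```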

Check status

Query the prediction endpoint to check the current status of your request.

GET /api/v1/model/prediction/{prediction_id}

Polling example

import os
import time

import requests

prediction_id = "pred_abc123"
url = f"https://api.atlascloud.ai/api/v1/model/prediction/{prediction_id}"
# Read the key from the environment; "$ATLASCLOUD_API_KEY" is not expanded inside a Python string
headers = { "Authorization": f"Bearer {os.environ['ATLASCLOUD_API_KEY']}" }

while True:
    response = requests.get(url, headers=headers)
    result = response.json()
    status = result["data"]["status"]
    print(f"Status: {status}")

    if status in ["completed", "succeeded"]:
        output_url = result["data"]["outputs"][0]
        print(f"Output URL: {output_url}")
        break
    elif status == "failed":
        print(f"Error: {result['data'].get('error', 'Unknown')}")
        break

    time.sleep(3)
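The loop above runs until the prediction reaches a terminal status. A hedged variant with an overall deadline is sketched below. `wait_for_prediction` and its parameters are hypothetical names, and the 600-second default is an arbitrary assumption rather than a documented limit. The HTTP call is passed in as a callable so the logic can be exercised without network access.

```python
import time
from typing import Callable


def wait_for_prediction(
    fetch: Callable[[], dict],
    timeout_s: float = 600.0,   # assumed upper bound, not a documented value
    interval_s: float = 3.0,
    sleep: Callable[[float], None] = time.sleep,
) -> dict:
    """Poll until a terminal status and return the `data` object.

    `fetch` is any zero-argument callable returning the parsed JSON body
    of the prediction endpoint.
    """
    deadline = time.monotonic() + timeout_s
    while True:
        data = fetch()["data"]
        status = data["status"]
        if status in ("completed", "succeeded"):
            return data
        if status == "failed":
            raise RuntimeError(data.get("error") or "Generation failed")
        if time.monotonic() >= deadline:
            raise TimeoutError(f"Prediction still '{status}' after {timeout_s}s")
        sleep(interval_s)
```

With `requests`, the callable could be `lambda: requests.get(url, headers=headers, timeout=30).json()`.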

Status values

processing: The request is still being processed.

completed: Generation is complete. Outputs are available.

succeeded: Generation succeeded. Outputs are available.

failed: Generation failed. Check the error field.

Completed response

{
  "data": {
    "id": "pred_abc123",
    "status": "completed",
    "outputs": [
      "https://storage.atlascloud.ai/outputs/result.mp4"
    ],
    "metrics": {
      "predict_time": 45.2
    },
    "created_at": "2025-01-01T00:00:00Z",
    "completed_at": "2025-01-01T00:00:10Z"
  }
}
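As an illustration of reading a completed response, the sketch below takes the first output URL and computes the wall-clock time between `created_at` and `completed_at`. The helper names are hypothetical, and the sketch assumes both timestamps are ISO 8601 strings ending in `Z`, as in the sample.

```python
from datetime import datetime


def _parse_utc(timestamp: str) -> datetime:
    """Parse an ISO 8601 timestamp, mapping a trailing 'Z' to a UTC offset."""
    return datetime.fromisoformat(timestamp.replace("Z", "+00:00"))


def elapsed_seconds(data: dict) -> float:
    """Seconds between created_at and completed_at of a prediction."""
    delta = _parse_utc(data["completed_at"]) - _parse_utc(data["created_at"])
    return delta.total_seconds()


sample = {
    "status": "completed",
    "outputs": ["https://storage.atlascloud.ai/outputs/result.mp4"],
    "created_at": "2025-01-01T00:00:00Z",
    "completed_at": "2025-01-01T00:00:10Z",
}
print(sample["outputs"][0])
print(elapsed_seconds(sample))  # 10.0
```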

Upload files

Upload files to Atlas Cloud storage and get a URL that you can use in your API requests. Send the file as multipart/form-data.

POST /api/v1/model/uploadMedia

Upload example

import os

import requests

url = "https://api.atlascloud.ai/api/v1/model/uploadMedia"
# Read the key from the environment; "$ATLASCLOUD_API_KEY" is not expanded inside a Python string
headers = { "Authorization": f"Bearer {os.environ['ATLASCLOUD_API_KEY']}" }

with open("image.png", "rb") as f:
    files = {"file": ("image.png", f, "image/png")}
    response = requests.post(url, headers=headers, files=files)

result = response.json()
download_url = result["data"]["download_url"]
print(f"File URL: {download_url}")

Response

{
  "data": {
    "download_url": "https://storage.atlascloud.ai/uploads/abc123/image.png",
    "file_name": "image.png",
    "content_type": "image/png",
    "size": 1024000
  }
}
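The upload response can feed directly into a generation request. The sketch below builds a request body from it, on the assumption that `download_url` is accepted as `image_url` (the page says the returned URL can be used in API requests, but does not show this pairing explicitly). `payload_from_upload` is a hypothetical helper.

```python
MODEL_ID = "xai/grok-imagine-video-v1.5/image-to-video"


def payload_from_upload(upload_response: dict, prompt: str) -> dict:
    """Build a generateVideo request body from an uploadMedia response."""
    return {
        "model": MODEL_ID,
        "prompt": prompt,
        # Assumption: the storage URL is usable as the starting-frame image.
        "image_url": upload_response["data"]["download_url"],
    }


upload_response = {
    "data": {
        "download_url": "https://storage.atlascloud.ai/uploads/abc123/image.png",
        "file_name": "image.png",
    }
}
print(payload_from_upload(upload_response, "Slow zoom toward the horizon"))
```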

Input schema

The following parameters are accepted in the request body.

Total: 6, Required: 3, Optional: 3

model (string, required)

Model name.

Default: "xai/grok-imagine-video-v1.5/image-to-video"

prompt (string, required)

Natural-language motion prompt. The starting frame is taken from the image.

image_url (string, required)

Public HTTPS URL or base64 data URI of the starting-frame image (JPEG, PNG, or WebP).

duration (integer)

Length of generated video in seconds. Range: 1–15.

Default: 8, Min: 1, Max: 15

resolution (string)

Output resolution.

Default: "720p"

Options: 480p, 720p, 1080p

aspect_ratio (string)

Output aspect ratio. The default matches the input image; specifying a different value stretches the image.

Default: "16:9"

Options: 1:1, 16:9, 9:16, 4:3, 3:4, 3:2, 2:3
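Since `image_url` also accepts a base64 data URI, a local file can be inlined instead of hosted. The following is a minimal sketch using only the standard library. `to_data_uri` is a hypothetical helper, the extension-to-MIME mapping is an assumption based on the three formats listed above, and any size limit on inline images is not documented here.

```python
import base64
from pathlib import Path

# Formats named in the image_url description; mapping by file extension.
_MIME_BY_SUFFIX = {
    ".jpg": "image/jpeg",
    ".jpeg": "image/jpeg",
    ".png": "image/png",
    ".webp": "image/webp",
}


def to_data_uri(path: str) -> str:
    """Encode a local JPEG, PNG, or WebP file as a base64 data URI."""
    file_path = Path(path)
    mime = _MIME_BY_SUFFIX.get(file_path.suffix.lower())
    if mime is None:
        raise ValueError(f"Unsupported image type: {file_path.suffix!r}")
    encoded = base64.b64encode(file_path.read_bytes()).decode("ascii")
    return f"data:{mime};base64,{encoded}"
```

The result can be assigned to `image_url` in the request body in place of an HTTPS URL.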

Example request body

{
  "model": "xai/grok-imagine-video-v1.5/image-to-video",
  "prompt": "A beautiful landscape",
  "image_url": "example_image_url",
  "duration": 8,
  "resolution": "720p",
  "aspect_ratio": "16:9"
}
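A request body like the one above can be checked locally before it is sent. The sketch below mirrors the constraints in the input schema (required fields, duration range, and the two option lists). `validate_input` is a hypothetical helper, and server-side validation may be stricter or differ in detail.

```python
from typing import List

REQUIRED_FIELDS = ("model", "prompt", "image_url")
RESOLUTIONS = {"480p", "720p", "1080p"}
ASPECT_RATIOS = {"1:1", "16:9", "9:16", "4:3", "3:4", "3:2", "2:3"}


def validate_input(payload: dict) -> List[str]:
    """Return a list of problems found in a request body (empty if none)."""
    errors = []
    for key in REQUIRED_FIELDS:
        value = payload.get(key)
        if not isinstance(value, str) or not value:
            errors.append(f"{key} is required and must be a non-empty string")
    # Optional fields fall back to the documented defaults.
    duration = payload.get("duration", 8)
    if isinstance(duration, bool) or not isinstance(duration, int) or not 1 <= duration <= 15:
        errors.append("duration must be an integer from 1 to 15")
    if payload.get("resolution", "720p") not in RESOLUTIONS:
        errors.append("resolution must be one of 480p, 720p, 1080p")
    if payload.get("aspect_ratio", "16:9") not in ASPECT_RATIOS:
        errors.append("aspect_ratio is not one of the listed options")
    return errors
```

An empty list means the body passes these local checks; it does not guarantee the API will accept it.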

Output schema

The API returns a prediction response containing the generated output URLs.

id (string)

Unique identifier for the prediction.

model (string)

Model ID used for the prediction.

status (string)

Status of the task: created, processing, completed, or failed.

outputs (array)

Array of URLs to the generated video (empty when status is not completed).

created_at (string)

ISO timestamp of when the request was created.

Example response

{
  "id": "pred_abc123",
  "status": "completed",
  "model": "model-name",
  "outputs": [
    "https://storage.atlascloud.ai/outputs/result.mp4"
  ],
  "metrics": {
    "predict_time": 45.2
  },
  "created_at": "2025-01-01T00:00:00Z",
  "completed_at": "2025-01-01T00:00:10Z"
}

Atlas Cloud Skills

Atlas Cloud Skills integrates more than 400 AI models directly into your AI coding assistant. Install it with one command, then use natural language to generate images, create videos, and chat with LLMs.

Compatible clients

Claude Code

OpenAI Codex

Gemini CLI

Cursor

Windsurf

VS Code

Trae

GitHub Copilot

Cline

Roo Code

Amp

Goose

Replit

40+ compatible clients

Install

npx skills add AtlasCloudAI/atlas-cloud-skills

Configure API key

Get your API key from the Atlas Cloud dashboard and set it as an environment variable.

export ATLASCLOUD_API_KEY="your-api-key-here"

Features

After installation, you can use natural language in your AI assistant to access all Atlas Cloud models.

Image generation: Generate images with models such as Nano Banana 2, Z-Image, and more.

Video creation: Create videos from text or images with Kling, Vidu, Veo, and others.

LLM chat: Chat with Qwen, DeepSeek, and other large language models.

Media upload: Upload local files for image-editing and image-to-video workflows.

Learn more

github.com/AtlasCloudAI/atlas-cloud-skills

MCP Server

The Atlas Cloud MCP Server connects your IDE to more than 400 AI models through the Model Context Protocol. It works with any MCP-compatible client.

Compatible clients

Cursor

VS Code

Windsurf

Claude Code

OpenAI Codex

Gemini CLI

Cline

Roo Code

100+ compatible clients

Install

npx -y atlascloud-mcp

Configuration

Add the following configuration to your IDE's MCP configuration file.

{
  "mcpServers": {
    "atlascloud": {
      "command": "npx",
      "args": [
        "-y",
        "atlascloud-mcp"
      ],
      "env": {
        "ATLASCLOUD_API_KEY": "your-api-key-here"
      }
    }
  }
}
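If the IDE's MCP configuration file already lists other servers, the `atlascloud` entry needs to be merged in rather than pasted over them. A minimal sketch of that merge follows. `add_atlascloud_server` is a hypothetical helper, and the location of the configuration file depends on the client, so reading and writing the file is left out.

```python
import json


def add_atlascloud_server(config: dict, api_key: str) -> dict:
    """Return a copy of an MCP config with the atlascloud server added."""
    merged = dict(config)
    servers = dict(merged.get("mcpServers", {}))  # keep existing servers
    servers["atlascloud"] = {
        "command": "npx",
        "args": ["-y", "atlascloud-mcp"],
        "env": {"ATLASCLOUD_API_KEY": api_key},
    }
    merged["mcpServers"] = servers
    return merged


existing = {"mcpServers": {"other": {"command": "other-server"}}}
print(json.dumps(add_atlascloud_server(existing, "your-api-key-here"), indent=2))
```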

Available tools

atlas_generate_image: Generate images from text prompts.

atlas_generate_video: Create videos from text or images.

atlas_chat: Chat with large language models.

atlas_list_models: Browse more than 400 available AI models.

atlas_quick_generate: One-step content creation with automatic model selection.

atlas_upload_media: Upload local files for API workflows.

Learn more

github.com/AtlasCloudAI/mcp-server

API Schema

{
  "info": {
    "title": "AtlasCloud API",
    "version": "1.0.0",
    "description": "The AtlasCloud API."
  },
  "paths": {
    "/api/v1/model/generateVideo": {
      "post": {
        "responses": {
          "200": {
            "content": {
              "application/json": {
                "schema": {
                  "$ref": "#/components/schemas/PredictionResponse"
                }
              }
            },
            "description": "The request status."
          }
        },
        "requestBody": {
          "content": {
            "application/json": {
              "schema": {
                "$ref": "#/components/schemas/Input"
              }
            }
          },
          "required": true
        }
      },
      "x-api-name": "model_run"
    },
    "/api/v1/model/prediction/{request_id}": {
      "get": {
        "responses": {
          "200": {
            "content": {
              "application/json": {
                "schema": {
                  "$ref": "#/components/schemas/PredictionResponse"
                }
              }
            },
            "description": "Result of the request."
          }
        },
        "parameters": [
          {
            "in": "path",
            "name": "request_id",
            "schema": {
              "type": "string",
              "description": "Request ID"
            },
            "required": true
          }
        ]
      },
      "x-api-name": "model_result"
    }
  },
  "openapi": "3.0.0",
  "servers": [
    {
      "url": "https://api.atlascloud.ai"
    }
  ],
  "components": {
    "schemas": {
      "Input": {
        "type": "object",
        "required": [
          "model",
          "prompt",
          "image_url"
        ],
        "properties": {
          "model": {
            "type": "string",
            "description": "Model name.",
            "default": "xai/grok-imagine-video-v1.5/image-to-video"
          },
          "prompt": {
            "type": "string",
            "description": "Natural-language motion prompt. The starting frame is taken from the image."
          },
          "image_url": {
            "type": "string",
            "description": "Public HTTPS URL or base64 data URI of the starting-frame image (JPEG, PNG, or WebP)."
          },
          "duration": {
            "type": "integer",
            "default": 8,
            "minimum": 1,
            "maximum": 15,
            "description": "Length of generated video in seconds. Range: 1–15."
          },
          "resolution": {
            "type": "string",
            "default": "720p",
            "enum": [
              "480p",
              "720p",
              "1080p"
            ],
            "description": "Output resolution."
          },
          "aspect_ratio": {
            "type": "string",
            "default": "16:9",
            "enum": [
              "1:1",
              "16:9",
              "9:16",
              "4:3",
              "3:4",
              "3:2",
              "2:3"
            ],
            "description": "Output aspect ratio. The default matches the input image; specifying a different value stretches the image."
          }
        },
        "x-order-properties": [
          "model",
          "prompt",
          "image_url",
          "duration",
          "resolution",
          "aspect_ratio"
        ]
      },
      "PredictionResponse": {
        "type": "object",
        "properties": {
          "id": {
            "type": "string",
            "description": "Unique identifier for the prediction."
          },
          "urls": {
            "type": "object",
            "description": "Object containing related API endpoints."
          },
          "model": {
            "type": "string",
            "description": "Model ID used for the prediction."
          },
          "status": {
            "type": "string",
            "description": "Status of the task: created, processing, completed, or failed."
          },
          "outputs": {
            "type": "array",
            "items": {
              "type": "string"
            },
            "description": "Array of URLs to the generated video (empty when status is not completed)."
          },
          "created_at": {
            "type": "string",
            "format": "date-time",
            "description": "ISO timestamp of when the request was created."
          },
          "has_nsfw_contents": {
            "type": "array",
            "items": {
              "type": "boolean"
            },
            "description": "Array of boolean values indicating NSFW detection for each output."
          }
        }
      }
    },
    "securitySchemes": {
      "apiKeyAuth": {
        "in": "header",
        "name": "Authorization",
        "type": "apiKey"
      }
    }
  }
}
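The schema's `has_nsfw_contents` array carries one flag per output. The sketch below shows one way a client might drop flagged outputs, assuming the flags align with `outputs` by index as the field description suggests. `unflagged_outputs` is a hypothetical helper, and treating a missing flag as "not flagged" is a choice made for this example.

```python
from typing import List


def unflagged_outputs(prediction: dict) -> List[str]:
    """Return output URLs whose has_nsfw_contents flag is not set."""
    outputs = prediction.get("outputs") or []
    flags = prediction.get("has_nsfw_contents") or []
    kept = []
    for index, url in enumerate(outputs):
        # Assumption: a missing flag means the output was not flagged.
        flagged = index < len(flags) and bool(flags[index])
        if not flagged:
            kept.append(url)
    return kept


prediction = {
    "outputs": ["https://example.com/a.mp4", "https://example.com/b.mp4"],
    "has_nsfw_contents": [False, True],
}
print(unflagged_outputs(prediction))  # ['https://example.com/a.mp4']
```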

LLM-Friendly Prompt Template

# xai/grok-imagine-video-v1.5/image-to-video

> xAI Grok Imagine Video v1.5 animates a starting frame image with natural-language motion prompts at 480p/720p/1080P.


## Overview

- **Submit endpoint (POST)**: `https://api.atlascloud.ai/api/v1/model/generateVideo` — start an async generation; returns a `prediction_id`
- **Poll endpoint (GET)**: `https://api.atlascloud.ai/api/v1/model/prediction/{prediction_id}` — poll this until the prediction finishes
- **Model ID**: `xai/grok-imagine-video-v1.5/image-to-video`


## API Information

This model can be used via our HTTP API or more conveniently via our client libraries.
See the input and output schema below, as well as the usage examples.


### Input Schema

The API accepts the following input parameters:

- **`model`** (`string`, _required_):
  Model name.
  - Default: `"xai/grok-imagine-video-v1.5/image-to-video"`

- **`prompt`** (`string`, _required_):
  Natural-language motion prompt. The starting frame is taken from the image.

- **`image_url`** (`string`, _required_):
  Public HTTPS URL or base64 data URI of the starting-frame image (JPEG, PNG, or WebP).

- **`duration`** (`integer`, _optional_):
  Length of generated video in seconds. Range: 1–15.
  - Default: `8`
  - Min: 1
  - Max: 15

- **`resolution`** (`string`, _optional_):
  Output resolution.
  - Default: `"720p"`
  - Options: "480p", "720p", "1080p"

- **`aspect_ratio`** (`string`, _optional_):
  Output aspect ratio. The default matches the input image; specifying a different value stretches the image.
  - Default: `"16:9"`
  - Options: "1:1", "16:9", "9:16", "4:3", "3:4", "3:2", "2:3"



**Required Parameters Example**:

```json
{
  "model": "xai/grok-imagine-video-v1.5/image-to-video",
  "prompt": "",
  "image_url": ""
}
```


**Full Example**:

```json
{
  "model": "xai/grok-imagine-video-v1.5/image-to-video",
  "prompt": "",
  "image_url": "",
  "duration": 8,
  "resolution": "720p",
  "aspect_ratio": "16:9"
}
```


### Output Schema

The API returns the following output format:


- **`id`** (`string`, _optional_):
  Unique identifier for the prediction.

- **`urls`** (`object`, _optional_):
  Object containing related API endpoints.

- **`model`** (`string`, _optional_):
  Model ID used for the prediction.

- **`status`** (`string`, _optional_):
  Status of the task: created, processing, completed, or failed.

- **`outputs`** (`array[string]`, _optional_):
  Array of URLs to the generated video (empty when status is not completed).

- **`created_at`** (`string`, _optional_):
  ISO timestamp of when the request was created.

- **`has_nsfw_contents`** (`array[boolean]`, _optional_):
  Array of boolean values indicating NSFW detection for each output.



**Example Response**:

```json
{
  "id": "",
  "urls": {},
  "model": "",
  "status": "",
  "outputs": [
    ""
  ],
  "created_at": "",
  "has_nsfw_contents": []
}
```


## Usage Examples

### cURL

```bash
# Step 1: Start generation (async)
curl -X POST "https://api.atlascloud.ai/api/v1/model/generateVideo" \
  -H "Authorization: Bearer $ATLASCLOUD_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{
  "model": "xai/grok-imagine-video-v1.5/image-to-video",
  "prompt": "",
  "image_url": "",
  "duration": 8,
  "resolution": "720p",
  "aspect_ratio": "16:9"
}'

# Response will contain: {"code": 200, "data": {"id": "prediction_id", "status": "processing"}}

# Step 2: Poll for result (replace {prediction_id} with the id returned above)
curl -X GET "https://api.atlascloud.ai/api/v1/model/prediction/{prediction_id}" \
  -H "Authorization: Bearer $ATLASCLOUD_API_KEY"

# Keep polling until status is "completed", "succeeded" or "failed"
# When completed, outputs will contain the generated content URL(s)
```

## Additional Resources

### Documentation

- [Model Playground](https://www.atlascloud.ai/models/xai/grok-imagine-video-v1.5/image-to-video)

Slow cinematic fly-through approaching a gigantic black hole. The camera begins with a wide shot of the surrounding galaxy, then gradually descends toward the glowing accretion disk. Massive rings of plasma rotate rapidly around the event horizon, while distant stars bend and warp through gravitational lensing. The camera subtly tilts and orbits around the black hole, emphasizing its immense scale. Tiny particles drift past the lens, creating depth and realism. Dynamic light scattering, cosmic dust trails, slow motion, breathtaking sci-fi spectacle, ultra realistic space environment.

1. Introduction

Grok Imagine Video V1.5 is a frontier-tier image-to-video generation model developed by xAI that animates static images into short clips of up to 15 seconds with natively generated, synchronized audio — including dialogue, lip-sync, sound effects, and ambient music — produced in a single inference pass.

This README applies to the following API model identifier:

xai/grok-imagine-video-v1.5/image-to-video

Released in preview around late May 2026, Grok Imagine Video V1.5 debuted at the top of the Artificial Analysis Video Arena Image-to-Video leaderboard with a 1404 ±6 Elo rating, surpassing ByteDance Seedance 2.0 and other established competitors. Built on xAI's Aurora engine — an autoregressive mixture-of-experts (MoE) network that jointly models text, image, video, and audio tokens — the model represents a departure from the diffusion-transformer paradigm used by Sora and Veo, enabling tightly coupled audiovisual generation with competitive cost and latency characteristics.

2. Key Features

Native Synchronized Audio Generation: Audio (dialogue, lip-sync, SFX, ambient sound, music) is generated jointly with video tokens in a single inference pass rather than dubbed in post-processing. This produces event-aligned sound effects and natural lip-sync without requiring separate audio pipelines.
Aurora Autoregressive MoE Architecture: Unlike diffusion-transformer competitors, V1.5 uses an autoregressive mixture-of-experts network trained to predict next tokens from interleaved multimodal data. This unified token-space approach is what enables single-pass audio-video coherence.
Granular Duration Control (1–15 seconds): Clips can be requested at any integer second from 1 to 15, supporting precise targeting for short-form formats. V1.5 extends the prior 10-second limit by 50% while maintaining temporal coherence across the longer window.
Improved Physics and Photorealism: V1.5 introduces measurable gains in cloth dynamics, water simulation, hair motion, and object interaction. Subject deformation in high-motion scenes is reduced relative to V1.0, with sharper micro-expressions and improved translucent/glass material rendering.
Fast Inference: A 5-second 720p clip generates in approximately 20–30 seconds end-to-end — roughly 2–3× faster than Seedance 2.0.
Broad Format Support: The model accepts JPG, JPEG, PNG, WEBP, GIF, and AVIF input images and outputs H.264 MP4 at 24 FPS across seven aspect ratios (1:1, 16:9, 9:16, 4:3, 3:4, 3:2, 2:3), at 480p or 720p (1280×704) resolution.
Extend Video Chaining: Optimized clip extension allows users to chain segments into longer multi-shot narratives, with V1.5 improving continuity between extension boundaries relative to V1.0.

3. Model Architecture & Technical Details

xai/grok-imagine-video-v1.5/image-to-video is built on xAI's Aurora engine, an autoregressive mixture-of-experts network that predicts next tokens across an interleaved sequence of text, image, video, and audio modalities. This is architecturally distinct from the diffusion-transformer designs used by OpenAI Sora and Google Veo, and is the mechanism by which V1.5 produces joint video+audio output in a single forward pass rather than chaining separate generative models.

Key infrastructure and lineage points:

Training infrastructure: Trained on xAI's Colossus 2 supercomputer, a ~2 GW, ~555,000 NVIDIA GPU facility — the largest known single-site AI training cluster.
R&D lineage: The video pipeline incorporates technology from Hotshot, a video generation startup acquired by xAI in March 2025.
Aurora foundation: The underlying Aurora image model was first released on December 9, 2024, with video capability progressively layered on top through Imagine 0.9 (October 2025), Imagine 1.0 (February 2026), multi-image and extension support (March 2026), and the V1.5 preview (May 2026).
Joint token modeling: Because audio and video tokens are produced in the same autoregressive stream, lip-sync and event-aligned SFX emerge from the model rather than from separate alignment models.

xAI has not published a technical report, parameter count, training-data disclosure, or formal model card for V1.5, so finer architectural details (expert count, context length, tokenizer design) are not publicly documented.

4. Performance Highlights

xai/grok-imagine-video-v1.5/image-to-video debuted at #1 on the Artificial Analysis Video Arena Image-to-Video leaderboard with an Elo rating of 1404 ±6, displacing ByteDance Seedance 2.0 from the top spot.

Comparative positioning across leading image-to-video and video-with-audio systems:

Model	Developer	Max Duration	Max Resolution	Native Audio
Grok Imagine Video V1.5	xAI	15s	720p	Yes
Sora 2	OpenAI	20s	1080p	Yes
Veo 3.1	Google	8s	1080p	Yes
Kling 3.0	Kuaishou	up to ~3 min	1080p	Yes
Seedance 2.0	ByteDance	4–12s	720p	Yes
Runway Gen-4	Runway	10s	1080p	Partial

Qualitative performance characteristics:

Image-to-video coherence: Currently the top-ranked model on the Artificial Analysis I2V arena, particularly strong on photorealistic portrait animation, micro-expressions, and translucent material rendering.
Audio quality: Sharper lip-sync and cleaner voice rendering than V1.0; still trails Veo 3.1 on lip-sync precision for dense dialogue.
Throughput: Approximately 2–3× faster inference than Seedance 2.0 at comparable resolution.
Scale of adoption: V1.0 reportedly generated 1.245 billion videos in its first 30 days of availability, indicating substantial production-scale deployment.
Known weaknesses: Physics fidelity in combat and collision scenes lags top competitors; 720p output cap places it below 1080p-capable rivals for high-resolution delivery.

5. Use Cases

Short-Form Social Video: Vertical (9:16) and square outputs at 1–15 seconds map directly to TikTok, Instagram Reels, YouTube Shorts, and X clip formats, with native audio eliminating the need for separate sound design.
Marketing and Advertising Creative: Rapid generation of product visuals, brand teasers, and ad concepts makes the model suitable for high-volume creative iteration and A/B testing of motion concepts.
Image Animation: Static portraits, posters, illustrations, and product photography can be animated with motion and synchronized audio, enabling reanimation of existing brand and editorial assets.
Concept Visualization and Pre-Visualization: Fast 20–30 second inference per 5-second clip supports rapid concept testing for filmmakers, designers, and creative directors who need to evaluate motion and audio direction before committing to full production.
Multi-Shot Narratives via Extend Video: The optimized extension pipeline supports chaining clips into longer sequences, suitable for short narrative pieces, episodic memes, and serialized social content.
Game and Interactive Asset Pipelines: The text → image → animated video flow integrates into game development and interactive media workflows for cinematics, character idle/action loops, and trailer footage.
Entertainment and Viral Content: Native distribution through Grok on X, combined with low cost and granular duration control, supports meme, parody, and viral content generation directly inside the X ecosystem.

The model is less well-suited to long-form storytelling, structured brand-consistent campaigns requiring fine-tuning, and applications requiring 1080p or higher output resolution.
