openai/sora-2/image-to-video

OpenAI Sora 2 generates realistic image-to-video content with synchronized audio, improved physics, sharper realism and steerability.

Code Example

import os
import time

import requests

API_KEY = os.environ.get("ATLASCLOUD_API_KEY")
headers = {
    "Content-Type": "application/json",
    "Authorization": f"Bearer {API_KEY}"
}

# Step 1: Start video generation
generate_url = "https://api.atlascloud.ai/api/v1/model/generateVideo"
data = {
    "model": "openai/sora-2/image-to-video",
    "prompt": "A beautiful sunset over the ocean with gentle waves",
    "width": 1280,   # input image and output video resolution must match (1280x720 or 720x1280)
    "height": 720,
    "duration": 4,   # supported durations: 4, 8, or 12 seconds
    "fps": 24,
}

generate_response = requests.post(generate_url, headers=headers, json=data)
generate_result = generate_response.json()
prediction_id = generate_result["data"]["id"]

# Step 2: Poll for the result
poll_url = f"https://api.atlascloud.ai/api/v1/model/prediction/{prediction_id}"

def check_status():
    while True:
        response = requests.get(poll_url, headers={"Authorization": f"Bearer {API_KEY}"})
        result = response.json()
        status = result["data"]["status"]

        if status in ["completed", "succeeded"]:
            print("Generated video:", result["data"]["outputs"][0])
            return result["data"]["outputs"][0]
        elif status == "failed":
            raise Exception(result["data"]["error"] or "Generation failed")
        else:
            # Still processing, wait 2 seconds before checking again
            time.sleep(2)

video_url = check_status()

Install

Install the required package for your language.

bash
pip install requests

Authentication

All API requests require authentication via an API key. You can get your API key from the Atlas Cloud dashboard.

bash
export ATLASCLOUD_API_KEY="your-api-key-here"

HTTP Headers

python
import os

API_KEY = os.environ.get("ATLASCLOUD_API_KEY")
headers = {
    "Content-Type": "application/json",
    "Authorization": f"Bearer {API_KEY}"
}

Keep your API key secure

Never expose your API key in client-side code or public repositories. Use environment variables or a backend proxy instead.
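
If you need to call the API from a browser or other client-side app, a common pattern is a small backend proxy that keeps the key on the server. Below is a minimal sketch using FastAPI (an assumption; any server framework works the same way): the frontend posts to /generate, and only the server ever sees the Atlas Cloud key.

python
import os

import requests
from fastapi import FastAPI

app = FastAPI()
GENERATE_URL = "https://api.atlascloud.ai/api/v1/model/generateVideo"

@app.post("/generate")
def generate(payload: dict):
    # Forward the client's request, attaching the server-side API key.
    # In a real app, validate and whitelist the fields you pass through.
    response = requests.post(
        GENERATE_URL,
        headers={
            "Content-Type": "application/json",
            "Authorization": f"Bearer {os.environ['ATLASCLOUD_API_KEY']}",
        },
        json=payload,
    )
    return response.json()

Run it with uvicorn and point your frontend at /generate instead of the Atlas Cloud endpoint.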

Submit a request

import os

import requests

url = "https://api.atlascloud.ai/api/v1/model/generateVideo"
headers = {
    "Content-Type": "application/json",
    "Authorization": f"Bearer {os.environ.get('ATLASCLOUD_API_KEY')}"
}
data = {
    "model": "your-model",
    "prompt": "A beautiful landscape"
}

response = requests.post(url, headers=headers, json=data)
print(response.json())

Submit a Request

Submit an asynchronous generation request. The API returns a prediction ID that you can use to check the status and retrieve the result.

POST/api/v1/model/generateVideo

Request Body

import os

import requests

url = "https://api.atlascloud.ai/api/v1/model/generateVideo"
headers = {
    "Content-Type": "application/json",
    "Authorization": f"Bearer {os.environ.get('ATLASCLOUD_API_KEY')}"
}

data = {
    "model": "openai/sora-2/image-to-video",
    "input": {
        "prompt": "A beautiful sunset over the ocean with gentle waves"
    }
}

response = requests.post(url, headers=headers, json=data)
result = response.json()

print(f"Prediction ID: {result['id']}")
print(f"Status: {result['status']}")

Response

{
  "id": "pred_abc123",
  "status": "processing",
  "model": "model-name",
  "created_at": "2025-01-01T00:00:00Z"
}

Check Status

Poll the prediction endpoint to check the current status of your request.

GET/api/v1/model/prediction/{prediction_id}

Polling Example

import os
import time

import requests

prediction_id = "pred_abc123"
url = f"https://api.atlascloud.ai/api/v1/model/prediction/{prediction_id}"
headers = {"Authorization": f"Bearer {os.environ.get('ATLASCLOUD_API_KEY')}"}

while True:
    response = requests.get(url, headers=headers)
    result = response.json()
    status = result["data"]["status"]
    print(f"Status: {status}")

    if status in ["completed", "succeeded"]:
        output_url = result["data"]["outputs"][0]
        print(f"Output URL: {output_url}")
        break
    elif status == "failed":
        print(f"Error: {result['data'].get('error', 'Unknown')}")
        break

    time.sleep(3)

Status Values

processing: The request is still being processed.
completed: Generation is complete. Outputs are available.
succeeded: Generation succeeded. Outputs are available.
failed: Generation failed. Check the error field.

Completed Response

{
  "data": {
    "id": "pred_abc123",
    "status": "completed",
    "outputs": [
      "https://storage.atlascloud.ai/outputs/result.mp4"
    ],
    "metrics": {
      "predict_time": 45.2
    },
    "created_at": "2025-01-01T00:00:00Z",
    "completed_at": "2025-01-01T00:00:10Z"
  }
}

Upload Files

Upload files to Atlas Cloud storage and get a URL you can use in your API requests. Use multipart/form-data to upload.

POST/api/v1/model/uploadMedia

Upload Example

import os

import requests

url = "https://api.atlascloud.ai/api/v1/model/uploadMedia"
headers = {"Authorization": f"Bearer {os.environ.get('ATLASCLOUD_API_KEY')}"}

with open("image.png", "rb") as f:
    files = {"file": ("image.png", f, "image/png")}
    response = requests.post(url, headers=headers, files=files)

result = response.json()
download_url = result["data"]["download_url"]
print(f"File URL: {download_url}")

Response

{
  "data": {
    "download_url": "https://storage.atlascloud.ai/uploads/abc123/image.png",
    "file_name": "image.png",
    "content_type": "image/png",
    "size": 1024000
  }
}

Input Schema

The following parameters are accepted in the request body.

No parameters available.

Example Request Body

json
{
  "model": "openai/sora-2/image-to-video"
}

Output Schema

The API returns a prediction response with the generated output URLs.

id (string, required)
Unique identifier for the prediction.

status (string, required)
Current status of the prediction. One of: processing, completed, succeeded, failed.

model (string, required)
The model used for generation.

outputs (array[string])
Array of output URLs. Available when status is "completed".

error (string)
Error message if status is "failed".

metrics (object)
Performance metrics.

metrics.predict_time (number)
Time taken for video generation in seconds.

created_at (string, required, date-time)
ISO 8601 timestamp when the prediction was created.

completed_at (string, date-time)
ISO 8601 timestamp when the prediction was completed.
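
For type-checked client code, these fields map naturally onto Python TypedDicts. A minimal sketch (the class names are illustrative, not part of the API):

python
from typing import List, Optional, TypedDict

class PredictionMetrics(TypedDict, total=False):
    predict_time: float   # seconds spent generating the video

class Prediction(TypedDict, total=False):
    id: str               # unique prediction identifier (required)
    status: str           # processing | completed | succeeded | failed (required)
    model: str            # model used for generation (required)
    outputs: List[str]    # output URLs, present once generation finishes
    error: Optional[str]  # error message when status is "failed"
    metrics: PredictionMetrics
    created_at: str       # ISO 8601 creation timestamp (required)
    completed_at: str     # ISO 8601 completion timestamp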

Example Response

json
{
  "id": "pred_abc123",
  "status": "completed",
  "model": "model-name",
  "outputs": [
    "https://storage.atlascloud.ai/outputs/result.mp4"
  ],
  "metrics": {
    "predict_time": 45.2
  },
  "created_at": "2025-01-01T00:00:00Z",
  "completed_at": "2025-01-01T00:00:10Z"
}

Atlas Cloud Skills

Atlas Cloud Skills integrates 300+ AI models directly into your AI coding assistant. One command to install, then use natural language to generate images, videos, and chat with LLMs.

Supported Clients

Claude Code
OpenAI Codex
Gemini CLI
Cursor
Windsurf
VS Code
Trae
GitHub Copilot
Cline
Roo Code
Amp
Goose
Replit
40+ supported clients

Install

bash
npx skills add AtlasCloudAI/atlas-cloud-skills

Setup API Key

Get your API key from the Atlas Cloud dashboard and set it as an environment variable.

bash
export ATLASCLOUD_API_KEY="your-api-key-here"

Capabilities

Once installed, you can use natural language in your AI assistant to access all Atlas Cloud models.

Image Generation: Generate images with models like Nano Banana 2, Z-Image, and more.
Video Creation: Create videos from text or images with Kling, Vidu, Veo, etc.
LLM Chat: Chat with Qwen, DeepSeek, and other large language models.
Media Upload: Upload local files for image editing and image-to-video workflows.

MCP Server

Atlas Cloud MCP Server connects your IDE with 300+ AI models via the Model Context Protocol. Works with any MCP-compatible client.

Supported Clients

Cursor
VS Code
Windsurf
Claude Code
OpenAI Codex
Gemini CLI
Cline
Roo Code
100+ supported clients

Install

bash
npx -y atlascloud-mcp

Configuration

Add the following configuration to your IDE's MCP settings file.

json
{
  "mcpServers": {
    "atlascloud": {
      "command": "npx",
      "args": [
        "-y",
        "atlascloud-mcp"
      ],
      "env": {
        "ATLASCLOUD_API_KEY": "your-api-key-here"
      }
    }
  }
}

Available Tools

atlas_generate_image: Generate images from text prompts.
atlas_generate_video: Create videos from text or images.
atlas_chat: Chat with large language models.
atlas_list_models: Browse 300+ available AI models.
atlas_quick_generate: One-step content creation with auto model selection.
atlas_upload_media: Upload local files for API workflows.

🎬 PHYSICS-DRIVEN VIDEO GENERATION

Sora 2: OpenAI's Cinematic AI Video Revolution

OpenAI's state-of-the-art video generation model with physics-accurate motion, synchronized audio generation, and cinematic realism. Create professional 1080p videos up to 20 seconds with unprecedented control over camera movements, world state consistency, and multi-shot narratives.

Revolutionary Breakthroughs

What makes Sora 2 the frontier of AI video generation

Physics-Accurate Motion

Advanced physics modeling enables realistic dynamics—basketball rebounds, Olympic gymnastics, fluid interactions. If a character makes a mistake, it appears as an authentic human error, not a technical glitch. Sora 2 models the internal world state with scientific precision.

Synchronized Audio Generation

Native audio-visual generation with sophisticated soundscapes, speech, and sound effects. Dialogue syncs perfectly with lip movements, background music matches scene pacing, and environmental sounds enhance immersion across photorealistic to anime styles.

Cameo Feature

Revolutionary self-insertion technology—record yourself once to appear in any generated scene. Full opt-in control with verification protection, voice capture, and appearance preservation. Revocable at any time for complete user sovereignty.

Core Capabilities

Professional 1080p Quality

Native 1080p output with 480p and 720p support, cinematic quality at 24fps for production-ready results

Advanced World Modeling

Maintains continuity across multiple shots—camera perspective, scene lighting, and character appearances stay consistent

Intricate Instruction Following

Handles complex multi-shot prompts with accurate world state persistence and narrative coherence

Expanded Stylistic Range

Excels at realistic, cinematic, and anime styles with consistent quality across visual aesthetics

Flexible Duration Control

Generate videos from 5 to 20 seconds with precise control over timing and narrative pacing

Built-in Safety Features

Visible watermarks, C2PA metadata provenance tracking, and internal moderation tools for responsible AI

Two Powerful Generation Modes

Transform ideas and images into cinematic video content

Text-to-Video (T2V)

Most Popular

Generate complete videos from natural language prompts with physics-accurate motion, synchronized audio, and cinematic camera control. Describe shot type, subject, action, setting, and lighting for best results.

  • Advanced physics simulation for realistic dynamics
  • Multi-shot narratives with world state consistency
  • Synchronized audio with dialogue and soundscapes
  • Support for realistic, cinematic, and anime styles

Image-to-Video (I2V)

Enhanced

Transform static images into dynamic videos with motion, camera movements, and audio. The input image resolution must match the final video resolution (720x1280 or 1280x720) for seamless transformation.

  • Preserves source image composition and style
  • Natural motion generation from still frames
  • Camera movement and perspective shifts
  • Audio generation synchronized with visual motion
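
Because the source image must already be at the output resolution, it helps to resize before uploading. A minimal preprocessing sketch with Pillow (the 1280x720 target and black letterbox color are assumptions; swap in 720x1280 for portrait output):

python
from PIL import Image

def fit_to_resolution(src_path: str, dst_path: str, size=(1280, 720)) -> None:
    """Scale the image to fit inside `size`, then letterbox it to exactly `size`."""
    img = Image.open(src_path).convert("RGB")
    scale = min(size[0] / img.width, size[1] / img.height)
    resized = img.resize(
        (round(img.width * scale), round(img.height * scale)), Image.LANCZOS
    )
    canvas = Image.new("RGB", size, (0, 0, 0))  # black letterbox bars
    canvas.paste(resized, ((size[0] - resized.width) // 2, (size[1] - resized.height) // 2))
    canvas.save(dst_path)

fit_to_resolution("source.png", "source_1280x720.png")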

Perfect For

Marketing & Advertising

High-resolution cinematic footage for campaigns, product demos with physics-accurate motion, and branded content

Film Production

Pre-visualization, concept development, storyboard creation with consistent world state across scenes

E-commerce

Product showcases with realistic physics, tutorial videos, and customer experience demonstrations

Education & Training

Instructional content with accurate physics demonstrations, course materials, and educational narratives

Entertainment

Anime and photorealistic content, character-driven stories, cinematic sequences with audio

Content Creation

YouTube videos, social media content, rapid prototyping with Cameo feature integration

Sora 2 T2V and I2V API Integration

Complete API suite for Text-to-Video and Image-to-Video generation

Text-to-Video API (T2V API)

Our Sora 2 T2V API transforms natural language prompts into physics-accurate videos with synchronized audio. Generate professional 1080p videos up to 20 seconds with cinematic camera control and world state consistency.

Physics-accurate motion and dynamics simulation
Synchronized audio generation with dialogue and effects
Multi-shot narratives with world state persistence
Flexible durations: 5-20 seconds

Image-to-Video API (I2V API)

Our Sora 2 I2V API brings still images to life with motion, camera movements, and audio generation. Input resolution must match output video resolution (720x1280 or 1280x720) for seamless transformation.

Resolution-matched source image transformation
Natural motion generation preserving composition
Camera movement and perspective control
Audio generation synchronized with visual motion
💡

Complete API Suite

Both Sora 2 T2V API and I2V API support RESTful architecture with comprehensive documentation. Get started with SDKs for Python, Node.js, and more. Choose between sora-2 for rapid iteration or sora-2-pro for polished cinematic results. All endpoints include physics-accurate motion and synchronized audio generation.

How to Get Started with Sora 2

Start creating professional videos in minutes with two simple paths

API Integration

For developers building applications

1. Sign Up & Login: Create your Atlas Cloud account or login to access the console
2. Add Payment Method: Bind your credit card in the Billing section to fund your account
3. Generate API Key: Navigate to Console → API Keys and create your authentication key
4. Start Building: Use T2V or I2V API endpoints to integrate Sora 2 into your application

Playground Experience

For quick testing and experimentation

1. Sign Up & Login: Create your Atlas Cloud account or login to access the platform
2. Add Payment Method: Bind your credit card in the Billing section to get started
3. Use Playground: Go to the Sora 2 playground, choose T2V or I2V mode, and generate videos instantly

💡
Pro Tip: Test with the sora-2 model in the Playground for rapid iteration, then switch to the sora-2-pro API for final production deliverables when you need maximum quality.

Frequently Asked Questions

What makes Sora 2's physics modeling unique?

Sora 2 uses advanced world state modeling to simulate realistic physics—basketballs rebound accurately, gymnastics follow real dynamics, and fluids behave naturally. When characters make "mistakes," they appear as authentic human errors, not technical glitches, because Sora 2 models internal agent behavior.

How does the Cameo feature work?

Record yourself once to capture your likeness and voice. Sora 2 can then insert you into any generated scene with consistent appearance. It's fully opt-in with verification protection against impersonation, and you can revoke access at any time. Your identity, your control.

What video formats and durations are supported?

Sora 2 generates videos from 5 to 20 seconds in 480p, 720p, and 1080p resolutions. For Image-to-Video generation, the input image resolution must match the output video resolution (either 720x1280 or 1280x720) for seamless transformation.

What's the difference between sora-2 and sora-2-pro?

sora-2 is optimized for speed and exploration—fast iteration when testing tone, structure, or visual style. sora-2-pro takes longer but produces higher quality, more polished results ideal for cinematic footage and marketing assets. Choose based on your workflow stage.

Does Sora 2 include safety features?

Yes! Every Sora 2 video includes visible watermarks and C2PA metadata for content provenance tracking. Internal moderation tools detect prohibited or harmful content. The model enforces strict restrictions: no copyrighted characters, no real people generation, only content suitable for audiences under 18.

Can I use Sora 2 for commercial projects?

Yes! Sora 2 videos are production-ready for marketing campaigns, client deliverables, branded content, and commercial applications. The physics-accurate motion and synchronized audio make it ideal for professional use cases across industries.

Why Use Sora 2 on Atlas Cloud?

Leverage enterprise-grade infrastructure for your professional video generation workflows

Purpose-Built Infrastructure

Deploy Sora 2's physics-accurate video generation and audio synchronization on infrastructure specifically optimized for demanding AI workloads. Maximum performance for 1080p 20-second generation.

Unified API for All Models

Access Sora 2 (T2V, I2V) alongside 300+ AI models (LLMs, image, video, audio) through one unified API. Single integration for all your generative AI needs with consistent auth.

Competitive Pricing

Save up to 70% compared to AWS with transparent, pay-as-you-go pricing. No hidden fees, no commitments—scale from prototype to production without breaking the bank.

SOC I & II Certified Security

Your generated content protected with SOC I & II certifications and HIPAA compliance. Enterprise-grade security with encrypted transmission and storage for peace of mind.

99.9% Uptime SLA

Enterprise-grade reliability with guaranteed 99.9% uptime. Your Sora 2 video generation is always available for production campaigns and critical content workflows.

Easy Integration

Complete integration in minutes with REST API and multi-language SDKs (Python, Node.js, Go). Switch between sora-2 and sora-2-pro seamlessly with unified endpoint structure.

99.9% Uptime
70% Lower Cost vs AWS
300+ Gen AI Models
24/7 Pro Support

Technical Specifications

Model Provider: OpenAI
Resolution: 1080p (720p, 480p also supported)
Frame Rate: 24 FPS
Duration: 5-20 seconds
Available Models: sora-2, sora-2-pro
Generation Modes: T2V (Text-to-Video), I2V (Image-to-Video)
Audio: Synchronized audio with dialogue and effects
Safety Features: Watermarks, C2PA metadata, content moderation

Experience Physics-Driven Video Generation

Join filmmakers, advertisers, and creators worldwide who are revolutionizing video production with Sora 2's groundbreaking physics-accurate motion and synchronized audio capabilities.

OpenAI Sora 2 — Image-to-Video

Turn a single reference image into a coherent video clip with synchronized audio. Built on Sora 2’s core advances, the image-to-video pipeline preserves identity, lighting, and composition while synthesizing believable motion and camera dynamics.


Why it looks great

  • Identity lock-in: preserves faces, style, textures, and scene layout from the reference image.
  • Parallax & depth hallucination: infers 3D structure for convincing foreground/background separation.
  • Physics-aware motion: contact, inertia, and secondary motion (hair, cloth) behave naturally.
  • Temporal consistency: minimal flicker/ghosting with stable subject features across frames.
  • Smart background extension: clean inpainting beyond the original frame for wider moves.
  • Cinematic camera moves: subtle pans, push-ins, arcs, and handheld vibes without warping.
  • Synchronized audio: optional voice/ambience that matches on-screen action and pacing.
  • Strong steerability: prompt edits and controls (duration, fps, motion strength) produce predictable changes.

How to Use

  1. Upload a single reference image (PNG/JPEG).
  2. Add a short prompt for mood, motion style, or camera behavior.
  3. Choose a duration: 4s, 8s, or 12s.
  4. Submit the job; preview and download the result.
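
For the API route, the steps above translate into an upload, a generation request, and a poll. Here is a hedged end-to-end sketch built on the endpoints documented earlier; note that "image_url" is a hypothetical input field name (this page does not list the model's input parameters), so check the actual input schema before relying on it.

python
import os
import time

import requests

API_BASE = "https://api.atlascloud.ai/api/v1/model"
HEADERS = {"Authorization": f"Bearer {os.environ['ATLASCLOUD_API_KEY']}"}

# 1. Upload the reference image and get a hosted URL.
with open("source_1280x720.png", "rb") as f:
    upload = requests.post(
        f"{API_BASE}/uploadMedia",
        headers=HEADERS,
        files={"file": ("source_1280x720.png", f, "image/png")},
    ).json()
image_url = upload["data"]["download_url"]

# 2. Submit the image-to-video job.
#    NOTE: "image_url" is a hypothetical field name; consult the model's input schema.
job = requests.post(
    f"{API_BASE}/generateVideo",
    headers={**HEADERS, "Content-Type": "application/json"},
    json={
        "model": "openai/sora-2/image-to-video",
        "input": {
            "prompt": "Slow push-in, gentle wind, warm evening light",
            "image_url": image_url,
            "duration": 8,  # 4, 8, or 12 seconds
        },
    },
).json()
prediction_id = job["data"]["id"]

# 3. Poll until the prediction finishes, then download the video.
while True:
    pred = requests.get(f"{API_BASE}/prediction/{prediction_id}", headers=HEADERS).json()["data"]
    if pred["status"] in ("completed", "succeeded"):
        video_url = pred["outputs"][0]
        break
    if pred["status"] == "failed":
        raise RuntimeError(pred.get("error") or "Generation failed")
    time.sleep(3)

with open("result.mp4", "wb") as out:
    out.write(requests.get(video_url).content)
print("Saved result.mp4 from", video_url)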

Pricing

Duration    Total ($)
4s          0.40
8s          0.80
12s         1.20

Billing Rules: Linear pricing at $0.10/s. Available durations are 4s, 8s, and 12s.
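
Since billing is linear, per-run cost is simply the duration times the per-second rate. A tiny helper (rate and allowed durations taken from the table above):

python
ALLOWED_DURATIONS = (4, 8, 12)  # seconds
RATE_PER_SECOND = 0.10          # USD

def run_cost(duration: int) -> float:
    if duration not in ALLOWED_DURATIONS:
        raise ValueError(f"duration must be one of {ALLOWED_DURATIONS}")
    return duration * RATE_PER_SECOND

print(run_cost(8))  # 0.8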


Notes

  • Best results come from high-resolution, clean source images with clear subjects and lighting.
  • For big perspective shifts, start with shorter durations or lower motion strength, then iterate.
  • Ensure you own the rights to your image; outputs inherit input content constraints.
  • Please follow OpenAI's usage rules; see the reference "What images are permitted and prohibited in Sora-2" for details.
