midjourney/v8.1/image-to-video

Beeld-naar-Video

Midjourney V8.1 Image-to-Video API by MIDJOURNEY

midjourney/v8.1/image-to-video

Image-to-video

Midjourney V8.1 animates an input image into four 5-second videos at 480p or 720p.

Invoer

Parameterconfiguratie laden...

Uitvoer

Inactief

In wachtrij

Elke uitvoering kost $0.086. Voor $10 kunt u ongeveer 116 keer uitvoeren.

U kunt doorgaan met:

Seedance 2.0 Kling v3 Vidu Wan2.7

Parameters

Codevoorbeeld
import requests
import time

# Step 1: Start video generation
generate_url = "https://api.atlascloud.ai/api/v1/model/generateVideo"
headers = {
    "Content-Type": "application/json",
    "Authorization": "Bearer $ATLASCLOUD_API_KEY"
}
data = {
    "model": "midjourney/v8.1/image-to-video",
    "prompt": "A beautiful sunset over the ocean with gentle waves",
    "width": 512,
    "height": 512,
    "duration": 3,
    "fps": 24,
}

generate_response = requests.post(generate_url, headers=headers, json=data)
generate_result = generate_response.json()
prediction_id = generate_result["data"]["id"]

# Step 2: Poll for result
poll_url = f"https://api.atlascloud.ai/api/v1/model/prediction/{prediction_id}"

def check_status():
    while True:
        response = requests.get(poll_url, headers={"Authorization": "Bearer $ATLASCLOUD_API_KEY"})
        result = response.json()

        if result["data"]["status"] in ["completed", "succeeded"]:
            print("Generated video:", result["data"]["outputs"][0])
            return result["data"]["outputs"][0]
        elif result["data"]["status"] == "failed":
            raise Exception(result["data"]["error"] or "Generation failed")
        else:
            # Still processing, wait 2 seconds
            time.sleep(2)

video_url = check_status()

Installeren

Installeer het vereiste pakket voor uw programmeertaal.

pip install requests

Authenticatie

Alle API-verzoeken vereisen authenticatie via een API-sleutel. U kunt uw API-sleutel ophalen via het Atlas Cloud dashboard.

export ATLASCLOUD_API_KEY="your-api-key-here"

HTTP-headers

import os

API_KEY = os.environ.get("ATLASCLOUD_API_KEY")
headers = {
    "Content-Type": "application/json",
    "Authorization": f"Bearer {API_KEY}"
}

Bescherm uw API-sleutel

Stel uw API-sleutel nooit bloot in client-side code of openbare repositories. Gebruik in plaats daarvan omgevingsvariabelen of een backend-proxy.

Een verzoek indienen

import requests

url = "https://api.atlascloud.ai/api/v1/model/generateVideo"
headers = {
    "Content-Type": "application/json",
    "Authorization": "Bearer $ATLASCLOUD_API_KEY"
}
data = {
    "model": "your-model",
    "prompt": "A beautiful landscape"
}

response = requests.post(url, headers=headers, json=data)
print(response.json())

Een verzoek indienen

Dien een asynchroon generatieverzoek in. De API retourneert een voorspellings-ID waarmee u de status kunt controleren en het resultaat kunt ophalen.

POST/api/v1/model/generateVideo

Verzoekinhoud

import requests

url = "https://api.atlascloud.ai/api/v1/model/generateVideo"
headers = {
    "Content-Type": "application/json",
    "Authorization": "Bearer $ATLASCLOUD_API_KEY"
}

data = {
    "model": "midjourney/v8.1/image-to-video",
    "input": {
        "prompt": "A beautiful sunset over the ocean with gentle waves"
    }
}

response = requests.post(url, headers=headers, json=data)
result = response.json()

print(f"Prediction ID: {result['id']}")
print(f"Status: {result['status']}")

Antwoord

{
  "id": "pred_abc123",
  "status": "processing",
  "model": "model-name",
  "created_at": "2025-01-01T00:00:00Z"
}

Status controleren

Bevraag het voorspellings-eindpunt om de huidige status van uw verzoek te controleren.

GET/api/v1/model/prediction/{prediction_id}

Polling-voorbeeld

import requests
import time

prediction_id = "pred_abc123"
url = f"https://api.atlascloud.ai/api/v1/model/prediction/{prediction_id}"
headers = { "Authorization": "Bearer $ATLASCLOUD_API_KEY" }

while True:
    response = requests.get(url, headers=headers)
    result = response.json()
    status = result["data"]["status"]
    print(f"Status: {status}")

    if status in ["completed", "succeeded"]:
        output_url = result["data"]["outputs"][0]
        print(f"Output URL: {output_url}")
        break
    elif status == "failed":
        print(f"Error: {result['data'].get('error', 'Unknown')}")
        break

    time.sleep(3)

Statuswaarden

processingHet verzoek wordt nog verwerkt.

completedDe generatie is voltooid. Resultaten zijn beschikbaar.

succeededDe generatie is geslaagd. Resultaten zijn beschikbaar.

failedDe generatie is mislukt. Controleer het foutveld.

Voltooid antwoord

{
  "data": {
    "id": "pred_abc123",
    "status": "completed",
    "outputs": [
      "https://storage.atlascloud.ai/outputs/result.mp4"
    ],
    "metrics": {
      "predict_time": 45.2
    },
    "created_at": "2025-01-01T00:00:00Z",
    "completed_at": "2025-01-01T00:00:10Z"
  }
}

Bestanden uploaden

Upload bestanden naar Atlas Cloud opslag en ontvang een URL die u kunt gebruiken in uw API-verzoeken. Gebruik multipart/form-data om te uploaden.

POST/api/v1/model/uploadMedia

Upload-voorbeeld

import requests

url = "https://api.atlascloud.ai/api/v1/model/uploadMedia"
headers = { "Authorization": "Bearer $ATLASCLOUD_API_KEY" }

with open("image.png", "rb") as f:
    files = {"file": ("image.png", f, "image/png")}
    response = requests.post(url, headers=headers, files=files)

result = response.json()
download_url = result["data"]["download_url"]
print(f"File URL: {download_url}")

Antwoord

{
  "data": {
    "download_url": "https://storage.atlascloud.ai/uploads/abc123/image.png",
    "file_name": "image.png",
    "content_type": "image/png",
    "size": 1024000
  }
}

Invoer-Schema

De volgende parameters worden geaccepteerd in de verzoekinhoud.

Totaal: 0Vereist: 0Optioneel: 0

Geen parameters beschikbaar.

Voorbeeld verzoekinhoud

{
  "model": "midjourney/v8.1/image-to-video"
}

Uitvoer-Schema

De API retourneert een voorspellingsantwoord met de gegenereerde uitvoer-URL's.

idstringrequired

Unique identifier for the prediction.

statusstringrequired

Current status of the prediction.

processingcompletedsucceededfailed

modelstringrequired

The model used for generation.

outputsarray[string]

Array of output URLs. Available when status is "completed".

errorstring

Error message if status is "failed".

metricsobject

Performance metrics.

predict_timenumber

Time taken for video generation in seconds.

created_atstringrequired

ISO 8601 timestamp when the prediction was created.

Format: date-time

completed_atstring

ISO 8601 timestamp when the prediction was completed.

Format: date-time

Voorbeeldantwoord

{
  "id": "pred_abc123",
  "status": "completed",
  "model": "model-name",
  "outputs": [
    "https://storage.atlascloud.ai/outputs/result.mp4"
  ],
  "metrics": {
    "predict_time": 45.2
  },
  "created_at": "2025-01-01T00:00:00Z",
  "completed_at": "2025-01-01T00:00:10Z"
}

Atlas Cloud Skills

Atlas Cloud Skills integreert meer dan 300 AI-modellen rechtstreeks in uw AI-codeerassistent. Eén commando om te installeren, gebruik daarna natuurlijke taal om afbeeldingen, video's te genereren en te chatten met LLMs.

Ondersteunde clients

Claude Code

OpenAI Codex

Gemini CLI

Cursor

Windsurf

VS Code

Trae

GitHub Copilot

Cline

Roo Code

Amp

Goose

Replit

40+ ondersteunde clients

Installeren

npx skills add AtlasCloudAI/atlas-cloud-skills

API-sleutel instellen

Haal uw API-sleutel op via het Atlas Cloud dashboard en stel deze in als omgevingsvariabele.

export ATLASCLOUD_API_KEY="your-api-key-here"

Mogelijkheden

Eenmaal geïnstalleerd kunt u natuurlijke taal gebruiken in uw AI-assistent om toegang te krijgen tot alle Atlas Cloud modellen.

BeeldgeneratieGenereer afbeeldingen met modellen zoals Nano Banana 2, Z-Image en meer.

VideocreatieMaak video's van tekst of afbeeldingen met Kling, Vidu, Veo, enz.

LLM-chatChat met Qwen, DeepSeek en andere grote taalmodellen.

Media uploadenUpload lokale bestanden voor beeldbewerking en afbeelding-naar-video workflows.

Meer informatie

github.com/AtlasCloudAI/atlas-cloud-skills

MCP-server

De Atlas Cloud MCP-server verbindt uw IDE met meer dan 300 AI-modellen via het Model Context Protocol. Werkt met elke MCP-compatibele client.

Ondersteunde clients

Cursor

VS Code

Windsurf

Claude Code

OpenAI Codex

Gemini CLI

Cline

Roo Code

100+ ondersteunde clients

Installeren

npx -y atlascloud-mcp

Configuratie

Voeg de volgende configuratie toe aan het MCP-instellingenbestand van uw IDE.

{
  "mcpServers": {
    "atlascloud": {
      "command": "npx",
      "args": [
        "-y",
        "atlascloud-mcp"
      ],
      "env": {
        "ATLASCLOUD_API_KEY": "your-api-key-here"
      }
    }
  }
}

Beschikbare tools

atlas_generate_imageGenereer afbeeldingen op basis van tekstprompts.

atlas_generate_videoMaak video's van tekst of afbeeldingen.

atlas_chatChat met grote taalmodellen.

atlas_list_modelsBlader door meer dan 300 beschikbare AI-modellen.

atlas_quick_generateContentcreatie in één stap met automatische modelselectie.

atlas_upload_mediaUpload lokale bestanden voor API-workflows.

Meer informatie

github.com/AtlasCloudAI/mcp-server

API Schema

Schema niet beschikbaar

Geen voorbeelden beschikbaar

Inloggen om aanvraaggeschiedenis te bekijken

U moet ingelogd zijn om toegang te krijgen tot uw modelaanvraaggeschiedenis.

Inloggen

1. Introduction

Midjourney V8.1 is a text-to-image generation model developed by Midjourney, Inc., representing the latest iteration in the company's image synthesis research. This README applies to the following API model identifiers:

midjourney/v8.1/text-to-image
midjourney/v8.1/image-to-video

Midjourney V8.1 is designed to produce high-aesthetic, prompt-faithful imagery at native 2K resolution with substantially faster generation than prior versions. It is built by Midjourney, an independent, self-funded San Francisco research lab (~11–50 staff) founded in August 2021 by David Holz, and is positioned as a speed- and quality-focused evolution of the company's image pipeline rather than a full feature replacement for its predecessor.

The V8 line is a full from-scratch rewrite of Midjourney's image model, accompanied by a migration from TPU-based to GPU-native PyTorch infrastructure. The model's defining methodology is a human-preference aesthetic tuning loop combined with per-user personalization, prioritizing visually compelling output over raw fidelity to a reference dataset. Released into alpha on April 14, 2026 and reaching general availability across web and Discord on April 30, 2026, V8.1 remains in early testing; the prior V7 model continues to serve as Midjourney's documented default due to feature gaps described below.

2. Key Features & Innovations

Native 2K HD output without a separate upscaler: V8.1 generates directly at 2048px resolution, eliminating the dedicated upscaling step required by earlier versions. HD renders take roughly 1.33 GPU-minutes and standard-definition renders under 1 GPU-minute, with HD running approximately 3× faster and cheaper than in V8.
~5× faster generation: The GPU-native PyTorch rewrite delivers an estimated fivefold speedup in generation time over previous Midjourney versions, improving iteration speed for creative workflows.
Improved text rendering: V8.1 renders in-image text more reliably, with quoted strings in prompts used to specify the intended text — narrowing a long-standing weakness relative to text-specialized competitors.
Stronger prompt-following: The model adheres more closely to prompt instructions, improving controllability and reducing the prompt-engineering effort needed to achieve a target composition.
Restored image conditioning: Image prompts and image weights return in V8.1, alongside backward compatibility with V7 style references (srefs), moodboards, and personalization profiles.
Workflow tooling: V8.1 ships with a Prompt Shortener and an updated /describe command, and its aesthetic has been re-tuned "in the spirit of V7" to preserve the look users prefer.
Personalized aesthetic tuning: A human-preference (RLHF-style) aesthetic tuning loop combined with per-user personalization shapes outputs toward individually preferred visual styles.

3. Model Architecture & Technical Details

Midjourney V8.1 is a complete from-scratch rewrite of the company's image model. As part of the V8 program, Midjourney migrated from TPU-based infrastructure to a GPU-native PyTorch stack; David Holz has publicly stated that the original TPU choice "set research back a year." The underlying generative approach is understood to be latent diffusion, though Midjourney has not published a technical paper or model card, and the specific backbone, parameter count, and text encoder remain undisclosed.

Training details are not publicly documented. The dataset has never been disclosed and is currently contested in copyright litigation brought by Disney, NBCUniversal, and DreamWorks (filed June 2025, amended October 2025 to also target video generation). The defining training methodology is a human-preference aesthetic tuning loop (an RLHF-style process) layered with per-user personalization, which together steer the model toward high-aesthetic, user-aligned outputs rather than optimizing for a single fixed objective.

Because V8.1 is still in alpha, several capabilities present in V7 are not yet available, which is why V7 remains the documented default. The missing features include Omni Reference (--oref), Character Reference, the --no negative prompt, multi-prompts, Quality values, the Niji model, Draft Mode, and Turbo mode.

Regarding the midjourney/v8.1/image-to-video identifier: Midjourney's video capability is separately branded V1, launched June 18, 2025, and is image-to-video only (no text-to-video). It produces 5-second base clips at 24fps, extendable to roughly 21 seconds, with a 480p base resolution and 720p plus premium HD available on higher tiers. It offers Low/High Motion, Auto/Manual settings, and looping with end-frame control (added July 2025). No V8-native or "V8.1" video model has been confirmed, so a video endpoint tagged at "v8.1" likely reflects aggregator mislabeling.

4. Performance Highlights

Midjourney has not published quantitative benchmarks, ELO scores, or arena rankings for V8.1, and the absence of a public API limits the model's presence in third-party evaluation arenas. Performance is therefore best described qualitatively:

Speed and efficiency: Approximately 5× faster generation overall, with native 2K HD rendering at ~1.33 GPU-minutes and SD under 1 GPU-minute.
Resolution: Direct 2048px output with no separate upscaling pass.
Text fidelity: Materially improved in-image text rendering versus prior Midjourney versions.
Prompt adherence: Stronger instruction-following and controllability.
Aesthetics: Re-tuned to preserve the visual character of V7 while improving fidelity.

The table below summarizes the competitive landscape for context. No directly comparable arena scores are available across these systems.

Category	Model	Developer	Notable Strength
Text-to-image	Midjourney V8.1	Midjourney	Aesthetics, native 2K HD, speed
Text-to-image	Flux 2	Black Forest Labs	Photorealism, open weights
Text-to-image	Imagen 4	Google	In-image text
Text-to-image	Ideogram v3	Ideogram	In-image text
Text-to-image	GPT Image / DALL·E	OpenAI	Instruction-following
Text-to-image	Firefly 3	Adobe	Commercial licensing
Video	Sora	OpenAI	Text-to-video
Video	Veo	Google	High-fidelity video
Video	Runway / Kling / Luma	Various	Motion control, length

As a rule of thumb, V8.1 is preferred for speed, HD resolution, and text rendering, while V7 remains the choice for full feature coverage.

5. Intended Use & Applications

Concept art & pre-production: Rapid generation of high-resolution concept imagery for games, film, and product design, accelerating early ideation with fast 2K output.
Marketing & social content: Production of on-brand visuals and social media assets at scale, leveraging improved text rendering for graphics that include words and short phrases.
Film storyboarding & previsualization: Creation of storyboard frames and previs imagery, optionally animated into short clips via Midjourney's separate V1 image-to-video pipeline.
Brand & graphic design: Exploration of visual identities, typography-inclusive layouts, and stylistic directions using image prompts, style references, and moodboards.
Personalized creative iteration: Per-user aesthetic personalization tailors outputs to an individual's preferred visual style, supporting consistent look-and-feel across a body of work.

For workflows requiring features not yet in V8.1 — such as Omni Reference, Character Reference, negative prompts, or the Niji model — the V7 default remains the recommended option.

Midjourney V8.1 Image-to-Video API by MIDJOURNEY

Invoer

Uitvoer

Parameters

Codevoorbeeld

Installeren

Authenticatie

HTTP-headers

Een verzoek indienen

Een verzoek indienen

Verzoekinhoud

Antwoord

Status controleren

Polling-voorbeeld

Statuswaarden

Voltooid antwoord

Bestanden uploaden

Upload-voorbeeld

Antwoord

Invoer-Schema

Voorbeeld verzoekinhoud

Uitvoer-Schema

Voorbeeldantwoord

Atlas Cloud Skills

Ondersteunde clients

Installeren

API-sleutel instellen

Mogelijkheden

MCP-server

Ondersteunde clients

Installeren

Configuratie

Beschikbare tools

API Schema

Inloggen om aanvraaggeschiedenis te bekijken

1. Introduction

2. Key Features & Innovations

3. Model Architecture & Technical Details

4. Performance Highlights

5. Intended Use & Applications

Ontdek Vergelijkbare Modellen

Grok Imagine Video v1.5 Image-to-Video

Gemini Omni Flash Image-to-Video Developer

Gemini Omni Flash Text-to-Video Developer

HappyHorse-1.0 Text-to-video

HappyHorse-1.0 Image-to-video

HappyHorse-1.0 Reference-to-video

HappyHorse-1.0 Video-edit

Seedance 2.0 Fast Reference-to-Video

Seedance 2.0 Fast Image-to-Video

Seedance 2.0 Fast Text-to-Video

Seedance 2.0 Reference-to-Video

Seedance 2.0 Image-to-Video

Seedance 2.0 Text-to-Video

Wan-2.7 Text-to-video

Wan-2.7 Image-to-video

Wan-2.7 Video-edit

Eén API voor alle media-AI.

Join our Discord community

Invoer

Uitvoer

Parameters

Codevoorbeeld

Installeren

Authenticatie

HTTP-headers

Een verzoek indienen

Een verzoek indienen

Verzoekinhoud

Antwoord

Status controleren

Polling-voorbeeld

Statuswaarden

Voltooid antwoord

Bestanden uploaden

Upload-voorbeeld

Antwoord

Invoer-Schema

Voorbeeld verzoekinhoud

Uitvoer-Schema