Midjourney V8.1 Image-to-Image API by MIDJOURNEY

midjourney/v8.1/image-to-image

Midjourney V8.1 re-imagines an input image guided by a text prompt, returning four variations. Supports native 2K HD, style reference, and aspect-ratio / stylize / chaos / weird controls.

Each request costs $0.086. With $10, you can run this model about 116 times.
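
The run count follows from dividing the budget by the per-request price and rounding down. A minimal sketch of that arithmetic, using the $0.086 price quoted above; the function name `runs_for_budget` is illustrative and not part of the API:

```python
from decimal import Decimal

PRICE_PER_REQUEST = Decimal("0.086")  # USD per request, as quoted above


def runs_for_budget(budget_usd: str) -> int:
    """Return how many whole requests a budget covers at the quoted price."""
    # Floor division drops the fraction of a request the budget cannot pay for
    return int(Decimal(budget_usd) // PRICE_PER_REQUEST)


print(runs_for_budget("10"))  # 116
```

Decimal arithmetic is used here only to avoid binary floating-point rounding when dividing currency amounts.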

Code Example
import os
import time

import requests

# Read the key from the environment; Python does not expand "$ATLASCLOUD_API_KEY" inside a string
API_KEY = os.environ["ATLASCLOUD_API_KEY"]
headers = {
    "Content-Type": "application/json",
    "Authorization": f"Bearer {API_KEY}",
}

# Step 1: Start image generation
generate_url = "https://api.atlascloud.ai/api/v1/model/generateImage"
data = {
    "model": "midjourney/v8.1/image-to-image",
    # Required input image; the URL below is a placeholder
    "image": "https://example.com/input.png",
    "prompt": "A beautiful landscape with mountains and lake",
}

generate_response = requests.post(generate_url, headers=headers, json=data)
generate_response.raise_for_status()
prediction_id = generate_response.json()["data"]["id"]

# Step 2: Poll for result
poll_url = f"https://api.atlascloud.ai/api/v1/model/prediction/{prediction_id}"

def check_status():
    while True:
        response = requests.get(poll_url, headers=headers)
        response.raise_for_status()
        result = response.json()["data"]

        # Both "completed" and "succeeded" mean the outputs are available
        if result["status"] in ("completed", "succeeded"):
            print("Generated image:", result["outputs"][0])
            return result["outputs"][0]
        elif result["status"] == "failed":
            raise RuntimeError(result.get("error") or "Generation failed")
        else:
            # Still processing, wait 2 seconds
            time.sleep(2)

image_url = check_status()

Installation

Install the package required for your language.

pip install requests

Authentication

All API requests require authentication with an API key. You can get an API key from the Atlas Cloud dashboard.

export ATLASCLOUD_API_KEY="your-api-key-here"

HTTP Headers

import os

API_KEY = os.environ.get("ATLASCLOUD_API_KEY")
headers = {
    "Content-Type": "application/json",
    "Authorization": f"Bearer {API_KEY}"
}

Keep your API key secure

Never expose your API key in client-side code or public repositories. Use environment variables or a backend proxy instead.

Submit a Request

import os

import requests

url = "https://api.atlascloud.ai/api/v1/model/generateImage"
headers = {
    "Content-Type": "application/json",
    # Read the key from the environment; "$ATLASCLOUD_API_KEY" is not expanded inside a Python string
    "Authorization": f"Bearer {os.environ['ATLASCLOUD_API_KEY']}",
}
data = {
    "model": "your-model",
    "prompt": "A beautiful landscape"
}

response = requests.post(url, headers=headers, json=data)
print(response.json())

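Submit a Request

Submits an asynchronous generation request. The API returns a prediction ID that you can use to check the status and retrieve the result.

POST /api/v1/model/generateImage

Request Body

import os

import requests

url = "https://api.atlascloud.ai/api/v1/model/generateImage"
headers = {
    "Content-Type": "application/json",
    # Read the key from the environment; "$ATLASCLOUD_API_KEY" is not expanded inside a Python string
    "Authorization": f"Bearer {os.environ['ATLASCLOUD_API_KEY']}",
}

data = {
    "model": "midjourney/v8.1/image-to-image",
    # Required input image; the URL below is a placeholder
    "image": "https://example.com/input.png",
    "prompt": "A beautiful landscape with mountains and lake"
}

response = requests.post(url, headers=headers, json=data)
response.raise_for_status()
result = response.json()

print(f"Prediction ID: {result['data']['id']}")
print(f"Status: {result['data']['status']}")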

Response

{
  "code": 200,
  "data": {
    "id": "pred_abc123",
    "status": "processing",
    "model": "model-name",
    "created_at": "2025-01-01T00:00:00Z"
  }
}
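
Before polling, it can help to confirm that the submit response actually carries a prediction ID. The following is a minimal sketch that assumes a failed submission either omits `data` or omits `data.id`; the page does not document the error shape, and `extract_prediction_id` is a hypothetical helper name.

```python
def extract_prediction_id(body: dict) -> str:
    """Return data.id from a submit response shaped like the example above."""
    # Assumption: an unsuccessful submission has no usable data.id field
    data = body.get("data") or {}
    prediction_id = data.get("id")
    if not prediction_id:
        raise ValueError(f"No prediction ID in response: {body!r}")
    return prediction_id


sample = {
    "code": 200,
    "data": {"id": "pred_abc123", "status": "processing"},
}
print(extract_prediction_id(sample))  # pred_abc123
```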

Check Status

Poll the prediction endpoint to check the current status of a request.

GET /api/v1/model/prediction/{prediction_id}

Polling Example

import os
import time

import requests

prediction_id = "pred_abc123"
url = f"https://api.atlascloud.ai/api/v1/model/prediction/{prediction_id}"
# Read the key from the environment; "$ATLASCLOUD_API_KEY" is not expanded inside a Python string
headers = {"Authorization": f"Bearer {os.environ['ATLASCLOUD_API_KEY']}"}

while True:
    response = requests.get(url, headers=headers)
    response.raise_for_status()
    result = response.json()
    status = result["data"]["status"]
    print(f"Status: {status}")

    if status in ["completed", "succeeded"]:
        output_url = result["data"]["outputs"][0]
        print(f"Output URL: {output_url}")
        break
    elif status == "failed":
        print(f"Error: {result['data'].get('error', 'Unknown')}")
        break

    # Wait before the next poll
    time.sleep(3)

Status Values

processing: The request is still being processed.

completed: Generation is complete. The output is available.

succeeded: Generation succeeded. The output is available.

failed: Generation failed. Check the error field.
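
The status values above split into two terminal groups (success and failure) plus an in-progress state. One way to sketch a polling loop around that split is shown below. The 300-second timeout, the injected `fetch` callable, and the function name are assumptions made for illustration; the page does not specify a recommended timeout.

```python
import time
from typing import Callable

TERMINAL_OK = {"completed", "succeeded"}
TERMINAL_ERROR = {"failed"}


def wait_for_prediction(fetch: Callable[[], dict], interval: float = 3.0,
                        timeout: float = 300.0,
                        sleep: Callable[[float], None] = time.sleep) -> dict:
    """Call fetch() until the prediction reaches a terminal status.

    fetch must return the parsed JSON body of the prediction endpoint.
    """
    deadline = time.monotonic() + timeout
    while True:
        data = fetch()["data"]
        status = data["status"]
        if status in TERMINAL_OK:
            return data
        if status in TERMINAL_ERROR:
            raise RuntimeError(data.get("error") or "Generation failed")
        # Any other status is treated as still in progress
        if time.monotonic() >= deadline:
            raise TimeoutError(f"Prediction still '{status}' after {timeout}s")
        sleep(interval)
```

In real use, `fetch` would wrap the GET request from the polling example, for instance `lambda: requests.get(url, headers=headers).json()`. Passing it in as a callable keeps the loop testable without network access.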

Completed Response

{
  "data": {
    "id": "pred_abc123",
    "status": "completed",
    "outputs": [
      "https://storage.atlascloud.ai/outputs/result.png"
    ],
    "metrics": {
      "predict_time": 8.3
    },
    "created_at": "2025-01-01T00:00:00Z",
    "completed_at": "2025-01-01T00:00:10Z"
  }
}
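
The example above shows a single URL, but this model is described as returning four variations, so `outputs` may hold several entries. Whether the API returns four separate URLs or one grid image is not stated on this page. The sketch below assumes one URL per image and derives a local filename for each; the helper name and the `.png` fallback are illustrative choices.

```python
from pathlib import PurePosixPath
from urllib.parse import urlparse


def local_names(outputs: list[str], prefix: str = "result") -> list[str]:
    """Build one local filename per output URL, keeping each URL's extension."""
    names = []
    for index, url in enumerate(outputs, start=1):
        # urlparse(...).path excludes any query string, so the suffix is clean
        suffix = PurePosixPath(urlparse(url).path).suffix or ".png"
        names.append(f"{prefix}_{index}{suffix}")
    return names


print(local_names(["https://storage.atlascloud.ai/outputs/result.png"]))
```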

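File Upload

Uploads a file to Atlas Cloud storage and returns a URL that you can use in API requests. Uploads use multipart/form-data.

POST /api/v1/model/uploadMedia

Upload Example

import os

import requests

url = "https://api.atlascloud.ai/api/v1/model/uploadMedia"
# Read the key from the environment; "$ATLASCLOUD_API_KEY" is not expanded inside a Python string
headers = {"Authorization": f"Bearer {os.environ['ATLASCLOUD_API_KEY']}"}

with open("image.png", "rb") as f:
    # requests builds the multipart/form-data body from the files argument
    files = {"file": ("image.png", f, "image/png")}
    response = requests.post(url, headers=headers, files=files)

response.raise_for_status()
result = response.json()
download_url = result["data"]["download_url"]
print(f"File URL: {download_url}")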

Response

{
  "data": {
    "download_url": "https://storage.atlascloud.ai/uploads/abc123/image.png",
    "file_name": "image.png",
    "content_type": "image/png",
    "size": 1024000
  }
}
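
The returned `download_url` can then serve as the input image for a generation request. The sketch below assumes the `image` parameter accepts that URL directly, which matches the description of the upload endpoint but is not shown explicitly on this page; `payload_from_upload` is a hypothetical helper name.

```python
MODEL_ID = "midjourney/v8.1/image-to-image"


def payload_from_upload(upload_body: dict, prompt: str) -> dict:
    """Build a generateImage request body from an uploadMedia response."""
    # Assumption: the uploaded file's download_url is valid as the image input
    download_url = upload_body["data"]["download_url"]
    return {"model": MODEL_ID, "image": download_url, "prompt": prompt}


upload_response = {
    "data": {
        "download_url": "https://storage.atlascloud.ai/uploads/abc123/image.png",
        "file_name": "image.png",
    }
}
print(payload_from_upload(upload_response, "A watercolor version of this scene"))
```

The resulting dictionary can be passed as `json=` to the POST request shown in the Submit a Request section.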

Input Schema

The following parameters can be used in the request body.

Request Body Example

{
  "model": "midjourney/v8.1/image-to-image"
}

Output Schema

The API returns a prediction response containing the generated output URLs.

id (string, required)

Unique identifier for the prediction.

status (string, required)

Current status of the prediction.

Allowed values: processing, completed, succeeded, failed

model (string, required)

The model used for generation.

outputs (array[string])

Array of output URLs. Available when status is "completed" or "succeeded".

error (string)

Error message if status is "failed".

metrics (object)

Performance metrics.

predict_time (number, nested under metrics)

Time taken for image generation in seconds.

created_at (string, required)

ISO 8601 timestamp when the prediction was created.

Format: date-time

completed_at (string)

ISO 8601 timestamp when the prediction was completed.

Format: date-time

Response Example

{
  "id": "pred_abc123",
  "status": "completed",
  "model": "model-name",
  "outputs": [
    "https://storage.atlascloud.ai/outputs/result.png"
  ],
  "metrics": {
    "predict_time": 8.3
  },
  "created_at": "2025-01-01T00:00:00Z",
  "completed_at": "2025-01-01T00:00:10Z"
}
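
For typed access to these fields, the schema can be mirrored in a small data class. This is a sketch rather than an official client type: the `Prediction` class and its `from_dict` method are names introduced here, and required versus optional fields follow the schema listing above.

```python
from dataclasses import dataclass, field
from datetime import datetime
from typing import Optional


def _parse_time(value: Optional[str]) -> Optional[datetime]:
    """Parse an ISO 8601 timestamp such as 2025-01-01T00:00:00Z."""
    if value is None:
        return None
    # datetime.fromisoformat accepts a trailing "Z" only on Python 3.11+
    return datetime.fromisoformat(value.replace("Z", "+00:00"))


@dataclass
class Prediction:
    id: str
    status: str
    model: str
    created_at: datetime
    outputs: list[str] = field(default_factory=list)
    error: Optional[str] = None
    predict_time: Optional[float] = None
    completed_at: Optional[datetime] = None

    @classmethod
    def from_dict(cls, raw: dict) -> "Prediction":
        """Build a Prediction from a response object shaped like the example."""
        return cls(
            id=raw["id"],
            status=raw["status"],
            model=raw["model"],
            created_at=_parse_time(raw["created_at"]),
            outputs=list(raw.get("outputs") or []),
            error=raw.get("error"),
            predict_time=(raw.get("metrics") or {}).get("predict_time"),
            completed_at=_parse_time(raw.get("completed_at")),
        )


example = {
    "id": "pred_abc123",
    "status": "completed",
    "model": "model-name",
    "outputs": ["https://storage.atlascloud.ai/outputs/result.png"],
    "metrics": {"predict_time": 8.3},
    "created_at": "2025-01-01T00:00:00Z",
    "completed_at": "2025-01-01T00:00:10Z",
}
prediction = Prediction.from_dict(example)
print(prediction.status, prediction.predict_time)
```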

Atlas Cloud Skills

Atlas Cloud Skills integrates more than 300 AI models directly into AI coding assistants. Install it with a single command, then generate images and videos or chat with LLMs in natural language.

Supported Clients

Claude Code

OpenAI Codex

Gemini CLI

Cursor

Windsurf

VS Code

Trae

GitHub Copilot

Cline

Roo Code

Amp

Goose

Replit

40+ supported clients

Installation

npx skills add AtlasCloudAI/atlas-cloud-skills

API Key Setup

Get an API key from the Atlas Cloud dashboard and set it as an environment variable.

export ATLASCLOUD_API_KEY="your-api-key-here"

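Features

Once installed, you can use natural language in your AI assistant to access all Atlas Cloud models.

Image generation: Generate images with models such as Nano Banana 2 and Z-Image.

Video creation: Create videos from text or images with Kling, Vidu, Veo, and others.

LLM chat: Chat with large language models such as Qwen and DeepSeek.

Media upload: Upload local files for image-editing and image-to-video workflows.

Learn more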

github.com/AtlasCloudAI/atlas-cloud-skills

MCP Server

Atlas Cloud MCP Server connects your IDE to more than 300 AI models through the Model Context Protocol. It works with any MCP-compatible client.

Supported Clients

Cursor

VS Code

Windsurf

Claude Code

OpenAI Codex

Gemini CLI

Cline

Roo Code

100+ supported clients

Installation

npx -y atlascloud-mcp

Configuration

Add the following configuration to your IDE's MCP configuration file.

{
  "mcpServers": {
    "atlascloud": {
      "command": "npx",
      "args": [
        "-y",
        "atlascloud-mcp"
      ],
      "env": {
        "ATLASCLOUD_API_KEY": "your-api-key-here"
      }
    }
  }
}
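
If the MCP configuration file already lists other servers, the `atlascloud` entry needs to be merged in rather than replacing the file. A minimal sketch of that merge on an in-memory dictionary is shown below; the helper name and the `other` server entry are hypothetical, and the location of the configuration file depends on the client.

```python
import json


def add_atlascloud_server(config: dict, api_key: str) -> dict:
    """Return a copy of an MCP config with the atlascloud server entry added."""
    # Copy the existing server map so the caller's dictionary is not mutated
    servers = dict(config.get("mcpServers", {}))
    servers["atlascloud"] = {
        "command": "npx",
        "args": ["-y", "atlascloud-mcp"],
        "env": {"ATLASCLOUD_API_KEY": api_key},
    }
    return {**config, "mcpServers": servers}


existing = {"mcpServers": {"other": {"command": "other-server"}}}
merged = add_atlascloud_server(existing, "your-api-key-here")
print(json.dumps(merged, indent=2))
```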

Available Tools

atlas_generate_image: Generates an image from a text prompt.

atlas_generate_video: Creates a video from text or an image.

atlas_chat: Chats with a large language model.

atlas_list_models: Browses the 300+ available AI models.

atlas_quick_generate: Automatically selects the best model and generates content in one step.

atlas_upload_media: Uploads local files for API workflows.

Learn more

github.com/AtlasCloudAI/mcp-server

1. Introduction

Midjourney V8.1 Image-to-Image (midjourney/v8.1/image-to-image) generates four new images guided by an input image together with a text prompt. Midjourney treats the supplied image as a visual prompt — it reads the image's core elements and uses them as a source of inspiration for new, original results rather than reproducing the input pixel-for-pixel.

It is part of the Midjourney V8.1 family exposed through this API:

midjourney/v8.1/text-to-image — generate from a text prompt
midjourney/v8.1/image-to-image — generate guided by an input image (this model)
midjourney/v8.1/blend — fuse 2–5 images
midjourney/v8.1/style-transfer — restyle an image, preserving composition
midjourney/v8.1/remove-background — isolate the subject on transparency
midjourney/v8.1/image-to-video — animate an image into a short clip

Midjourney V8.1 is built by Midjourney, Inc., an independent, self-funded San Francisco research lab founded in August 2021 by David Holz. V8.1 is the company's fastest model to date and produces high-aesthetic, prompt-faithful imagery at native 2K resolution.

2. Key Features & Innovations

Image-guided generation: An input image steers composition, subject, and aesthetic while the text prompt directs the outcome. The image is used as inspiration, not copied exactly — ideal for variations and creative reinterpretation.
Image prompts restored in V8.1: Image-prompt conditioning (and Midjourney's internal image-weight handling) were absent in the V8.0 alpha and reinstated in V8.1, returning image-driven workflows to the newest model.
Native 2K HD: With hd enabled, V8.1 renders directly at 2048px without a separate upscaling pass.
Faster generation: About 4–5× faster than earlier Midjourney versions, according to Midjourney, as a result of the GPU-native PyTorch rewrite.
Optional style reference: A separate style-reference image (sref) can be supplied to drive the look (colors, medium, texture, lighting) independently of the content image.
Aesthetic controls: stylize, chaos, and weird shape how strongly Midjourney's house aesthetic, variety, and unconventionality are applied.
Four results per request: Each task returns a 4-image grid so you can pick the strongest variation.

3. Parameters & Usage

Parameter	Description
`image` (required)	Input image used as the visual prompt. Provide a publicly reachable HTTPS URL or upload.
`prompt` (required)	Text describing the desired result. Max 1024 characters. A text prompt is required alongside the image — an image alone is not a complete prompt.
`sref`	Optional style-reference image URL to drive the visual style separately from the content image.
`aspect_ratio`	Output aspect ratio (e.g. `1:1`, `16:9`, `9:16`).
`hd`	Enable native 2K (2048px) generation.
`stylize`	Strength of Midjourney's default aesthetic (0–1000).
`chaos`	Variation/unpredictability across the four results (0–100).
`weird`	Unconventionality of the output (0–3000).
`quality`	Detail level; V8.1 supports `1` (default) or `4` (more detail, same price).
`seed`	Fixed seed for reproducible results.

Tips: Pair the input image with a clear, specific prompt — the prompt resolves what the image leaves ambiguous. Use sref when you want the style of one image and the content of another. Note that Midjourney's manual image-weight control (--iw) is not exposed by this endpoint; the model applies its default image/text balance.
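
The limits in the table above can be checked on the client before a request is sent. The sketch below is one way to do that, under stated assumptions: parameters are placed at the top level of the JSON body next to `model` (as `prompt` is in the request examples), `hd` is a boolean, and the numeric controls are integers. The page does not state JSON types, and `build_payload` is a hypothetical helper.

```python
from typing import Optional

MODEL_ID = "midjourney/v8.1/image-to-image"
# Inclusive ranges taken from the parameter table above
RANGES = {"stylize": (0, 1000), "chaos": (0, 100), "weird": (0, 3000)}


def build_payload(image: str, prompt: str, *, sref: Optional[str] = None,
                  aspect_ratio: Optional[str] = None, hd: Optional[bool] = None,
                  quality: Optional[int] = None, seed: Optional[int] = None,
                  **controls: int) -> dict:
    """Validate arguments against the documented limits and build a request body."""
    if not image:
        raise ValueError("image is required")
    if not prompt:
        raise ValueError("prompt is required")
    if len(prompt) > 1024:
        raise ValueError("prompt exceeds 1024 characters")
    if quality is not None and quality not in (1, 4):
        raise ValueError("quality must be 1 or 4")
    for name, value in controls.items():
        if name not in RANGES:
            raise ValueError(f"unknown control: {name}")
        low, high = RANGES[name]
        if not low <= value <= high:
            raise ValueError(f"{name} must be between {low} and {high}")
    payload = {"model": MODEL_ID, "image": image, "prompt": prompt, **controls}
    optional = {"sref": sref, "aspect_ratio": aspect_ratio, "hd": hd,
                "quality": quality, "seed": seed}
    # Leave out anything the caller did not set so the API applies its defaults
    payload.update({k: v for k, v in optional.items() if v is not None})
    return payload


print(build_payload("https://example.com/input.png", "A misty pine forest",
                    aspect_ratio="16:9", hd=True, stylize=250))
```

Rejecting out-of-range values locally avoids spending a paid request on a call the service would be likely to reject.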

4. Model Architecture & Technical Details

Midjourney V8.1 is a complete from-scratch rewrite of the company's image model. As part of the V8 program, Midjourney migrated from TPU-based infrastructure to a GPU-native PyTorch stack. The underlying generative approach is understood to be latent diffusion; Midjourney has not published a technical paper or model card, so the backbone, parameter count, text encoder, and training data remain undisclosed. The defining methodology is a human-preference (RLHF-style) aesthetic tuning loop combined with per-user personalization. V8.1 was released on midjourney.com on April 30, 2026 and became the default Midjourney model on June 10, 2026.

The training dataset has never been disclosed and is the subject of active, unresolved copyright litigation — Disney Enterprises, Inc. v. Midjourney, Inc. (No. 2:25-cv-05275, C.D. Cal.), filed June 11, 2025 by a coalition of major studios including Disney, Marvel, Lucasfilm, Twentieth Century, Universal, and DreamWorks. Those infringement claims are allegations in pending litigation and have not been adjudicated.

5. Intended Use & Applications

Variations & iteration: Produce alternative takes on an existing image while keeping its general subject and feel.
Restyling & reinterpretation: Reimagine a photo, sketch, or render in a new artistic direction guided by the prompt.
Concept development: Evolve a reference into polished concept art for games, film, and product design.
Marketing & social assets: Generate on-brand variants from a hero image, optionally constrained by a style reference.

For pixel-faithful restyling that preserves the original composition exactly, use midjourney/v8.1/style-transfer; to merge several images, use midjourney/v8.1/blend.
