black-forest-labs/flux-2-flex/edit

image-to-image

FLUX.2 Flex Edit API by Black Forest Labs

black-forest-labs/flux-2-flex/edit

Edit

FLUX.2 Flex Edit is a professional image editing model specialized for typography, fine detail preservation, and production workflows. It provides adjustable inference steps and guidance scale, with multi-reference support for up to 8 images, delivering precise control over photorealistic editing results up to 4MP.

INPUT

Prompt *

Enable Prompt Expansion

Images *(0/8)

You can drag and drop a file or click to upload

MAX:8

Size

Width

Height

1024 × 1024 pxRange: 256 - 2048

Guidance scale

Num inference steps

Safety tolerance

Output format

Seed

OUTPUT

Idle

Your generated images will appear here

Configure your settings and click Run to get started

Your request will cost $0.05 per run. For $10 you can run this model approximately 200 times.

Here's what you can do next:

Image-to-Video Image-to-Image

Parameters

Code Example
import requests
import time

# Step 1: Start image generation
generate_url = "https://api.atlascloud.ai/api/v1/model/generateImage"
headers = {
    "Content-Type": "application/json",
    "Authorization": "Bearer $ATLASCLOUD_API_KEY"
}
data = {
    "model": "black-forest-labs/flux-2-flex/edit",  # Required. model name
    "prompt": "A beautiful landscape with mountains and lake",  # Required. Text prompt for image editing
    "images": [
        "https://example.com/image1.jpg"
    ],  # Required. List of input images for editing
    "enable_prompt_expansion": True,  # Whether to use prompt upsampling to enhance the prompt before generation
    "size": "1024*1024",  # Image dimensions in width*height format (e. (min: 256, max: 2048)
    "guidance_scale": 5,  # Guidance scale for image generation. (min: 1.5, max: 10)
    "num_inference_steps": 50,  # Number of steps for image generation. (min: 1, max: 50)
    "output_format": "jpeg",  # The format of the output image. options: jpeg | png
    "safety_tolerance": 2,  # Tolerance level for input and output moderation. (min: 0, max: 5)
    "seed": -1,  # Random seed for reproducibility
    "enable_base64_output": False,  # If enabled, the output will be encoded into a BASE64 string instead of a URL
    "enable_sync_mode": False,  # If set to true, the function will wait for the result to be generated and uploaded before returning the response
}

generate_response = requests.post(generate_url, headers=headers, json=data)
generate_result = generate_response.json()
prediction_id = generate_result["data"]["id"]

# Step 2: Poll for result
poll_url = f"https://api.atlascloud.ai/api/v1/model/prediction/{prediction_id}"

def check_status():
    while True:
        response = requests.get(poll_url, headers={"Authorization": "Bearer $ATLASCLOUD_API_KEY"})
        result = response.json()

        if result["data"]["status"] == "completed":
            print("Generated image:", result["data"]["outputs"][0])
            return result["data"]["outputs"][0]
        elif result["data"]["status"] == "failed":
            raise Exception(result["data"]["error"] or "Generation failed")
        else:
            # Still processing, wait 2 seconds
            time.sleep(2)

image_url = check_status()

Install

Install the required package for your language.

pip install requests

Authentication

All API requests require authentication via an API key. You can get your API key from the Atlas Cloud dashboard.

export ATLASCLOUD_API_KEY="your-api-key-here"

HTTP Headers

import os

API_KEY = os.environ.get("ATLASCLOUD_API_KEY")
headers = {
    "Content-Type": "application/json",
    "Authorization": f"Bearer {API_KEY}"
}

Keep your API key secure

Never expose your API key in client-side code or public repositories. Use environment variables or a backend proxy instead.

Submit a request

import requests

url = "https://api.atlascloud.ai/api/v1/model/generateImage"
headers = {
    "Content-Type": "application/json",
    "Authorization": "Bearer $ATLASCLOUD_API_KEY"
}
data = {
    "model": "your-model",
    "prompt": "A beautiful landscape"
}

response = requests.post(url, headers=headers, json=data)
print(response.json())

Submit a Request

Submit an asynchronous generation request. The API returns a prediction ID that you can use to check the status and retrieve the result.

POST/api/v1/model/generateImage

Request Body

import requests

url = "https://api.atlascloud.ai/api/v1/model/generateImage"
headers = {
    "Content-Type": "application/json",
    "Authorization": "Bearer $ATLASCLOUD_API_KEY"
}

data = {
    "model": "black-forest-labs/flux-2-flex/edit",
    "prompt": "A beautiful landscape with mountains and lake"
}

response = requests.post(url, headers=headers, json=data)
result = response.json()

print(f"Prediction ID: {result['data']['id']}")
print(f"Status: {result['data']['status']}")

Response

{
  "code": 200,
  "data": {
    "id": "pred_abc123",
    "status": "processing",
    "model": "model-name",
    "created_at": "2025-01-01T00:00:00Z"
  }
}

Check Status

Poll the prediction endpoint to check the current status of your request.

GET/api/v1/model/prediction/{prediction_id}

Polling Example

import requests
import time

prediction_id = "pred_abc123"
url = f"https://api.atlascloud.ai/api/v1/model/prediction/{prediction_id}"
headers = { "Authorization": "Bearer $ATLASCLOUD_API_KEY" }

while True:
    response = requests.get(url, headers=headers)
    result = response.json()
    status = result["data"]["status"]
    print(f"Status: {status}")

    if status in ["completed", "succeeded"]:
        output_url = result["data"]["outputs"][0]
        print(f"Output URL: {output_url}")
        break
    elif status == "failed":
        print(f"Error: {result['data'].get('error', 'Unknown')}")
        break

    time.sleep(3)

Status Values

processingThe request is still being processed.

completedGeneration is complete. Outputs are available.

succeededGeneration succeeded. Outputs are available.

failedGeneration failed. Check the error field.

Completed Response

{
  "data": {
    "id": "pred_abc123",
    "status": "completed",
    "outputs": [
      "https://storage.atlascloud.ai/outputs/result.png"
    ],
    "metrics": {
      "predict_time": 8.3
    },
    "created_at": "2025-01-01T00:00:00Z",
    "completed_at": "2025-01-01T00:00:10Z"
  }
}

Upload Files

Upload files to Atlas Cloud storage and get a URL you can use in your API requests. Use multipart/form-data to upload.

POST/api/v1/model/uploadMedia

Upload Example

import requests

url = "https://api.atlascloud.ai/api/v1/model/uploadMedia"
headers = { "Authorization": "Bearer $ATLASCLOUD_API_KEY" }

with open("image.png", "rb") as f:
    files = {"file": ("image.png", f, "image/png")}
    response = requests.post(url, headers=headers, files=files)

result = response.json()
download_url = result["data"]["download_url"]
print(f"File URL: {download_url}")

Response

{
  "data": {
    "download_url": "https://storage.atlascloud.ai/uploads/abc123/image.png",
    "file_name": "image.png",
    "content_type": "image/png",
    "size": 1024000
  }
}

Input Schema

The following parameters are accepted in the request body.

Total: 12Required: 3Optional: 9

modelstringrequired

model name

Default: "black-forest-labs/flux-2-flex/edit"

promptstringrequired

Text prompt for image editing.

imagesarray[string]required

List of input images for editing. Each image can be a URL or a base64-encoded image string. Maximum 8 images supported.

Min items: 1Max items: 8

enable_prompt_expansionboolean

Whether to use prompt upsampling to enhance the prompt before generation.

Default: true

sizestring

Image dimensions in width*height format (e.g., 1024*1024, 1280*720), If not specified, the model will determine the optimal dimension.

Default: "1024*1024"Min: 256Max: 2048

guidance_scalenumber

Guidance scale for image generation. High guidance scales improve prompt adherence at the cost of reduced realism.

Default: 5Min: 1.5Max: 10

num_inference_stepsinteger

Number of steps for image generation. Higher steps lead to more detailed and realistic images.

Default: 50Min: 1Max: 50

output_formatstring

The format of the output image.

Default: "jpeg"

jpegpng

safety_toleranceinteger

Tolerance level for input and output moderation. Between 0 and 5, 0 being most strict, 5 being least strict.

Default: 2Min: 0Max: 5

seedinteger

Random seed for reproducibility. Use -1 for a random seed.

Default: -1

enable_base64_outputboolean

If enabled, the output will be encoded into a BASE64 string instead of a URL. This property is only available through the API.

Default: false

enable_sync_modeboolean

If set to true, the function will wait for the result to be generated and uploaded before returning the response. It allows you to get the result directly in the response. This property is only available through the API.

Default: false

Example Request Body

{
  "model": "black-forest-labs/flux-2-flex/edit",
  "prompt": "A beautiful landscape",
  "images": [
    "https://example.com/file.jpg"
  ],
  "enable_prompt_expansion": true,
  "size": "1024*1024",
  "guidance_scale": 5,
  "num_inference_steps": 50,
  "output_format": "jpeg",
  "safety_tolerance": 2,
  "seed": -1,
  "enable_base64_output": false,
  "enable_sync_mode": false
}

Output Schema

The API returns a prediction response with the generated output URLs.

codeinteger

HTTP status code of the response.

messagestring

Human-readable message; non-empty on failure.

dataobject

Example Response

{
  "id": "pred_abc123",
  "status": "completed",
  "model": "model-name",
  "outputs": [
    "https://storage.atlascloud.ai/outputs/result.png"
  ],
  "metrics": {
    "predict_time": 8.3
  },
  "created_at": "2025-01-01T00:00:00Z",
  "completed_at": "2025-01-01T00:00:10Z"
}

Atlas Cloud Skills

Atlas Cloud Skills integrates 400+ AI models directly into your AI coding assistant. One command to install, then use natural language to generate images, videos, and chat with LLMs.

Supported Clients

Claude Code

OpenAI Codex

Gemini CLI

Cursor

Windsurf

VS Code

Trae

GitHub Copilot

Cline

Roo Code

Amp

Goose

Replit

40+ supported clients

Install

npx skills add AtlasCloudAI/atlas-cloud-skills

Setup API Key

Get your API key from the Atlas Cloud dashboard and set it as an environment variable.

export ATLASCLOUD_API_KEY="your-api-key-here"

Capabilities

Once installed, you can use natural language in your AI assistant to access all Atlas Cloud models.

Image GenerationGenerate images with models like Nano Banana 2, Z-Image, and more.

Video CreationCreate videos from text or images with Kling, Vidu, Veo, etc.

LLM ChatChat with Qwen, DeepSeek, and other large language models.

Media UploadUpload local files for image editing and image-to-video workflows.

Learn more

github.com/AtlasCloudAI/atlas-cloud-skills

MCP Server

Atlas Cloud MCP Server connects your IDE with 400+ AI models via the Model Context Protocol. Works with any MCP-compatible client.

Supported Clients

Cursor

VS Code

Windsurf

Claude Code

OpenAI Codex

Gemini CLI

Cline

Roo Code

100+ supported clients

Install

npx -y atlascloud-mcp

Configuration

Add the following configuration to your IDE's MCP settings file.

{
  "mcpServers": {
    "atlascloud": {
      "command": "npx",
      "args": [
        "-y",
        "atlascloud-mcp"
      ],
      "env": {
        "ATLASCLOUD_API_KEY": "your-api-key-here"
      }
    }
  }
}

Available Tools

atlas_generate_imageGenerate images from text prompts.

atlas_generate_videoCreate videos from text or images.

atlas_chatChat with large language models.

atlas_list_modelsBrowse 400+ available AI models.

atlas_quick_generateOne-step content creation with auto model selection.

atlas_upload_mediaUpload local files for API workflows.

Learn more

github.com/AtlasCloudAI/mcp-server

API Schema

{
  "info": {
    "description": "The AtlasCloud API.",
    "title": "AtlasCloud API",
    "version": "1.0.0"
  },
  "paths": {
    "/api/v1/model/generateImage": {
      "post": {
        "requestBody": {
          "content": {
            "application/json": {
              "schema": {
                "$ref": "#/components/schemas/Input"
              }
            }
          },
          "required": true
        },
        "responses": {
          "200": {
            "content": {
              "application/json": {
                "schema": {
                  "$ref": "#/components/schemas/PredictionResponse"
                }
              }
            },
            "description": "The request status."
          }
        }
      },
      "x-api-name": "model_run"
    },
    "/api/v1/model/prediction/{request_id}": {
      "get": {
        "parameters": [
          {
            "in": "path",
            "name": "request_id",
            "required": true,
            "schema": {
              "description": "Request ID",
              "type": "string"
            }
          }
        ],
        "responses": {
          "200": {
            "content": {
              "application/json": {
                "schema": {
                  "$ref": "#/components/schemas/PredictionResponse"
                }
              }
            },
            "description": "Result of the request."
          }
        }
      },
      "x-api-name": "model_result"
    }
  },
  "openapi": "3.0.0",
  "servers": [
    {
      "url": "https://api.atlascloud.ai"
    }
  ],
  "components": {
    "schemas": {
      "Input": {
        "properties": {
          "model": {
            "type": "string",
            "description": "model name",
            "default": "black-forest-labs/flux-2-flex/edit"
          },
          "prompt": {
            "type": "string",
            "description": "Text prompt for image editing."
          },
          "images": {
            "type": "array",
            "items": {
              "type": "string"
            },
            "minItems": 1,
            "maxItems": 8,
            "description": "List of input images for editing. Each image can be a URL or a base64-encoded image string. Maximum 8 images supported.",
            "x-ui-component": "uploaders"
          },
          "enable_prompt_expansion": {
            "type": "boolean",
            "default": true,
            "description": "Whether to use prompt upsampling to enhance the prompt before generation."
          },
          "size": {
            "default": "1024*1024",
            "description": "Image dimensions in width*height format (e.g., 1024*1024, 1280*720), If not specified, the model will determine the optimal dimension.",
            "maximum": 2048,
            "minimum": 256,
            "type": "string",
            "x-hidden": true
          },
          "guidance_scale": {
            "type": "number",
            "default": 5,
            "minimum": 1.5,
            "maximum": 10,
            "description": "Guidance scale for image generation. High guidance scales improve prompt adherence at the cost of reduced realism."
          },
          "num_inference_steps": {
            "type": "integer",
            "default": 50,
            "minimum": 1,
            "maximum": 50,
            "description": "Number of steps for image generation. Higher steps lead to more detailed and realistic images."
          },
          "output_format": {
            "type": "string",
            "default": "jpeg",
            "enum": [
              "jpeg",
              "png"
            ],
            "description": "The format of the output image.",
            "x-ui-component": "select"
          },
          "safety_tolerance": {
            "type": "integer",
            "default": 2,
            "minimum": 0,
            "maximum": 5,
            "description": "Tolerance level for input and output moderation. Between 0 and 5, 0 being most strict, 5 being least strict."
          },
          "seed": {
            "type": "integer",
            "default": -1,
            "description": "Random seed for reproducibility. Use -1 for a random seed."
          },
          "enable_base64_output": {
            "type": "boolean",
            "default": false,
            "disabled": true,
            "description": "If enabled, the output will be encoded into a BASE64 string instead of a URL. This property is only available through the API."
          },
          "enable_sync_mode": {
            "type": "boolean",
            "default": false,
            "disabled": true,
            "description": "If set to true, the function will wait for the result to be generated and uploaded before returning the response. It allows you to get the result directly in the response. This property is only available through the API."
          }
        },
        "required": [
          "model",
          "prompt",
          "images"
        ],
        "type": "object",
        "x-order-properties": [
          "model",
          "prompt",
          "enable_prompt_expansion",
          "images",
          "size",
          "guidance_scale",
          "num_inference_steps",
          "safety_tolerance",
          "output_format",
          "seed",
          "enable_base64_output",
          "enable_sync_mode"
        ]
      },
      "PredictionResponse": {
        "type": "object",
        "properties": {
          "code": {
            "description": "HTTP status code of the response.",
            "type": "integer"
          },
          "message": {
            "description": "Human-readable message; non-empty on failure.",
            "type": "string"
          },
          "data": {
            "type": "object",
            "properties": {
              "id": {
                "description": "Unique identifier for the prediction.",
                "type": "string"
              },
              "model": {
                "description": "Model ID used for the prediction.",
                "type": "string"
              },
              "outputs": {
                "description": "Array of URLs to the generated content. Null when status is not completed.",
                "type": "array",
                "items": {
                  "type": "string"
                },
                "nullable": true
              },
              "urls": {
                "description": "Object containing related API endpoints.",
                "type": "object",
                "properties": {
                  "get": {
                    "description": "URL to poll for the prediction result.",
                    "type": "string",
                    "format": "uri"
                  }
                }
              },
              "has_nsfw_contents": {
                "description": "Array of boolean values indicating NSFW detection for each output. Null if not applicable.",
                "type": "array",
                "items": {
                  "type": "boolean"
                },
                "nullable": true
              },
              "status": {
                "description": "Status of the task: created, processing, completed, timeout, or failed.",
                "type": "string"
              },
              "created_at": {
                "description": "ISO timestamp of when the request was created (e.g., \"2023-04-01T12:34:56.789Z\").",
                "format": "date-time",
                "type": "string"
              },
              "error": {
                "description": "Error message if the task failed, empty string otherwise.",
                "type": "string"
              },
              "error_code": {
                "description": "Error code if the task failed.",
                "type": "integer"
              },
              "executionTime": {
                "description": "Total execution time in milliseconds.",
                "type": "number"
              },
              "timings": {
                "description": "Detailed timing breakdown.",
                "type": "object",
                "properties": {
                  "inference": {
                    "description": "Inference time in milliseconds.",
                    "type": "number"
                  }
                }
              }
            }
          }
        }
      }
    },
    "securitySchemes": {
      "apiKeyAuth": {
        "in": "header",
        "name": "Authorization",
        "type": "apiKey"
      }
    }
  }
}

LLM-Friendly Prompt Template

# black-forest-labs/flux-2-flex/edit

> FLUX.2 Flex Edit is a professional image editing model specialized for typography, fine detail preservation, and production workflows. It provides adjustable inference steps and guidance scale, with multi-reference support for up to 8 images, delivering precise control over photorealistic editing results up to 4MP.


## Overview

- **Submit endpoint (POST)**: `https://api.atlascloud.ai/api/v1/model/generateImage` — start an async generation; returns a `prediction_id`
- **Poll endpoint (GET)**: `https://api.atlascloud.ai/api/v1/model/prediction/{prediction_id}` — poll this until the prediction finishes
- **Model ID**: `black-forest-labs/flux-2-flex/edit`


## API Information

This model can be used via our HTTP API or more conveniently via our client libraries.
See the input and output schema below, as well as the usage examples.


### Input Schema

The API accepts the following input parameters:

- **`model`** (`string`, _required_):
  model name
  - Default: `"black-forest-labs/flux-2-flex/edit"`

- **`prompt`** (`string`, _required_):
  Text prompt for image editing.

- **`enable_prompt_expansion`** (`boolean`, _optional_):
  Whether to use prompt upsampling to enhance the prompt before generation.
  - Default: `true`

- **`images`** (`array[string]`, _required_):
  List of input images for editing. Each image can be a URL or a base64-encoded image string. Maximum 8 images supported.
  - Min items: 1
  - Max items: 8

- **`size`** (`string`, _optional_):
  Image dimensions in width*height format (e.g., 1024*1024, 1280*720), If not specified, the model will determine the optimal dimension.
  - Default: `"1024*1024"`
  - Min: 256
  - Max: 2048

- **`guidance_scale`** (`number`, _optional_):
  Guidance scale for image generation. High guidance scales improve prompt adherence at the cost of reduced realism.
  - Default: `5`
  - Min: 1.5
  - Max: 10

- **`num_inference_steps`** (`integer`, _optional_):
  Number of steps for image generation. Higher steps lead to more detailed and realistic images.
  - Default: `50`
  - Min: 1
  - Max: 50

- **`safety_tolerance`** (`integer`, _optional_):
  Tolerance level for input and output moderation. Between 0 and 5, 0 being most strict, 5 being least strict.
  - Default: `2`
  - Min: 0
  - Max: 5

- **`output_format`** (`string`, _optional_):
  The format of the output image.
  - Default: `"jpeg"`
  - Options: "jpeg", "png"

- **`seed`** (`integer`, _optional_):
  Random seed for reproducibility. Use -1 for a random seed.
  - Default: `-1`

- **`enable_base64_output`** (`boolean`, _optional_):
  If enabled, the output will be encoded into a BASE64 string instead of a URL. This property is only available through the API.
  - Default: `false`

- **`enable_sync_mode`** (`boolean`, _optional_):
  If set to true, the function will wait for the result to be generated and uploaded before returning the response. It allows you to get the result directly in the response. This property is only available through the API.
  - Default: `false`



**Required Parameters Example**:

```json
{
  "model": "black-forest-labs/flux-2-flex/edit",
  "prompt": "",
  "images": [
    ""
  ]
}
```


**Full Example**:

```json
{
  "model": "black-forest-labs/flux-2-flex/edit",
  "prompt": "",
  "enable_prompt_expansion": true,
  "images": [
    ""
  ],
  "size": "1024*1024",
  "guidance_scale": 5,
  "num_inference_steps": 50,
  "safety_tolerance": 2,
  "output_format": "jpeg",
  "seed": -1,
  "enable_base64_output": false,
  "enable_sync_mode": false
}
```


### Output Schema

The API returns the following output format:


- **`code`** (`integer`, _optional_):
  HTTP status code of the response.

- **`message`** (`string`, _optional_):
  Human-readable message; non-empty on failure.

- **`data`** (`object`, _optional_):
  - Properties:
    - **`id`** (`string`, _optional_):
      Unique identifier for the prediction.

    - **`model`** (`string`, _optional_):
      Model ID used for the prediction.

    - **`outputs`** (`array[string]`, _optional_):
      Array of URLs to the generated content. Null when status is not completed.

    - **`urls`** (`object`, _optional_):
      Object containing related API endpoints.
      - Properties:
        - **`get`** (`string`, _optional_):
          URL to poll for the prediction result.


    - **`has_nsfw_contents`** (`array[boolean]`, _optional_):
      Array of boolean values indicating NSFW detection for each output. Null if not applicable.

    - **`status`** (`string`, _optional_):
      Status of the task: created, processing, completed, timeout, or failed.

    - **`created_at`** (`string`, _optional_):
      ISO timestamp of when the request was created (e.g., "2023-04-01T12:34:56.789Z").

    - **`error`** (`string`, _optional_):
      Error message if the task failed, empty string otherwise.

    - **`error_code`** (`integer`, _optional_):
      Error code if the task failed.

    - **`executionTime`** (`number`, _optional_):
      Total execution time in milliseconds.

    - **`timings`** (`object`, _optional_):
      Detailed timing breakdown.
      - Properties:
        - **`inference`** (`number`, _optional_):
          Inference time in milliseconds.





**Example Response**:

```json
{
  "code": 0,
  "message": "",
  "data": {
    "id": "",
    "model": "",
    "outputs": [
      ""
    ],
    "urls": {
      "get": ""
    },
    "has_nsfw_contents": [],
    "status": "",
    "created_at": "",
    "error": "",
    "error_code": 0,
    "executionTime": 0,
    "timings": {
      "inference": 0
    }
  }
}
```


## Usage Examples

### cURL

```bash
# Step 1: Start generation (async)
curl -X POST "https://api.atlascloud.ai/api/v1/model/generateImage" \
  -H "Authorization: Bearer $ATLASCLOUD_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{
  "model": "black-forest-labs/flux-2-flex/edit",
  "prompt": "",
  "enable_prompt_expansion": true,
  "images": [
    ""
  ],
  "size": "1024*1024",
  "guidance_scale": 5,
  "num_inference_steps": 50,
  "safety_tolerance": 2,
  "output_format": "jpeg",
  "seed": -1,
  "enable_base64_output": false,
  "enable_sync_mode": false
}'

# Response will contain: {"code": 200, "data": {"id": "prediction_id", "status": "processing"}}

# Step 2: Poll for result (replace {prediction_id} with the id returned above)
curl -X GET "https://api.atlascloud.ai/api/v1/model/prediction/{prediction_id}" \
  -H "Authorization: Bearer $ATLASCLOUD_API_KEY"

# Keep polling until status is "completed", "succeeded" or "failed"
# When completed, outputs will contain the generated content URL(s)
```

## Additional Resources

### Documentation

- [Model Playground](https://www.atlascloud.ai/models/black-forest-labs/flux-2-flex/edit)

Design a movie promotional poster for a character.

FLUX.2 Flex — Edit (Image Editing)

Developer: Black Forest Labs
Model ID: black-forest-labs/flux-2-flex/edit
Release Date: February 23, 2026

Overview

FLUX.2 Flex Edit is a professional image editing model specialized for typography, fine detail preservation, and production workflows requiring precise customization. Built on the same latent flow matching architecture as FLUX.2 Flex, it accepts a text prompt alongside one or more reference images, enabling targeted edits, subject replacement, style transfer, and multi-reference identity-consistent generation — all without fine-tuning.

It provides adjustable inference steps (1–50) and guidance scale (1.5–10) for fine-grained control over the diffusion process. The latest version is up to 3× faster than previous FLUX.2 Flex releases while maintaining the same high output quality. Supports up to 8 reference images per request.

Key Capabilities

Superior typography — Reliable text rendering for complex typography, infographics, memes, UI mockups, and marketing materials with legible fine text.
Adjustable inference parameters — Fine-grained control through configurable steps (1–50) and guidance scale (1.5–10) to balance quality, speed, and prompt adherence.
Multi-reference support — Up to 8 reference images for consistent character, product, or style generation across compositions.
High-resolution output up to 4MP — Flexible aspect ratios with an improved VAE achieving 63.5% better learnability (gFID) and 84.1% better reconstruction quality (LPIPS) versus prior architectures.
HEX color accuracy — Precise color matching using hex color codes for brand-accurate asset creation.
Prompt expansion — Optional automatic prompt enrichment to improve output quality from short or simple prompts.

Use Cases

Image editing — Edit existing images through text instructions: change backgrounds, swap textures, alter objects, or relight scenes with fine-grained control.
Multi-reference generation — Maintain character, product, or style consistency across compositions using multiple reference images.
Typography and text-heavy editing — Apply precise text edits, add labels, overlays, or redesign layout elements with accurate rendering.
E-commerce asset creation — Product variations, environment swaps, and label editing for marketing materials.
Brand asset consistency — Generate a series of brand-consistent images from a set of reference shots with exact hex-color accuracy.
Iterative refinement — Refine images through successive edits while maintaining overall visual coherence, using adjustable steps for speed/quality balance.

Input Parameters

Parameter	Type	Default	Range	Description
`prompt`	string	—	—	Text description of the desired edit. Required.
`images`	array	—	1–8 items	One or more reference images. Each item is a URL or Base64-encoded string. Required.
`width`	integer	1024	256–2048	Width of the output image in pixels.
`height`	integer	1024	256–2048	Height of the output image in pixels.
`steps`	integer	50	1–50	Number of diffusion inference steps. Lower values are faster; higher values produce maximum fidelity.
`guidance`	number	5	1.5–10	Guidance scale controlling how strictly the model follows the prompt. Higher values increase prompt adherence.
`enable_prompt_expansion`	boolean	`true`	—	When enabled, the model automatically expands short prompts to improve output quality.
`output_format`	string	`jpeg`	`jpeg`, `png`	Output image format.
`safety_tolerance`	integer	2	0–5	Moderation strictness level. 0 = most strict, 5 = least strict.
`seed`	integer	-1	—	Random seed for reproducibility. Use -1 for a random seed.

Note on images: Each item can be a publicly accessible image URL or a Base64-encoded image string (e.g., data:image/jpeg;base64,...). When only one reference image is provided, its actual resolution is used for billing. When multiple images are provided, each is billed at a flat 1 MP rate.

Steps vs. Quality Trade-off

Steps Range	Recommended Use
6–20	Rapid prototyping, fast iteration
20–40	Balanced quality for general use
40–50	Maximum fidelity for final deliverables

Output

Returns a URL to the edited image. The model supports asynchronous generation: the response first contains a polling URL, which resolves to the final image once processing is complete.

Output formats: JPEG, PNG Maximum output resolution: 4 megapixels (e.g., 2048×2048)

Note: FLUX.2 Flex exhibits higher latency compared to FLUX.2 Pro, reflecting its emphasis on quality and fine-grained customization over raw speed.

Pricing

Pricing covers both the generated output image and the reference input image(s), each measured in megapixels (MP). 1 MP = 1,048,576 pixels. Resolution is capped at 4 MP for billing purposes. A single flat per-megapixel rate applies to both generated and reference images.

SKUs

SKU	Description	Unit Price
`sku_mp`	Per megapixel for both the generated output image and each reference input image (up to 4 MP each)	$0.05

Formula

cost = (min(4, ceil(width × height / 1,048,576)) + reference_image_mp) × sku_mp

Where reference_image_mp is:

Multiple images (len(images) > 1): each image is billed at 1 MP flat:
```
reference_image_mp = len(images)
```
Single image (len(images) == 1): actual resolution of the image is used (capped at 4 MP):
```
reference_image_mp = min(4, ceil(GetImagePixelsFromURL(images[0]) / 1,048,576))
```

Examples

Output Resolution	Reference Images	Reference MP	Output MP	Total MP	Cost
1024×1024 (1 MP)	1 × 1024×1024 image (1 MP)	1 MP	1 MP	2 MP	$0.100
1024×1024 (1 MP)	3 images (1 MP each, flat)	3 MP	1 MP	4 MP	$0.200
2048×2048 (4 MP)	1 × 2048×2048 image (4 MP)	4 MP	4 MP	8 MP	$0.400

When a single reference image is provided, its actual pixel count is fetched at billing time. If the image URL is inaccessible or returns an error, billing fails and the request is not charged.

Explore Similar Models

NEW

text-to-image

Flux Dev

Flux-dev text to image model, 12 billion parameter rectified flow transformer.

Flux Kontext Dev

FLUX.1 Kontext [dev] is a development version of the state-of-the-art image editing model that lets you edit images using text prompts. It makes editing intuitive by understanding the relationship between visuals and language.

Flux Kontext Dev Lora

Fast FLUX.1 Kontext [dev] endpoint with LoRA support for rapid image editing using pre-trained adapters for brand and style. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

Flux Schnell

FLUX.1 [schnell] is fastest image generation model tailored for local development and personal use, a 12 billion parameter rectified flow transformer.

FLUX.2 Flex Text-to-image

FLUX.2 Flex is a professional text-to-image generation model specialized for typography, fine detail preservation, and production workflows requiring precise customization. It provides adjustable inference steps (1–50) and guidance scale (1.5–10) for fine-grained control, delivering high-quality output up to 4MP.

FLUX.2 Pro Edit

FLUX.2 Pro Edit is a professional image editing model that accepts a text prompt alongside one or more reference images, enabling targeted edits, subject replacement, style transfer, and multi-reference identity-consistent generation — all without fine-tuning. It supports up to 8 reference images per request with high visual consistency.

FLUX.2 Pro Text-to-image

FLUX.2 Pro is a state-of-the-art text-to-image generation model that raises the bar for photorealistic quality, prompt fidelity, and production reliability. Built on a multimodal flow matching architecture, it delivers up to 4MP coherent scenes with realistic lighting, spatial logic, and fine detail that close the gap with professional photography.

Flux Dev Lora

Rapid, high-quality image generation with FLUX.1 [dev] and LoRA support for personalized styles and brand-specific outputs.

Openai GPT Image 2 Text-to-Image

GPT Image 2 text to image is OpenAI's fast, cost-efficient text-to-image generator powered by GPT-5 guidance. Create photorealistic shots, product renders, concept art, and stylized graphics from natural-language prompts (optionally conditioned with an image). Supports custom aspect ratios, seeds, negative prompts, hex color hints, and style presets. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

Openai GPT Image 2 Edit

GPT Image 2 Edit is OpenAI's image model for precise, natural-language edits. Add/remove objects, swap backgrounds, retouch faces, adjust colors/lighting, edit text/graphics, crop/resize, and apply hex color control. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

GPT Image 2 Developer Edit

GPT Image 2 Developer Edit applies natural-language instructions to one or more reference images, with common aspect ratios and 1k, 2k, or supported 4k output tiers. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

GPT Image 2 Developer Text-to-Image

GPT Image 2 Developer Text-to-Image generates polished visuals from natural-language prompts, with common aspect ratios and 1k, 2k, or supported 4k output tiers. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

Seedream v5.0 Pro Edit

ByteDance flagship next-generation image editing model. Supports up to 10 reference images while preserving identity, lighting, and color tones for professional-quality modifications.

Seedream v5.0 Pro Text-to-Image

ByteDance flagship next-generation image generation model with stronger prompt adherence, refined typography, and photorealistic detail. Single-image output at 1.5K and 2K tiers with JPEG and PNG support.

Nano Banana 2 Lite Edit Developer

Google's fastest and most cost-efficient Nano Banana image model for editing, applying natural-language edits and multi-image composition to up to 14 reference images with low latency.

Nano Banana 2 Lite Text-to-Image Developer

Google's fastest and most cost-efficient Nano Banana image model, turning natural-language text prompts into high-quality 1k images in as little as 4 seconds for rapid, high-volume generation.

From$0.04/PIC

$0.028/PIC

-30%

One API for All Media AI.

Explore all models