google/nano-banana-pro/edit-developer

Image-to-Image

PRODEV

Nano Banana Pro Edit Developer API by Google

google/nano-banana-pro/edit-developer

Edit-developer

Open and Advanced Large-Scale Image Generative Models.

Input

Prompt *

Images * (0/10)

Drag and drop files here, or click to upload

MAX: 10

Aspect ratio

Resolution

Enable web search


Each run costs $0.07. With $10 you can run about 142 times.
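The run estimate follows from dividing the budget by the per-run price and rounding down. A minimal sketch, working in whole cents to avoid floating-point rounding; the function name `runs_for_budget` is illustrative and not part of any Atlas Cloud API:

```python
def runs_for_budget(budget_cents: int, price_cents: int) -> int:
    """Return how many whole runs a budget covers at a fixed per-run price."""
    if price_cents <= 0:
        raise ValueError("price_cents must be positive")
    return budget_cents // price_cents


# $10.00 budget at $0.07 per run
print(runs_for_budget(1000, 7))  # 142
```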

You can continue with:

Image-to-Video, Image-to-Image

Parameters

Code example
import os
import time

import requests

# Read the API key from the environment. "$ATLASCLOUD_API_KEY" is shell syntax
# and would be sent literally if left inside a Python string.
API_KEY = os.environ["ATLASCLOUD_API_KEY"]

# Step 1: Start image generation
generate_url = "https://api.atlascloud.ai/api/v1/model/generateImage"
headers = {
    "Content-Type": "application/json",
    "Authorization": f"Bearer {API_KEY}"
}
data = {
    "model": "google/nano-banana-pro/edit-developer",  # Required. model name
    "aspect_ratio": "1:1",  # The aspect ratio of the generated media. options: 1:1 | 3:2 | 2:3 | 3:4 | 4:3 | 4:5 | 5:4 | 9:16 | 16:9 | 21:9
    "enable_base64_output": False,  # If enabled, the output will be encoded into a BASE64 string instead of a URL
    "enable_sync_mode": False,  # If set to true, the function will wait for the result to be generated and uploaded before returning the response
    "enable_web_search": False,  # If enabled, the model will use web search to ground the generation with real-time information
    "images": [
        "https://example.com/image1.jpg"
    ],  # Required. List of URLs of input images for editing
    "prompt": "A beautiful landscape with mountains and lake",  # Required. The positive prompt for the generation
    "resolution": "1k",  # The resolution of the output image. options: 1k | 2k | 4k
}

generate_response = requests.post(generate_url, headers=headers, json=data)
generate_response.raise_for_status()  # Surface HTTP errors before parsing the body
generate_result = generate_response.json()
prediction_id = generate_result["data"]["id"]

# Step 2: Poll for result
poll_url = f"https://api.atlascloud.ai/api/v1/model/prediction/{prediction_id}"

def check_status():
    while True:
        response = requests.get(poll_url, headers={"Authorization": f"Bearer {API_KEY}"})
        response.raise_for_status()
        result = response.json()
        status = result["data"]["status"]

        # Both "completed" and "succeeded" are documented as success states
        if status in ("completed", "succeeded"):
            print("Generated image:", result["data"]["outputs"][0])
            return result["data"]["outputs"][0]
        elif status == "failed":
            # The error field may be absent, so fall back to a generic message
            raise Exception(result["data"].get("error") or "Generation failed")
        else:
            # Still processing, wait 2 seconds
            time.sleep(2)

image_url = check_status()

Install

Install the required package

pip install requests

Authentication

All API requests must be authenticated with an API key. You can get an API key from the Atlas Cloud dashboard.

export ATLASCLOUD_API_KEY="your-api-key-here"

HTTP Headers

import os

API_KEY = os.environ.get("ATLASCLOUD_API_KEY")
headers = {
    "Content-Type": "application/json",
    "Authorization": f"Bearer {API_KEY}"
}

Keep your API key secure

Never expose your API key in client-side code or public repositories. Use environment variables or a server-side proxy instead.

Make a Request

import os

import requests

API_KEY = os.environ["ATLASCLOUD_API_KEY"]  # Read the key from the environment

url = "https://api.atlascloud.ai/api/v1/model/generateImage"
headers = {
    "Content-Type": "application/json",
    "Authorization": f"Bearer {API_KEY}"
}
data = {
    "model": "your-model",
    "prompt": "A beautiful landscape"
}

response = requests.post(url, headers=headers, json=data)
print(response.json())

Submit a Request

Submit an asynchronous generation request. The API returns a prediction ID that you can use to check the status and retrieve the result.

POST /api/v1/model/generateImage

Request body

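import os

import requests

API_KEY = os.environ["ATLASCLOUD_API_KEY"]  # Read the key from the environment

url = "https://api.atlascloud.ai/api/v1/model/generateImage"
headers = {
    "Content-Type": "application/json",
    "Authorization": f"Bearer {API_KEY}"
}

data = {
    "model": "google/nano-banana-pro/edit-developer",
    "prompt": "A beautiful landscape with mountains and lake",
    "images": ["https://example.com/image1.jpg"]  # Required: 1 to 10 input image URLs
}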

response = requests.post(url, headers=headers, json=data)
result = response.json()

print(f"Prediction ID: {result['data']['id']}")
print(f"Status: {result['data']['status']}")

Response

{
  "code": 200,
  "data": {
    "id": "pred_abc123",
    "status": "processing",
    "model": "model-name",
    "created_at": "2025-01-01T00:00:00Z"
  }
}

Check Status

Poll the prediction endpoint periodically to check the current status of your request.

GET /api/v1/model/prediction/{prediction_id}

Polling example

import os
import time

import requests

API_KEY = os.environ["ATLASCLOUD_API_KEY"]  # Read the key from the environment

prediction_id = "pred_abc123"
url = f"https://api.atlascloud.ai/api/v1/model/prediction/{prediction_id}"
headers = { "Authorization": f"Bearer {API_KEY}" }

while True:
    response = requests.get(url, headers=headers)
    result = response.json()
    status = result["data"]["status"]
    print(f"Status: {status}")

    if status in ["completed", "succeeded"]:
        output_url = result["data"]["outputs"][0]
        print(f"Output URL: {output_url}")
        break
    elif status == "failed":
        print(f"Error: {result['data'].get('error', 'Unknown')}")
        break

    time.sleep(3)

Status values

processing: The request is still being processed.

completed: Generation has finished. The result is ready.

succeeded: Generation succeeded. The result is ready.

failed: Generation failed. Check the error field.

Completed response

{
  "data": {
    "id": "pred_abc123",
    "status": "completed",
    "outputs": [
      "https://storage.atlascloud.ai/outputs/result.png"
    ],
    "metrics": {
      "predict_time": 8.3
    },
    "created_at": "2025-01-01T00:00:00Z",
    "completed_at": "2025-01-01T00:00:10Z"
  }
}
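The polling loops shown so far run forever if a prediction never reaches a final state. A minimal sketch of a bounded wait, assuming only the status values and response shape documented above; the function name `wait_for_prediction`, its parameters, and the default timeout are illustrative choices, not part of the Atlas Cloud API. The HTTP call is passed in as `fetch` so the loop can be exercised without network access:

```python
import time
from typing import Callable

# Success states listed in the status table above
TERMINAL_OK = {"completed", "succeeded"}


def wait_for_prediction(fetch: Callable[[], dict],
                        timeout_s: float = 120.0,
                        interval_s: float = 2.0,
                        sleep: Callable[[float], None] = time.sleep) -> list:
    """Poll `fetch` until the prediction succeeds, fails, or times out.

    `fetch` is any zero-argument callable that returns the parsed JSON body
    of GET /api/v1/model/prediction/{prediction_id}.
    """
    deadline = time.monotonic() + timeout_s
    while True:
        data = fetch().get("data", {})
        status = data.get("status")
        if status in TERMINAL_OK:
            return data.get("outputs", [])
        if status == "failed":
            raise RuntimeError(data.get("error") or "Generation failed")
        if time.monotonic() >= deadline:
            raise TimeoutError(f"Prediction still '{status}' after {timeout_s}s")
        sleep(interval_s)
```

With `requests`, `fetch` could be `lambda: requests.get(url, headers=headers).json()`, where `url` and `headers` are built as in the polling example.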

Upload Files

Upload a file to Atlas Cloud storage and get a URL that you can use in your API requests. Uploads use multipart/form-data.

POST /api/v1/model/uploadMedia

Upload example

import os

import requests

API_KEY = os.environ["ATLASCLOUD_API_KEY"]  # Read the key from the environment

url = "https://api.atlascloud.ai/api/v1/model/uploadMedia"
headers = { "Authorization": f"Bearer {API_KEY}" }

with open("image.png", "rb") as f:
    files = {"file": ("image.png", f, "image/png")}
    response = requests.post(url, headers=headers, files=files)

result = response.json()
download_url = result["data"]["download_url"]
print(f"File URL: {download_url}")

Response

{
  "data": {
    "download_url": "https://storage.atlascloud.ai/uploads/abc123/image.png",
    "file_name": "image.png",
    "content_type": "image/png",
    "size": 1024000
  }
}
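The returned `download_url` is what goes into the `images` array of an edit request. A minimal sketch of that hand-off, assuming the upload response has the shape shown above; the helper name `edit_request_from_upload` is illustrative:

```python
def edit_request_from_upload(upload_response: dict, prompt: str) -> dict:
    """Build a generateImage request body from an uploadMedia response."""
    download_url = upload_response["data"]["download_url"]
    return {
        "model": "google/nano-banana-pro/edit-developer",
        "prompt": prompt,
        "images": [download_url],  # 1 to 10 image URLs are accepted
    }


# Example using the sample upload response from this section
sample_upload = {
    "data": {
        "download_url": "https://storage.atlascloud.ai/uploads/abc123/image.png",
        "file_name": "image.png",
    }
}
body = edit_request_from_upload(sample_upload, "Turn this scene into nighttime.")
print(body["images"])
```

The resulting `body` can be passed as `json=body` to the `generateImage` POST shown earlier.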

Input Schema

The following parameters are accepted in the request body.

Total: 8, Required: 3, Optional: 5

model (string, required)

model name

Default: "google/nano-banana-pro/edit-developer"

aspect_ratio (string)

The aspect ratio of the generated media.

Options: 1:1, 3:2, 2:3, 3:4, 4:3, 4:5, 5:4, 9:16, 16:9, 21:9

enable_base64_output (boolean)

If enabled, the output will be encoded into a BASE64 string instead of a URL. This property is only available through the API.

Default: false

enable_sync_mode (boolean)

If set to true, the function will wait for the result to be generated and uploaded before returning the response. It allows you to get the result directly in the response. This property is only available through the API.

Default: false

enable_web_search (boolean)

If enabled, the model will use web search to ground the generation with real-time information.

Default: false

images (array[string], required)

List of URLs of input images for editing. The maximum number of images is 10.

Min items: 1, Max items: 10

prompt (string, required)

The positive prompt for the generation.

resolution (string)

The resolution of the output image.

Default: "1k"

Options: 1k, 2k, 4k

Example request body

{
  "model": "google/nano-banana-pro/edit-developer",
  "enable_base64_output": false,
  "enable_sync_mode": false,
  "enable_web_search": false,
  "images": [
    "https://example.com/file.jpg"
  ],
  "prompt": "A beautiful landscape",
  "resolution": "1k"
}
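Checking a request body against the constraints listed in the input schema before sending it can catch mistakes without spending a run. A minimal client-side sketch, assuming only the required fields, item limits, and enum values listed above; the function name `validate_request` is illustrative, and the server may apply further checks not covered here:

```python
ASPECT_RATIOS = {"1:1", "3:2", "2:3", "3:4", "4:3", "4:5", "5:4", "9:16", "16:9", "21:9"}
RESOLUTIONS = {"1k", "2k", "4k"}
REQUIRED_FIELDS = ("model", "prompt", "images")


def validate_request(body: dict) -> list:
    """Return a list of problems; an empty list means these checks passed."""
    problems = []
    for field in REQUIRED_FIELDS:
        if field not in body:
            problems.append(f"missing required field: {field}")
    if "images" in body:
        images = body["images"]
        if not isinstance(images, list) or not 1 <= len(images) <= 10:
            problems.append("images must be a list of 1 to 10 URLs")
    if "aspect_ratio" in body and body["aspect_ratio"] not in ASPECT_RATIOS:
        problems.append(f"unsupported aspect_ratio: {body['aspect_ratio']}")
    if "resolution" in body and body["resolution"] not in RESOLUTIONS:
        problems.append(f"unsupported resolution: {body['resolution']}")
    return problems


example_body = {
    "model": "google/nano-banana-pro/edit-developer",
    "images": ["https://example.com/file.jpg"],
    "prompt": "A beautiful landscape",
    "resolution": "1k",
}
print(validate_request(example_body))  # []
```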

Output Schema

The API returns a prediction response with the URLs of the generated output.

created_at (string)

ISO timestamp of when the request was created (e.g., “2023-04-01T12:34:56.789Z”).

id (string)

Unique identifier for the prediction, the ID of the prediction to get.

model (string)

Model ID used for the prediction.

outputs (array)

Array of URLs to the generated content (empty when status is not completed).

status (string)

Status of the task: created, processing, completed, or failed.

Example response

{
  "id": "pred_abc123",
  "status": "completed",
  "model": "model-name",
  "outputs": [
    "https://storage.atlascloud.ai/outputs/result.png"
  ],
  "metrics": {
    "predict_time": 8.3
  },
  "created_at": "2025-01-01T00:00:00Z",
  "completed_at": "2025-01-01T00:00:10Z"
}
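Entries in `outputs` are URLs by default, but `enable_base64_output` switches them to BASE64 strings. A minimal sketch that handles both cases. The exact BASE64 format is not documented here, so the code assumes either a bare BASE64 string or a `data:` URI with a BASE64 payload; the helper name `decode_output` is illustrative:

```python
import base64


def decode_output(output: str):
    """Return ("url", str) for http(s) outputs, else ("bytes", decoded bytes)."""
    if output.startswith(("http://", "https://")):
        return ("url", output)
    # Assumption: strip an optional data-URI prefix such as "data:image/png;base64,"
    if output.startswith("data:") and "," in output:
        output = output.split(",", 1)[1]
    return ("bytes", base64.b64decode(output))


kind, value = decode_output("https://storage.atlascloud.ai/outputs/result.png")
print(kind)  # url
```

For the `"bytes"` case, the decoded value can be written to disk with `open(path, "wb")`.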

Atlas Cloud Skills

Atlas Cloud Skills connects 400+ AI models directly to your AI coding assistant. Install it with a single command, then use natural language to generate images and videos and to chat with LLMs.

Supported clients

Claude Code

OpenAI Codex

Gemini CLI

Cursor

Windsurf

VS Code

Trae

GitHub Copilot

Cline

Roo Code

Amp

Goose

Replit

40+ supported clients

Install

npx skills add AtlasCloudAI/atlas-cloud-skills

Set the API Key

Get an API key from the Atlas Cloud dashboard and set it as an environment variable.

export ATLASCLOUD_API_KEY="your-api-key-here"

Capabilities

Once installed, you can use natural language in your AI assistant to access all Atlas Cloud models.

Image generation: Generate images with models such as Nano Banana 2, Z-Image, and more.

Video generation: Generate videos from text or images with Kling, Vidu, Veo, and others.

LLM chat: Chat with Qwen, DeepSeek, and other large language models.

Media upload: Upload local files for image editing and image-to-video workflows.

Learn more

github.com/AtlasCloudAI/atlas-cloud-skills

MCP Server

Atlas Cloud MCP Server connects your IDE to 400+ AI models through the Model Context Protocol. It works with any MCP-compatible client.

Supported clients

Cursor

VS Code

Windsurf

Claude Code

OpenAI Codex

Gemini CLI

Cline

Roo Code

100+ supported clients

Install

npx -y atlascloud-mcp

Configuration

Add the following configuration to your IDE's MCP settings file.

{
  "mcpServers": {
    "atlascloud": {
      "command": "npx",
      "args": [
        "-y",
        "atlascloud-mcp"
      ],
      "env": {
        "ATLASCLOUD_API_KEY": "your-api-key-here"
      }
    }
  }
}

Available tools

atlas_generate_image: Generate images from a text prompt.

atlas_generate_video: Generate videos from text or images.

atlas_chat: Chat with large language models.

atlas_list_models: Browse the 400+ available AI models.

atlas_quick_generate: Generate content in one step with automatic model selection.

atlas_upload_media: Upload local files for API workflows.

Learn more

github.com/AtlasCloudAI/mcp-server

API Schema

{
  "info": {
    "description": "The AtlasCloud API.",
    "title": "AtlasCloud API",
    "version": "1.0.0"
  },
  "paths": {
    "/api/v1/model/generateImage": {
      "post": {
        "requestBody": {
          "content": {
            "application/json": {
              "schema": {
                "$ref": "#/components/schemas/Input"
              }
            }
          },
          "required": true
        },
        "responses": {
          "200": {
            "content": {
              "application/json": {
                "schema": {
                  "$ref": "#/components/schemas/PredictionResponse"
                }
              }
            },
            "description": "The request status."
          }
        }
      },
      "x-api-name": "model_run"
    },
    "/api/v1/model/prediction/{request_id}": {
      "get": {
        "parameters": [
          {
            "in": "path",
            "name": "request_id",
            "required": true,
            "schema": {
              "description": "Request ID",
              "type": "string"
            }
          }
        ],
        "responses": {
          "200": {
            "content": {
              "application/json": {
                "schema": {
                  "$ref": "#/components/schemas/PredictionResponse"
                }
              }
            },
            "description": "Result of the request."
          }
        }
      },
      "x-api-name": "model_result"
    }
  },
  "openapi": "3.0.0",
  "servers": [
    {
      "url": "https://api.atlascloud.ai"
    }
  ],
  "components": {
    "schemas": {
      "Input": {
        "properties": {
          "model": {
            "type": "string",
            "description": "model name",
            "default": "google/nano-banana-pro/edit-developer"
          },
          "aspect_ratio": {
            "description": "The aspect ratio of the generated media.",
            "enum": [
              "1:1",
              "3:2",
              "2:3",
              "3:4",
              "4:3",
              "4:5",
              "5:4",
              "9:16",
              "16:9",
              "21:9"
            ],
            "type": "string",
            "x-placeholder": "Select aspect ratio"
          },
          "enable_base64_output": {
            "default": false,
            "description": "If enabled, the output will be encoded into a BASE64 string instead of a URL. This property is only available through the API.",
            "disabled": true,
            "type": "boolean"
          },
          "enable_sync_mode": {
            "default": false,
            "description": "If set to true, the function will wait for the result to be generated and uploaded before returning the response. It allows you to get the result directly in the response. This property is only available through the API.",
            "disabled": true,
            "type": "boolean"
          },
          "enable_web_search": {
            "default": false,
            "description": "If enabled, the model will use web search to ground the generation with real-time information.",
            "type": "boolean"
          },
          "images": {
            "description": "List of URLs of input images for editing. The maximum number of images is 10.",
            "items": {
              "type": "string"
            },
            "maxItems": 10,
            "minItems": 1,
            "type": "array",
            "x-ui-component": "uploaders"
          },
          "prompt": {
            "description": "The positive prompt for the generation.",
            "type": "string"
          },
          "resolution": {
            "default": "1k",
            "description": "The resolution of the output image.",
            "enum": [
              "1k",
              "2k",
              "4k"
            ],
            "type": "string"
          }
        },
        "required": [
          "model",
          "prompt",
          "images"
        ],
        "type": "object",
        "x-order-properties": [
          "model",
          "prompt",
          "images",
          "aspect_ratio",
          "resolution",
          "enable_web_search",
          "enable_sync_mode",
          "enable_base64_output"
        ]
      },
      "PredictionResponse": {
        "properties": {
          "created_at": {
            "description": "ISO timestamp of when the request was created (e.g., “2023-04-01T12:34:56.789Z”).",
            "format": "date-time",
            "type": "string"
          },
          "has_nsfw_contents": {
            "description": "Array of boolean values indicating NSFW detection for each output.",
            "items": {
              "type": "boolean"
            },
            "type": "array"
          },
          "id": {
            "description": "Unique identifier for the prediction, the ID of the prediction to get.",
            "type": "string"
          },
          "model": {
            "description": "Model ID used for the prediction.",
            "type": "string"
          },
          "outputs": {
            "description": "Array of URLs to the generated content (empty when status is not completed).",
            "items": {
              "type": "string"
            },
            "type": "array"
          },
          "status": {
            "description": "Status of the task: created, processing, completed, or failed.",
            "type": "string"
          },
          "urls": {
            "description": "Object containing related API endpoints.",
            "type": "object"
          }
        },
        "type": "object"
      }
    },
    "securitySchemes": {
      "apiKeyAuth": {
        "in": "header",
        "name": "Authorization",
        "type": "apiKey"
      }
    }
  }
}

Prompt Template for LLMs

# google/nano-banana-pro/edit-developer

> Open and Advanced Large-Scale Image Generative Models.


## Overview

- **Submit endpoint (POST)**: `https://api.atlascloud.ai/api/v1/model/generateImage` — start an async generation; returns a `prediction_id`
- **Poll endpoint (GET)**: `https://api.atlascloud.ai/api/v1/model/prediction/{prediction_id}` — poll this until the prediction finishes
- **Model ID**: `google/nano-banana-pro/edit-developer`


## API Information

This model can be used via our HTTP API or more conveniently via our client libraries.
See the input and output schema below, as well as the usage examples.


### Input Schema

The API accepts the following input parameters:

- **`model`** (`string`, _required_):
  model name
  - Default: `"google/nano-banana-pro/edit-developer"`

- **`prompt`** (`string`, _required_):
  The positive prompt for the generation.

- **`images`** (`array[string]`, _required_):
  List of URLs of input images for editing. The maximum number of images is 10.
  - Min items: 1
  - Max items: 10

- **`aspect_ratio`** (`string`, _optional_):
  The aspect ratio of the generated media.
  - Options: "1:1", "3:2", "2:3", "3:4", "4:3", "4:5", "5:4", "9:16", "16:9", "21:9"

- **`resolution`** (`string`, _optional_):
  The resolution of the output image.
  - Default: `"1k"`
  - Options: "1k", "2k", "4k"

- **`enable_web_search`** (`boolean`, _optional_):
  If enabled, the model will use web search to ground the generation with real-time information.
  - Default: `false`

- **`enable_sync_mode`** (`boolean`, _optional_):
  If set to true, the function will wait for the result to be generated and uploaded before returning the response. It allows you to get the result directly in the response. This property is only available through the API.
  - Default: `false`

- **`enable_base64_output`** (`boolean`, _optional_):
  If enabled, the output will be encoded into a BASE64 string instead of a URL. This property is only available through the API.
  - Default: `false`



**Required Parameters Example**:

```json
{
  "model": "google/nano-banana-pro/edit-developer",
  "prompt": "",
  "images": [
    ""
  ]
}
```


**Full Example**:

```json
{
  "model": "google/nano-banana-pro/edit-developer",
  "prompt": "",
  "images": [
    ""
  ],
  "aspect_ratio": "1:1",
  "resolution": "1k",
  "enable_web_search": false,
  "enable_sync_mode": false,
  "enable_base64_output": false
}
```


### Output Schema

The API returns the following output format:


- **`created_at`** (`string`, _optional_):
  ISO timestamp of when the request was created (e.g., “2023-04-01T12:34:56.789Z”).

- **`has_nsfw_contents`** (`array[boolean]`, _optional_):
  Array of boolean values indicating NSFW detection for each output.

- **`id`** (`string`, _optional_):
  Unique identifier for the prediction, the ID of the prediction to get.

- **`model`** (`string`, _optional_):
  Model ID used for the prediction.

- **`outputs`** (`array[string]`, _optional_):
  Array of URLs to the generated content (empty when status is not completed).

- **`status`** (`string`, _optional_):
  Status of the task: created, processing, completed, or failed.

- **`urls`** (`object`, _optional_):
  Object containing related API endpoints.



**Example Response**:

```json
{
  "created_at": "",
  "has_nsfw_contents": [],
  "id": "",
  "model": "",
  "outputs": [
    ""
  ],
  "status": "",
  "urls": {}
}
```


## Usage Examples

### cURL

```bash
# Step 1: Start generation (async)
curl -X POST "https://api.atlascloud.ai/api/v1/model/generateImage" \
  -H "Authorization: Bearer $ATLASCLOUD_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{
  "model": "google/nano-banana-pro/edit-developer",
  "prompt": "",
  "images": [
    ""
  ],
  "aspect_ratio": "1:1",
  "resolution": "1k",
  "enable_web_search": false,
  "enable_sync_mode": false,
  "enable_base64_output": false
}'

# Response will contain: {"code": 200, "data": {"id": "prediction_id", "status": "processing"}}

# Step 2: Poll for result (replace {prediction_id} with the id returned above)
curl -X GET "https://api.atlascloud.ai/api/v1/model/prediction/{prediction_id}" \
  -H "Authorization: Bearer $ATLASCLOUD_API_KEY"

# Keep polling until status is "completed", "succeeded" or "failed"
# When completed, outputs will contain the generated content URL(s)
```

## Additional Resources

### Documentation

- [Model Playground](https://www.atlascloud.ai/models/google/nano-banana-pro/edit-developer)

Live action movie still, Japanese youth film style. A beautiful young woman in a collegiate outfit running through a campus gate, laughing. Dynamic motion, hair flying. Surrounded by falling pink cherry blossom petals. Soft sunlight, airy atmosphere, shallow depth of field, bokeh background of other students, Fujifilm color tone, realistic, sharp focus on eyes.

Translate the text in the image into Brazilian Portuguese.

Make the hamburger made of glass.

Place the sofa from Figure 2 into Figure 1, and blend the light and shadows. The model merges these items into one cohesive interior scene, matching perspective, light direction, and color temperature automatically. For visual designers, this means realistic "in-context" images for furniture, décor, and home living categories — without 3D staging or studio photography.

Turn the image into Minecraft style.

Turn this scene into nighttime.


Nano Banana - Revolutionary Visual AI from Google

Newly launched

Also known as Gemini 2.5 Flash Image

Google's latest advance in multimodal AI technology, delivering unprecedented image generation and editing capabilities with lightning-fast speed and exceptional quality.

Advanced image generation

Multi-image blending technology
Character consistency across generations
Style-preserving transformation
Output at up to 4K resolution

Smart editing tools

Intelligent text-based editing
Add and remove objects
Change backgrounds
Style transfer and artistic effects

Transform to Figure

Photo to Character Figure

Transform any photo into a realistic character figure with packaging and display

Prompt

turn this photo into a character figure. Behind it, place a box with the character's image printed on it, and a computer showing the Blender modeling process on its screen. In front of the box, add a round plastic base with the character figure standing on it. set the scene indoors if possible

Anime to Real

Anime to Cosplay

Transform anime illustrations into realistic cosplay photography

Prompt

Generate a highly detailed photo of a girl cosplaying this illustration, at Comiket. Exactly replicate the same pose, body posture, hand gestures, facial expression, and camera framing as in the original illustration. Keep the same angle, perspective, and composition, without any deviation

Photo to Action Figure

Person to Action Figure

Transform people from photos into collectible action figures with custom packaging

Prompt

Transform the person in the photo into an action figure, styled after [CHARACTER_NAME] from [SOURCE / CONTEXT]. Next to the figure, display the accessories including [ITEM_1], [ITEM_2], and [ITEM_3]. On the top of the toy box, write "[BOX_LABEL_TOP]", and underneath it, "[BOX_LABEL_BOTTOM]". Place the box in a [BACKGROUND_SETTING] environment. Visualize this in a highly realistic way with attention to fine details.
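The bracketed markers in this prompt are placeholders to fill in before the text is sent as `prompt`. A minimal sketch of that substitution, which fails loudly if a marker has no value; the helper name `fill_prompt` and the sample values are illustrative, and the template is shortened here for brevity:

```python
import re


def fill_prompt(template: str, values: dict) -> str:
    """Replace [PLACEHOLDER] markers; raise KeyError if one has no value."""
    def substitute(match):
        key = match.group(1)
        if key not in values:
            raise KeyError(f"no value for placeholder [{key}]")
        return values[key]

    # Markers are upper-case names that may contain digits, "_", " " and "/"
    return re.sub(r"\[([A-Z0-9_ /]+)\]", substitute, template)


template = "an action figure, styled after [CHARACTER_NAME] from [SOURCE / CONTEXT]."
prompt = fill_prompt(template, {
    "CHARACTER_NAME": "a knight",
    "SOURCE / CONTEXT": "a fantasy novel",
})
print(prompt)  # an action figure, styled after a knight from a fantasy novel.
```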

Photo to Funko Pop

Person to Funko Pop Figure

Transform photos into Funko Pop style collectible figures with custom packaging

Prompt

Transform the person in the photo into the style of a Funko Pop figure packaging box, presented in an isometric perspective. Label the packaging with the title 'ZHOGUE'. Inside the box, showcase the figure based on the person in the photo, accompanied by their essential items (such as cosmetics, bags, or others). Next to the box, also display the actual figure itself outside of the packaging, rendered in a realistic and lifelike style.

Design to Reality

Product Design to Photorealistic Render

Transform product design sketches into photorealistic renders

Prompt

turn this illustration of a perfume into a realistic version, Frosted glass bottle with a marble cap

Face Reference Control

Transform to Q-Version Character

Create cartoon characters with face shape reference control

Prompt

Transform the person from image 1 into a Q-version character design based on the face shape from image 2

Architecture to Model

Building to 3D Architecture Model

Convert architectural photos into detailed physical models

Prompt

convert this photo into an architecture model. Behind the model, there should be a cardboard box with an image of the architecture from the photo on it. There should also be a computer, with the content on the computer screen showing the Blender modeling process of the figurine. In front of the cardboard box, place a cardstock and put the architecture model from the photo I provided on it. I hope the PVC material can be clearly presented. It would be even better if the background is indoors.

Technical highlights

Performance

Lightning-fast generation

Optimized for speed, with generation taking under 2 seconds for most tasks. Well suited to real-time applications and rapid prototyping workflows.

Quality

Outstanding output quality

Leverages Google's advanced AI architecture to produce highly detailed, realistic images with accurate lighting, textures, and composition.

Innovation

Novel view synthesis

A 2D-to-3D conversion capability that generates multiple viewpoints from a single image, opening new possibilities for content creation.

Ideal for

📸 Product photography

🎨 Digital art creation

✨ Photo enhancement

📊 Marketing visuals

👤 Character design

👔 Virtual try-on

📱 Social media

🔄 Photo restoration

Why choose Nano Banana?

🚀 No installation required

Start creating immediately, with no complex configuration or installation.

🎯 Precise control

Fine-tune every aspect of your creation with intuitive text instructions.

🔄 Consistent results

Maintain character and style consistency across multiple generations.

Technical specifications

Model architecture: Powered by Google AI Studio

Processing speed: Average generation time < 2 seconds

Resolution support: Up to 4096x4096 pixels

Format support: PNG, JPEG, and WebP output formats

Multimodal input: Text prompts, images, and combined prompts

API integration: RESTful API with complete documentation

Experience the power of Nano Banana AI

Join thousands of creators and businesses transforming their visual content with Google's most advanced AI image generation technology.

✨ Free credits to get started

⚡ Instant access

🌐 Available anywhere

Nano Banana Pro : A state-of-the-art, multimodal reasoning and image generation model by Google DeepMind

Model Card Overview

Field	Description
Model Name	Nano Banana Pro (also known as Gemini 3 Pro Image)
Developer	Google DeepMind
Release Date	November 20, 2025
Model Type	Multimodal Reasoning and Image Generation
Related Links	Official Product Page, Model Card (PDF)

Introduction

Nano Banana Pro, officially designated as Gemini 3 Pro Image, represents the next generation in Google's series of highly-capable, natively multimodal models. It is designed for professional asset production, integrating the advanced reasoning capabilities of the Gemini 3 Pro foundation model with a sophisticated image generation engine. The primary goal of Nano Banana Pro is to provide users with studio-quality precision and control, enabling the creation of complex, high-fidelity visuals from textual and image-based prompts. Its core contribution lies in its ability to understand and execute intricate instructions, maintain character and scene consistency, and render legible text directly within generated images, setting a new standard for professional creative workflows.

Key Features & Innovations

Nano Banana Pro introduces several technical breakthroughs that distinguish it from prior models:

Superior Text Rendering: The model excels at generating images that contain clear, accurate, and stylistically coherent text, making it ideal for creating posters, diagrams, and marketing materials.
Advanced Creative Controls: Users can exercise fine-grained control over image outputs, including camera angles, lighting transformations (e.g., day to night), color grading, depth of field, and localized editing.
High-Fidelity Consistency: It can maintain the consistency of up to 14 input images and blend up to 5 distinct characters seamlessly into complex compositions, ensuring visual coherence across a series of generated images.
Deep Real-World Knowledge: Built on Gemini 3 Pro, the model leverages a vast understanding of the world to generate contextually rich and factually grounded visuals, from detailed infographics to historically accurate scenes.
Multilingual Capabilities: The model can accurately render and translate text across multiple languages within an image, facilitating the localization of visual content.
Complex Composition from Multiple Inputs: Nano Banana Pro can synthesize elements from multiple source images and text prompts to create a single, cohesive scene, enabling complex creative concepts.

Model Architecture & Technical Details

Nano Banana Pro's architecture is fundamentally based on the Gemini 3 Pro model. While specific architectural details are not fully disclosed, the following technical information is available:

Foundation Model: Gemini 3 Pro
Inputs: The model accepts text strings and images as input, with a large context window of up to 1 million tokens.
Outputs: It generates high-resolution images (up to 4K) with a 64K token output capacity for handling complex generation tasks.
Training Infrastructure:
- Hardware: The model was trained on Google's custom-designed Tensor Processing Units (TPUs), which are optimized for large-scale machine learning computations and high-bandwidth memory access.
- Software: The training process utilized JAX and ML Pathways, Google's high-performance frameworks for machine learning research.
Knowledge Cutoff: The model's internal knowledge base has a cutoff date of January 2025.

Intended Use & Applications

Nano Banana Pro is intended for professional and creative applications that require a high degree of precision, control, and visual fidelity. It is well-suited for a variety of downstream tasks and application scenarios:

Professional Content Creation: Generating production-ready assets for marketing campaigns, advertising, and branding.
Design and Prototyping: Creating detailed product mockups, storyboards for film and animation, and architectural visualizations.
Informational Graphics: Designing complex and accurate infographics, educational diagrams, and data visualizations.
Artistic and Creative Expression: Enabling artists and designers to explore novel visual styles and create complex, multi-element compositions.

Performance

Nano Banana Pro's performance has been evaluated through extensive human evaluations and benchmarked against other leading image generation models. The results, measured in Elo scores, demonstrate its strong capabilities across a wide range of tasks.

A technical report also notes a performance dichotomy: while the model produces subjectively superior visual quality by hallucinating plausible details, it can lag behind specialist models in traditional quantitative metrics due to the stochastic nature of generative models.

Existing Capabilities (Elo Score Comparison)

Capability	Gemini 3 Pro Image	Gemini 2.5 Flash Image	GPT-Image 1	Seedream v4 4k	Flux Pro Kontext Max
Text Rendering	1198 ± 18	997 ± 10	1150 ± 14	1019 ± 13	854 ± 13
Stylization	1098 ± 11	933 ± 7	1069 ± 9	991 ± 9	908 ± 11
Multi-Turn	1186 ± 19	1045 ± 24	1079 ± 32	990 ± 32	889 ± 37
General Image Editing	1127 ± 13	996 ± 8	1011 ± 13	965 ± 12	902 ± 13
Character Editing	1176 ± 16	1075 ± 8	1016 ± 10	889 ± 10	843 ± 10
Object/Env. Editing	1102 ± 19	1025 ± 9	930 ± 12	983 ± 13	961 ± 10
General Text-to-Image	1094 ± 16	1037 ± 8	1025 ± 9	1011 ± 9	907 ± 9

New Capabilities (Elo Score Comparison)

Capability	Gemini 3 Pro Image	Gemini 2.5 Flash Image	GPT-Image 1	Seedream v4 4k	Flux Pro Kontext Max
Multi-character Editing	1213 ± 16	950 ± 10	997 ± 13	840 ± 19	-
Chart Editing	1209 ± 18	971 ± 10	994 ± 16	934 ± 16	893 ± 15
Text Editing	1202 ± 23	1001 ± 10	996 ± 14	860 ± 15	943 ± 12
Factuality - Edu	1169 ± 25	1050 ± 11	1084 ± 25	969 ± 22	884 ± 26
Infographics	1268 ± 17	1162 ± 11	1087 ± 12	1049 ± 12	824 ± 15
Visual Design	1104 ± 16	1083 ± 7	1028 ± 11	1038 ± 12	907 ± 11
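One way to read the Elo gaps in these tables is as an expected head-to-head preference rate. The sketch below assumes the conventional logistic Elo formula on a 400-point scale; the source does not state which Elo variant the evaluation used, so the result is only a rough interpretation. The function name `elo_expected_score` is illustrative:

```python
def elo_expected_score(rating_a: float, rating_b: float) -> float:
    """Expected score of A against B under the standard 400-point Elo model."""
    return 1.0 / (1.0 + 10 ** ((rating_b - rating_a) / 400.0))


# Text Rendering: Gemini 3 Pro Image (1198) vs GPT-Image 1 (1150)
p = elo_expected_score(1198, 1150)
print(f"{p:.3f}")  # roughly 0.57
```

Under that assumption, a 48-point lead corresponds to being preferred in a little over half of pairwise comparisons, and the reported ± intervals widen or narrow that estimate accordingly.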

Explore similar models

NEW

Image-to-Image

DEV

Nano Banana 2 Lite Edit Developer

Google's fastest and most cost-efficient Nano Banana image model for editing, applying natural-language edits and multi-image composition to up to 14 reference images with low latency.

Nano Banana 2 Lite Text-to-Image Developer

Google's fastest and most cost-efficient Nano Banana image model, turning natural-language text prompts into high-quality 1k images in as little as 4 seconds for rapid, high-volume generation.

Nano Banana 2 Lite Edit

Nano banana lite is the efficiency-focused model in the image generation family. Sub-2 second latency with cost-effective generation and editing, fast multi-turn local edits, and 14 supported aspect ratios.

Nano Banana 2 Lite Text-to-image

Nano Banana 2 Reference to Image

Google's advanced AI-powered video-to-image generation model, designed to generate high-quality static images from video clips combined with text instructions.

Nano Banana 2 Reference to Image Developer

Google's advanced AI-powered video-to-image generation model, designed to generate high-quality static images from video clips combined with text instructions.

Nano Banana 2 Text-to-Image Developer

Google's lightweight yet powerful AI image generation model, built for creators who need fast, high-quality visuals from simple text prompts.

Nano Banana 2 Text-to-Image

Google's lightweight yet powerful AI image generation model, built for creators who need fast, high-quality visuals from simple text prompts.

Nano Banana 2 Edit Developer

Google's advanced AI-powered image editing and generation model, designed to make visual transformation as intuitive as describing it in words.

Nano Banana 2 Edit

Google's advanced AI-powered image editing and generation model, designed to make visual transformation as intuitive as describing it in words.

Nano Banana Pro Text-to-image Ultra

Nano Banana Pro is the next-generation Nano Banana image model, delivering sharper detail, richer color control, and faster diffusion for production-ready visuals.

Nano Banana Pro Edit Ultra

Nano Banana Pro Edit is an image editing tool built on the Nano Banana model family, designed for precise, AI-powered visual adjustments.

Nano Banana Pro Text-to-image

Nano Banana Pro is the next-generation Nano Banana image model, delivering sharper detail, richer color control, and faster diffusion for production-ready visuals.

From

$0.14/image