google/nano-banana-pro/text-to-image-developer

Metin-Görüntü

PRODEV

Nano Banana Pro Text-to-Image Developer API by Google

google/nano-banana-pro/text-to-image-developer

Text-to-image-developer

Open and Advanced Large-Scale Image Generative Models.

Girdi

Prompt *

En Boy Oranı

Çözünürlük

Enable web search

Çıktı

Boşta

Oluşturulan görüntüleriniz burada görünecek

Parametreleri yapılandırın ve oluşturmaya başlamak için Çalıştır'a tıklayın

Her çalıştırma $0.07 maliyete sahip. 10$ ile yaklaşık 142 kez çalıştırabilirsiniz.

Şununla devam edebilirsiniz:

Görüntüden videoya Görüntüden görüntüye

Parametreler

Kod örneği
import requests
import time

# Step 1: Start image generation
generate_url = "https://api.atlascloud.ai/api/v1/model/generateImage"
headers = {
    "Content-Type": "application/json",
    "Authorization": "Bearer $ATLASCLOUD_API_KEY"
}
data = {
    "model": "google/nano-banana-pro/text-to-image-developer",  # Required. model name
    "aspect_ratio": "example_value",  # The aspect ratio of the generated media
    "enable_base64_output": False,  # If enabled, the output will be encoded into a BASE64 string instead of a URL
    "enable_sync_mode": False,  # If set to true, the function will wait for the result to be generated and uploaded before returning the response
    "enable_web_search": False,  # If enabled, the model will use web search to ground the generation with real-time information
    "prompt": "A beautiful landscape with mountains and lake",  # Required. The positive prompt for the generation
    "resolution": "1k",  # The resolution of the output image. options: 1k | 2k | 4k
}

generate_response = requests.post(generate_url, headers=headers, json=data)
generate_result = generate_response.json()
prediction_id = generate_result["data"]["id"]

# Step 2: Poll for result
poll_url = f"https://api.atlascloud.ai/api/v1/model/prediction/{prediction_id}"

def check_status():
    while True:
        response = requests.get(poll_url, headers={"Authorization": "Bearer $ATLASCLOUD_API_KEY"})
        result = response.json()

        if result["data"]["status"] == "completed":
            print("Generated image:", result["data"]["outputs"][0])
            return result["data"]["outputs"][0]
        elif result["data"]["status"] == "failed":
            raise Exception(result["data"]["error"] or "Generation failed")
        else:
            # Still processing, wait 2 seconds
            time.sleep(2)

image_url = check_status()

Kurulum

Programlama diliniz için gerekli paketi kurun.

pip install requests

Kimlik Doğrulama

Tüm API istekleri, API anahtarı ile kimlik doğrulama gerektirir. API anahtarınızı Atlas Cloud kontrol panelinden alabilirsiniz.

export ATLASCLOUD_API_KEY="your-api-key-here"

HTTP Başlıkları

import os

API_KEY = os.environ.get("ATLASCLOUD_API_KEY")
headers = {
    "Content-Type": "application/json",
    "Authorization": f"Bearer {API_KEY}"
}

API anahtarınızı güvende tutun

API anahtarınızı asla istemci tarafı kodunda veya herkese açık depolarda ifşa etmeyin. Bunun yerine ortam değişkenleri veya arka uç proxy kullanın.

İstek gönder

import requests

url = "https://api.atlascloud.ai/api/v1/model/generateImage"
headers = {
    "Content-Type": "application/json",
    "Authorization": "Bearer $ATLASCLOUD_API_KEY"
}
data = {
    "model": "your-model",
    "prompt": "A beautiful landscape"
}

response = requests.post(url, headers=headers, json=data)
print(response.json())

İstek Gönder

Asenkron bir oluşturma isteği gönderin. API, durumu kontrol etmek ve sonucu almak için kullanabileceğiniz bir tahmin ID'si döndürür.

POST/api/v1/model/generateImage

İstek Gövdesi

import requests

url = "https://api.atlascloud.ai/api/v1/model/generateImage"
headers = {
    "Content-Type": "application/json",
    "Authorization": "Bearer $ATLASCLOUD_API_KEY"
}

data = {
    "model": "google/nano-banana-pro/text-to-image-developer",
    "prompt": "A beautiful landscape with mountains and lake"
}

response = requests.post(url, headers=headers, json=data)
result = response.json()

print(f"Prediction ID: {result['data']['id']}")
print(f"Status: {result['data']['status']}")

Yanıt

{
  "code": 200,
  "data": {
    "id": "pred_abc123",
    "status": "processing",
    "model": "model-name",
    "created_at": "2025-01-01T00:00:00Z"
  }
}

Durumu Kontrol Et

İsteğinizin mevcut durumunu kontrol etmek için tahmin uç noktasını sorgulayın.

GET/api/v1/model/prediction/{prediction_id}

Sorgulama Örneği

import requests
import time

prediction_id = "pred_abc123"
url = f"https://api.atlascloud.ai/api/v1/model/prediction/{prediction_id}"
headers = { "Authorization": "Bearer $ATLASCLOUD_API_KEY" }

while True:
    response = requests.get(url, headers=headers)
    result = response.json()
    status = result["data"]["status"]
    print(f"Status: {status}")

    if status in ["completed", "succeeded"]:
        output_url = result["data"]["outputs"][0]
        print(f"Output URL: {output_url}")
        break
    elif status == "failed":
        print(f"Error: {result['data'].get('error', 'Unknown')}")
        break

    time.sleep(3)

Durum Değerleri

processingİstek hâlâ işleniyor.

completedOluşturma tamamlandı. Çıktılar kullanılabilir.

succeededOluşturma başarılı oldu. Çıktılar kullanılabilir.

failedOluşturma başarısız oldu. Hata alanını kontrol edin.

Tamamlanmış Yanıt

{
  "data": {
    "id": "pred_abc123",
    "status": "completed",
    "outputs": [
      "https://storage.atlascloud.ai/outputs/result.png"
    ],
    "metrics": {
      "predict_time": 8.3
    },
    "created_at": "2025-01-01T00:00:00Z",
    "completed_at": "2025-01-01T00:00:10Z"
  }
}

Dosya Yükle

Dosyaları Atlas Cloud depolama alanına yükleyin ve API isteklerinizde kullanabileceğiniz bir URL alın. Yüklemek için multipart/form-data kullanın.

POST/api/v1/model/uploadMedia

Yükleme Örneği

import requests

url = "https://api.atlascloud.ai/api/v1/model/uploadMedia"
headers = { "Authorization": "Bearer $ATLASCLOUD_API_KEY" }

with open("image.png", "rb") as f:
    files = {"file": ("image.png", f, "image/png")}
    response = requests.post(url, headers=headers, files=files)

result = response.json()
download_url = result["data"]["download_url"]
print(f"File URL: {download_url}")

Yanıt

{
  "data": {
    "download_url": "https://storage.atlascloud.ai/uploads/abc123/image.png",
    "file_name": "image.png",
    "content_type": "image/png",
    "size": 1024000
  }
}

Input Schema

İstek gövdesinde aşağıdaki parametreler kabul edilir.

Toplam: 7Zorunlu: 2İsteğe Bağlı: 5

modelstringrequired

model name

Default: "google/nano-banana-pro/text-to-image-developer"

aspect_ratiostring

The aspect ratio of the generated media.

1:13:22:33:44:34:55:49:1616:921:9

enable_base64_outputboolean

If enabled, the output will be encoded into a BASE64 string instead of a URL. This property is only available through the API.

Default: false

enable_sync_modeboolean

If set to true, the function will wait for the result to be generated and uploaded before returning the response. It allows you to get the result directly in the response. This property is only available through the API.

Default: false

enable_web_searchboolean

If enabled, the model will use web search to ground the generation with real-time information.

Default: false

promptstringrequired

The positive prompt for the generation.

resolutionstring

The resolution of the output image.

Default: "1k"

1k2k4k

Örnek İstek Gövdesi

{
  "model": "google/nano-banana-pro/text-to-image-developer",
  "enable_base64_output": false,
  "enable_sync_mode": false,
  "enable_web_search": false,
  "prompt": "A beautiful landscape",
  "resolution": "1k"
}

Output Schema

API, oluşturulan çıktı URL'lerini içeren bir tahmin yanıtı döndürür.

created_atstring

ISO timestamp of when the request was created (e.g., “2023-04-01T12:34:56.789Z”).

idstring

Unique identifier for the prediction, the ID of the prediction to get.

modelstring

Model ID used for the prediction.

outputsarray

Array of URLs to the generated content (empty when status is not completed).

statusstring

Status of the task: created, processing, completed, or failed.

Örnek Yanıt

{
  "id": "pred_abc123",
  "status": "completed",
  "model": "model-name",
  "outputs": [
    "https://storage.atlascloud.ai/outputs/result.png"
  ],
  "metrics": {
    "predict_time": 8.3
  },
  "created_at": "2025-01-01T00:00:00Z",
  "completed_at": "2025-01-01T00:00:10Z"
}

Atlas Cloud Skills

Atlas Cloud Skills, 400'den fazla AI modelini doğrudan AI kodlama asistanınıza entegre eder. Kurmak için tek bir komut, ardından görüntü, video oluşturmak ve LLM ile sohbet etmek için doğal dil kullanın.

Desteklenen İstemciler

Claude Code

OpenAI Codex

Gemini CLI

Cursor

Windsurf

VS Code

Trae

GitHub Copilot

Cline

Roo Code

Amp

Goose

Replit

40+ desteklenen i̇stemciler

Kurulum

npx skills add AtlasCloudAI/atlas-cloud-skills

API Anahtarını Ayarla

API anahtarınızı Atlas Cloud kontrol panelinden alın ve ortam değişkeni olarak ayarlayın.

export ATLASCLOUD_API_KEY="your-api-key-here"

Yetenekler

Kurulduktan sonra, tüm Atlas Cloud modellerine erişmek için AI asistanınızda doğal dil kullanabilirsiniz.

Görüntü OluşturmaNano Banana 2, Z-Image ve daha fazla model ile görüntüler oluşturun.

Video OluşturmaKling, Vidu, Veo vb. ile metin veya görüntülerden videolar oluşturun.

LLM SohbetQwen, DeepSeek ve diğer büyük dil modelleri ile sohbet edin.

Medya YüklemeGörüntü düzenleme ve görüntüden videoya iş akışları için yerel dosyaları yükleyin.

Daha fazla bilgi

github.com/AtlasCloudAI/atlas-cloud-skills

MCP Server

Atlas Cloud MCP Server, IDE'nizi Model Context Protocol aracılığıyla 400'den fazla AI modeline bağlar. Herhangi bir MCP uyumlu istemci ile çalışır.

Desteklenen İstemciler

Cursor

VS Code

Windsurf

Claude Code

OpenAI Codex

Gemini CLI

Cline

Roo Code

100+ desteklenen i̇stemciler

Kurulum

npx -y atlascloud-mcp

Yapılandırma

Aşağıdaki yapılandırmayı IDE'nizin MCP ayarları dosyasına ekleyin.

{
  "mcpServers": {
    "atlascloud": {
      "command": "npx",
      "args": [
        "-y",
        "atlascloud-mcp"
      ],
      "env": {
        "ATLASCLOUD_API_KEY": "your-api-key-here"
      }
    }
  }
}

Mevcut Araçlar

atlas_generate_imageMetin istemlerinden görüntüler oluşturun.

atlas_generate_videoMetin veya görüntülerden videolar oluşturun.

atlas_chatBüyük dil modelleri ile sohbet edin.

atlas_list_models400'den fazla mevcut AI modelini keşfedin.

atlas_quick_generateOtomatik model seçimi ile tek adımda içerik oluşturma.

atlas_upload_mediaAPI iş akışları için yerel dosyaları yükleyin.

Daha fazla bilgi

github.com/AtlasCloudAI/mcp-server

API Şeması

{
  "components": {
    "schemas": {
      "Input": {
        "properties": {
          "model": {
            "type": "string",
            "description": "model name",
            "default": "google/nano-banana-pro/text-to-image-developer"
          },
          "aspect_ratio": {
            "description": "The aspect ratio of the generated media.",
            "enum": [
              "1:1",
              "3:2",
              "2:3",
              "3:4",
              "4:3",
              "4:5",
              "5:4",
              "9:16",
              "16:9",
              "21:9"
            ],
            "type": "string",
            "x-placeholder": "Select aspect ratio"
          },
          "enable_base64_output": {
            "default": false,
            "description": "If enabled, the output will be encoded into a BASE64 string instead of a URL. This property is only available through the API.",
            "disabled": true,
            "type": "boolean"
          },
          "enable_sync_mode": {
            "default": false,
            "description": "If set to true, the function will wait for the result to be generated and uploaded before returning the response. It allows you to get the result directly in the response. This property is only available through the API.",
            "disabled": true,
            "type": "boolean"
          },
          "enable_web_search": {
            "default": false,
            "description": "If enabled, the model will use web search to ground the generation with real-time information.",
            "type": "boolean"
          },
          "prompt": {
            "description": "The positive prompt for the generation.",
            "type": "string"
          },
          "resolution": {
            "default": "1k",
            "description": "The resolution of the output image.",
            "enum": [
              "1k",
              "2k",
              "4k"
            ],
            "type": "string"
          }
        },
        "required": [
          "model",
          "prompt"
        ],
        "seed": {
          "title": "Seed",
          "type": "integer"
        },
        "type": "object",
        "x-order-properties": [
          "model",
          "prompt",
          "aspect_ratio",
          "resolution",
          "enable_web_search",
          "enable_sync_mode",
          "enable_base64_output"
        ]
      },
      "PredictionResponse": {
        "properties": {
          "created_at": {
            "description": "ISO timestamp of when the request was created (e.g., “2023-04-01T12:34:56.789Z”).",
            "format": "date-time",
            "type": "string"
          },
          "has_nsfw_contents": {
            "description": "Array of boolean values indicating NSFW detection for each output.",
            "items": {
              "type": "boolean"
            },
            "type": "array"
          },
          "id": {
            "description": "Unique identifier for the prediction, the ID of the prediction to get.",
            "type": "string"
          },
          "model": {
            "description": "Model ID used for the prediction.",
            "type": "string"
          },
          "outputs": {
            "description": "Array of URLs to the generated content (empty when status is not completed).",
            "items": {
              "type": "string"
            },
            "type": "array"
          },
          "status": {
            "description": "Status of the task: created, processing, completed, or failed.",
            "type": "string"
          },
          "urls": {
            "description": "Object containing related API endpoints.",
            "type": "object"
          }
        },
        "type": "object"
      }
    },
    "securitySchemes": {
      "apiKeyAuth": {
        "in": "header",
        "name": "Authorization",
        "type": "apiKey"
      }
    }
  },
  "info": {
    "description": "The AtlasCloud API.",
    "title": "AtlasCloud API",
    "version": "1.0.0"
  },
  "openapi": "3.0.0",
  "paths": {
    "/api/v1/model/generateImage": {
      "post": {
        "requestBody": {
          "content": {
            "application/json": {
              "schema": {
                "$ref": "#/components/schemas/Input"
              }
            }
          },
          "required": true
        },
        "responses": {
          "200": {
            "content": {
              "application/json": {
                "schema": {
                  "$ref": "#/components/schemas/PredictionResponse"
                }
              }
            },
            "description": "The request status."
          }
        }
      },
      "x-api-name": "model_run"
    },
    "/api/v1/model/prediction/{request_id}": {
      "get": {
        "parameters": [
          {
            "in": "path",
            "name": "request_id",
            "required": true,
            "schema": {
              "description": "Request ID",
              "type": "string"
            }
          }
        ],
        "responses": {
          "200": {
            "content": {
              "application/json": {
                "schema": {
                  "$ref": "#/components/schemas/PredictionResponse"
                }
              }
            },
            "description": "Result of the request."
          }
        }
      },
      "x-api-name": "model_result"
    }
  },
  "servers": [
    {
      "url": "https://api.atlascloud.ai"
    }
  ]
}

LLM Dostu İstem Şablonu

# google/nano-banana-pro/text-to-image-developer

> Open and Advanced Large-Scale Image Generative Models.


## Overview

- **Submit endpoint (POST)**: `https://api.atlascloud.ai/api/v1/model/generateImage` — start an async generation; returns a `prediction_id`
- **Poll endpoint (GET)**: `https://api.atlascloud.ai/api/v1/model/prediction/{prediction_id}` — poll this until the prediction finishes
- **Model ID**: `google/nano-banana-pro/text-to-image-developer`


## API Information

This model can be used via our HTTP API or more conveniently via our client libraries.
See the input and output schema below, as well as the usage examples.


### Input Schema

The API accepts the following input parameters:

- **`model`** (`string`, _required_):
  model name
  - Default: `"google/nano-banana-pro/text-to-image-developer"`

- **`prompt`** (`string`, _required_):
  The positive prompt for the generation.

- **`aspect_ratio`** (`string`, _optional_):
  The aspect ratio of the generated media.
  - Options: "1:1", "3:2", "2:3", "3:4", "4:3", "4:5", "5:4", "9:16", "16:9", "21:9"

- **`resolution`** (`string`, _optional_):
  The resolution of the output image.
  - Default: `"1k"`
  - Options: "1k", "2k", "4k"

- **`enable_web_search`** (`boolean`, _optional_):
  If enabled, the model will use web search to ground the generation with real-time information.
  - Default: `false`

- **`enable_sync_mode`** (`boolean`, _optional_):
  If set to true, the function will wait for the result to be generated and uploaded before returning the response. It allows you to get the result directly in the response. This property is only available through the API.
  - Default: `false`

- **`enable_base64_output`** (`boolean`, _optional_):
  If enabled, the output will be encoded into a BASE64 string instead of a URL. This property is only available through the API.
  - Default: `false`



**Required Parameters Example**:

```json
{
  "model": "google/nano-banana-pro/text-to-image-developer",
  "prompt": ""
}
```


**Full Example**:

```json
{
  "model": "google/nano-banana-pro/text-to-image-developer",
  "prompt": "",
  "aspect_ratio": "1:1",
  "resolution": "1k",
  "enable_web_search": false,
  "enable_sync_mode": false,
  "enable_base64_output": false
}
```


### Output Schema

The API returns the following output format:


- **`created_at`** (`string`, _optional_):
  ISO timestamp of when the request was created (e.g., “2023-04-01T12:34:56.789Z”).

- **`has_nsfw_contents`** (`array[boolean]`, _optional_):
  Array of boolean values indicating NSFW detection for each output.

- **`id`** (`string`, _optional_):
  Unique identifier for the prediction, the ID of the prediction to get.

- **`model`** (`string`, _optional_):
  Model ID used for the prediction.

- **`outputs`** (`array[string]`, _optional_):
  Array of URLs to the generated content (empty when status is not completed).

- **`status`** (`string`, _optional_):
  Status of the task: created, processing, completed, or failed.

- **`urls`** (`object`, _optional_):
  Object containing related API endpoints.



**Example Response**:

```json
{
  "created_at": "",
  "has_nsfw_contents": [],
  "id": "",
  "model": "",
  "outputs": [
    ""
  ],
  "status": "",
  "urls": {}
}
```


## Usage Examples

### cURL

```bash
# Step 1: Start generation (async)
curl -X POST "https://api.atlascloud.ai/api/v1/model/generateImage" \
  -H "Authorization: Bearer $ATLASCLOUD_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{
  "model": "google/nano-banana-pro/text-to-image-developer",
  "prompt": "",
  "aspect_ratio": "1:1",
  "resolution": "1k",
  "enable_web_search": false,
  "enable_sync_mode": false,
  "enable_base64_output": false
}'

# Response will contain: {"code": 200, "data": {"id": "prediction_id", "status": "processing"}}

# Step 2: Poll for result (replace {prediction_id} with the id returned above)
curl -X GET "https://api.atlascloud.ai/api/v1/model/prediction/{prediction_id}" \
  -H "Authorization: Bearer $ATLASCLOUD_API_KEY"

# Keep polling until status is "completed", "succeeded" or "failed"
# When completed, outputs will contain the generated content URL(s)
```

## Additional Resources

### Documentation

- [Model Playground](https://www.atlascloud.ai/models/google/nano-banana-pro/text-to-image-developer)

Create a vibrant and modern magazine cover for Women’s Health, themed for April 2025. The main background is a warm, orange gradient with soft shadows, evoking a fresh spring mood. Centered is a stylish young woman sitting confidently on color-blocked orange cubes. She has long, voluminous, wavy blonde hair and a natural, glowing complexion. She’s dressed in a forest green zip-up windbreaker jacket with loose sleeves and an orange top underneath, paired with white athletic crew socks branded ‘SAMOLA’ and retro-style white sneakers with thick black stripes and tan soles. One leg is propped up, creating a confident, athletic pose. Her expression is calm and poised. Include magazine headlines in stylish fonts, balancing black, white, and lime green text, placed thoughtfully around the subject: • Top left: ‘Covid: five years on’ in pale lime green with subtext in black: ‘Has the pandemic reshaped your identity?’ • Top right: ‘Spring forward’ in bold black with subtext: ‘How to eat, travel and sweat for your healthiest season yet’ • Center right: ‘15 skincare habits beauty founders swear by’ with large lime green ‘15’ • Bottom left: ‘FAKE VIEWS: Inside the scroll holes telling women how to “fix” themselves’ in black and pale pink • Bottom left corner with a green plus sign: ‘The workout that experts are calling a magic pill’ • Bottom right over the box: ‘Em the nutritionist’ in elegant white serif font, with yellow subheading: ‘In the kitchen with wellness’s favourite foodie’ Design should reflect an empowering, clean, editorial style, with an emphasis on health, wellness, and bold femininity. Lighting should be studio-bright, shadows soft and controlled.

Plant vs. Zombies 3D Peashooter Picture

Same female character, 3-angle turnaround (front, side, 3/4), consistent face and lighting, detailed outfit texture, animation design sheet.

Generate a screenshot of a windows 11 desktop, with google chrome open, showing a YouTube thumbnail of Mr. Beast on YouTube.com

Please create a solution layout for a mathematics problem with a paper-texture background. Requirements: split the canvas into left and right sections—\emph{left:} schematic of the plan (arrows/notes, scale, directions); \emph{right:} step-by-step derivation. Use consistent annotations in the figure: known quantities, unknowns, key relations, and coordinate axes or normals. Box the final answer and include a check. \textbf{Problem:} Given $v_0=20\,\text{m/s}$ and $\theta=30^\circ$, find the time of flight, the maximum height, and the range, and output the position at $t=1\,\text{s}$. Take $g=10\,\text{m/s}^2$. draw the question and solution.

A perfectly reflective chrome (Chrome) mirror ball placed on a black and white checkerboard.

High-quality flat lay photography creating a DIY infographic that simply explains how solar energy works, arranged on a clean, light gray textured background. The visual story flows from left to right in clear steps. Simple, clean black arrows are hand-drawn onto the background to guide the viewer's eye from the sun to the house, clearly marking the flow of energy. The overall mood is educational, modern, and easy to understand. The image is shot from a top-down, bird's-eye view with soft, even lighting that minimizes shadows and keeps the focus on the process. Format 16:9

An intense, cinematic 3D animation style render of a boxing match taking place inside a large sizzling frying pan. The main characters are an anthropomorphic French fry wearing a red, white, and blue sweatband and boxing gloves, fighting against an anthropomorphic onion wearing a blue wrestling singlet and a mustache. They are in a dynamic fighting pose with hot oil splashing around their feet and flying vegetable particles. In the background, a crowd of anthropomorphic burgers, potatoes, and hot dogs are watching and cheering. The lighting is professional studio lighting with a kitchen background, high quality, octane render, hyper-realistic food textures, 8k resolution.

actor standing on set surrounded by two large cinema cameras, LED walls behind creating a sci-fi backdrop illusion, crew marking positions on the floor, realistic production lighting, ultra-real cinematic style

behind-the-scenes of a high-end commercial shoot, fashion model under giant soft lights, photographers, stylists fixing details, production assistants holding reflectors, studio filled with pro equipment, crisp realistic image

Generate a black-and-white comic for me about a Japanese high school student being late for school.

a realistic person blended into an artistic collage of textures, shapes, torn paper layers, bold contrasting typography, experimental poster layout, overlapping elements, vibrant color palette, modern graphic design aesthetic, high-resolution details

Yükleniyor...

Gelişmiş Görüntü Oluşturma

Çoklu görüntü birleştirme teknolojisi
Nesiller boyu karakter tutarlılığı
Stil koruyucu dönüşümler
4K'ya kadar yüksek çözünürlüklü çıktı

Akıllı Düzenleme Araçları

Metin tabanlı akıllı düzenleme
Nesne ekleme ve kaldırma
Arka plan değiştirme
Stil transferi ve sanatsal efektler

Transform to Figure

Photo to Character Figure

Transform any photo into a realistic character figure with packaging and display

Prompt

turn this photo into a character figure. Behind it, place a box with the character's image printed on it, and a computer showing the Blender modeling process on its screen. In front of the box, add a round plastic base with the character figure standing on it. set the scene indoors if possible

Anime to Real

Anime to Cosplay

Transform anime illustrations into realistic cosplay photography

Prompt

Generate a highly detailed photo of a girl cosplaying this illustration, at Comiket. Exactly replicate the same pose, body posture, hand gestures, facial expression, and camera framing as in the original illustration. Keep the same angle, perspective, and composition, without any deviation

Photo to Action Figure

Person to Action Figure

Transform people from photos into collectible action figures with custom packaging

Prompt

Transform the the person in the photo into an action figure, styled after [CHARACTER_NAME] from [SOURCE / CONTEXT]. Next to the figure, display the accessories including [ITEM_1], [ITEM_2], and [ITEM_3]. On the top of the toy box, write "[BOX_LABEL_TOP]", and underneath it, "[BOX_LABEL_BOTTOM]". Place the box in a [BACKGROUND_SETTING] environment. Visualize this in a highly realistic way with attention to fine details.

Photo to Funko Pop

Person to Funko Pop Figure

Transform photos into Funko Pop style collectible figures with custom packaging

Prompt

Transform the person in the photo into the style of a Funko Pop figure packaging box, presented in an isometric perspective. Label the packaging with the title 'ZHOGUE'. Inside the box, showcase the figure based on the person in the photo, accompanied by their essential items (such as cosmetics, bags, or others). Next to the box, also display the actual figure itself outside of the packaging, rendered in a realistic and lifelike style.

Design to Reality

Product Design to Photorealistic Render

Transform product design sketches into photorealistic renders

Prompt

turn this illustration of a perfume into a realistic version, Frosted glass bottle with a marble cap

Face Reference Control

Transform to Q-Version Character

Create cartoon characters with face shape reference control

Prompt

Transform the person from image 1 into a Q-version character design based on the face shape from image 2

Architecture to Model

Building to 3D Architecture Model

Convert architectural photos into detailed physical models

Prompt

convert this photo into a architecture model. Behind the model, there should be a cardboard box with an image of the architecture from the photo on it. There should also be a computer, with the content on the computer screen showing the Blender modeling process of the figurine. In front of the cardboard box, place a cardstock and put the architecture model from the photo I provided on it. I hope the PVC material can be clearly presented. It would be even better if the background is indoors.

Teknik Öne Çıkanlar

Performans

Yıldırım Hızında Oluşturma

Çoğu görev için 2 saniyenin altında oluşturma süreleriyle hız için optimize edilmiş olup gerçek zamanlı uygulamalar ve hızlı prototip geliştirme iş akışları için mükemmeldir.

Kalite

Üstün Çıktı Kalitesi

Google'ın gelişmiş AI mimarisinden yararlanarak doğru aydınlatma, dokular ve kompozisyonlarla son derece ayrıntılı, fotogerçekçi görüntüler üretir.

Yenilik

Yeni Görünüm Sentezi

Tek bir görüntüden birden fazla bakış açısı oluşturmayı mümkün kılan devrimci 2D-3D dönüştürme özellikleri, içerik üretimi için yeni ufuklar açar.

Kullanım Alanları

📸

Ürün Fotoğrafçılığı

🎨

Dijital Sanat Üretimi

✨

Fotoğraf İyileştirme

📊

Pazarlama Görselleri

👤

Karakter Tasarımı

👔

Sanal Deneme

📱

Sosyal Medya

🔄

Fotoğraf Restorasyonu

Neden Nano Banana?

🚀

Kurulum Gerekmez

Karmaşık yapılandırma veya kurulum olmadan hemen yaratmaya başlayın

🎯

Hassas Kontrol

Sezgisel metin komutlarıyla yaratımınızın her yönünü ince ayarlayın

🔄

Tutarlı Sonuçlar

Birden fazla nesil boyunca karakter ve stil tutarlılığını koruyun

Teknik Özellikler

Model Mimarisi:Google AI Studio Destekli

İşlem Hızı:Ortalama oluşturma süresi < 2 saniye

Çözünürlük Desteği:4096x4096 piksele kadar

Format Desteği:PNG, JPEG, WebP çıktı formatları

Çok Modlu Giriş:Metin, Görüntü ve Kombine istemler

API Entegrasyonu:Kapsamlı dokümantasyonla RESTful API

Nano Banana AI'nın Gücünü Deneyimleyin

Google'ın en gelişmiş görüntü AI teknolojisiyle görsel içeriğini halihazırda dönüştüren binlerce yaratıcı ve işletmeye katılın.

✨Başlangıç için Ücretsiz Krediler

⚡Anında Erişim

🌐Her Yerde Çalışır

Nano Banana Pro : A state-of-the-art, multimodal reasoning and image generation model by Google DeepMind

Model Card Overview

Field	Description
Model Name	Nano Banana Pro (also known as Gemini 3 Pro Image)
Developer	Google DeepMind
Release Date	November 20, 2025
Model Type	Multimodal Reasoning and Image Generation
Related Links	Official Product Page, Model Card (PDF)

Introduction

Nano Banana Pro, officially designated as Gemini 3 Pro Image, represents the next generation in Google's series of highly-capable, natively multimodal models. It is designed for professional asset production, integrating the advanced reasoning capabilities of the Gemini 3 Pro foundation model with a sophisticated image generation engine. The primary goal of Nano Banana Pro is to provide users with studio-quality precision and control, enabling the creation of complex, high-fidelity visuals from textual and image-based prompts. Its core contribution lies in its ability to understand and execute intricate instructions, maintain character and scene consistency, and render legible text directly within generated images, setting a new standard for professional creative workflows.

Key Features & Innovations

Nano Banana Pro introduces several technical breakthroughs that distinguish it from prior models:

Superior Text Rendering: The model excels at generating images that contain clear, accurate, and stylistically coherent text, making it ideal for creating posters, diagrams, and marketing materials.
Advanced Creative Controls: Users can exercise fine-grained control over image outputs, including camera angles, lighting transformations (e.g., day to night), color grading, depth of field, and localized editing.
High-Fidelity Consistency: It can maintain the consistency of up to 14 input images and blend up to 5 distinct characters seamlessly into complex compositions, ensuring visual coherence across a series of generated images.
Deep Real-World Knowledge: Built on Gemini 3 Pro, the model leverages a vast understanding of the world to generate contextually rich and factually grounded visuals, from detailed infographics to historically accurate scenes.
Multilingual Capabilities: The model can accurately render and translate text across multiple languages within an image, facilitating the localization of visual content.
Complex Composition from Multiple Inputs: Nano Banana Pro can synthesize elements from multiple source images and text prompts to create a single, cohesive scene, enabling complex creative concepts.

Model Architecture & Technical Details

Nano Banana Pro's architecture is fundamentally based on the Gemini 3 Pro model. While specific architectural details are not fully disclosed, the following technical information is available:

Foundation Model: Gemini 3 Pro
Inputs: The model accepts text strings and images as input, with a large context window of up to 1 million tokens.
Outputs: It generates high-resolution images (up to 4K) with a 64K token output capacity for handling complex generation tasks.
Training Infrastructure:
- Hardware: The model was trained on Google's custom-designed Tensor Processing Units (TPUs), which are optimized for large-scale machine learning computations and high-bandwidth memory access.
- Software: The training process utilized JAX and ML Pathways, Google's high-performance frameworks for machine learning research.
Knowledge Cutoff: The model's internal knowledge base has a cutoff date of January 2025.

Intended Use & Applications

Nano Banana Pro is intended for professional and creative applications that require a high degree of precision, control, and visual fidelity. It is well-suited for a variety of downstream tasks and application scenarios:

Professional Content Creation: Generating production-ready assets for marketing campaigns, advertising, and branding.
Design and Prototyping: Creating detailed product mockups, storyboards for film and animation, and architectural visualizations.
Informational Graphics: Designing complex and accurate infographics, educational diagrams, and data visualizations.
Artistic and Creative Expression: Enabling artists and designers to explore novel visual styles and create complex, multi-element compositions.

Performance

Nano Banana Pro's performance has been evaluated through extensive human evaluations and benchmarked against other leading image generation models. The results, measured in Elo scores, demonstrate its strong capabilities across a wide range of tasks.

A technical report also notes a performance dichotomy: while the model produces subjectively superior visual quality by hallucinating plausible details, it can lag behind specialist models in traditional quantitative metrics due to the stochastic nature of generative models.

Existing Capabilities (Elo Score Comparison)

Capability	Gemini 3 Pro Image	Gemini 2.5 Flash Image	GPT-Image 1	Seedream v4 4k	Flux Pro Kontext Max
Text Rendering	1198 ± 18	997 ± 10	1150 ± 14	1019 ± 13	854 ± 13
Stylization	1098 ± 11	933 ± 7	1069 ± 9	991 ± 9	908 ± 11
Multi-Turn	1186 ± 19	1045 ± 24	1079 ± 32	990 ± 32	889 ± 37
General Image Editing	1127 ± 13	996 ± 8	1011 ± 13	965 ± 12	902 ± 13
Character Editing	1176 ± 16	1075 ± 8	1016 ± 10	889 ± 10	843 ± 10
Object/Env. Editing	1102 ± 19	1025 ± 9	930 ± 12	983 ± 13	961 ± 10
General Text-to-Image	1094 ± 16	1037 ± 8	1025 ± 9	1011 ± 9	907 ± 9

New Capabilities (Elo Score Comparison)

Capability	Gemini 3 Pro Image	Gemini 2.5 Flash Image	GPT-Image 1	Seedream v4 4k	Flux Pro Kontext Max
Multi-character Editing	1213 ± 16	950 ± 10	997 ± 13	840 ± 19	-
Chart Editing	1209 ± 18	971 ± 10	994 ± 16	934 ± 16	893 ± 15
Text Editing	1202 ± 23	1001 ± 10	996 ± 14	860 ± 15	943 ± 12
Factuality - Edu	1169 ± 25	1050 ± 11	1084 ± 25	969 ± 22	884 ± 26
Infographics	1268 ± 17	1162 ± 11	1087 ± 12	1049 ± 12	824 ± 15
Visual Design	1104 ± 16	1083 ± 7	1028 ± 11	1038 ± 12	907 ± 11

Benzer Modelleri Keşfedin

NEW

Görüntü-Görüntü

DEV

Nano Banana 2 Lite Edit Developer

Google's fastest and most cost-efficient Nano Banana image model for editing, applying natural-language edits and multi-image composition to up to 14 reference images with low latency.

Nano Banana 2 Lite Text-to-Image Developer

Google's fastest and most cost-efficient Nano Banana image model, turning natural-language text prompts into high-quality 1k images in as little as 4 seconds for rapid, high-volume generation.

Nano Banana 2 Lite Edit

Nano banana lite is the efficiency-focused model in the image generation family. Sub-2 second latency with cost-effective generation and editing, fast multi-turn local edits, and 14 supported aspect ratios.

Nano Banana 2 Lite Text-to-image

Nano Banana 2 Reference to Image

Google's advanced AI-powered video-to-image generation model, designed to generate high-quality static images from video clips combined with text instructions.

Nano Banana 2 Reference to Image Developer

Google's advanced AI-powered video-to-image generation model, designed to generate high-quality static images from video clips combined with text instructions.

Nano Banana 2 Text-to-Image Developer

Google's lightweight yet powerful AI image generation model, built for creators who need fast, high-quality visuals from simple text prompts.

Nano Banana 2 Text-to-Image

Google's lightweight yet powerful AI image generation model, built for creators who need fast, high-quality visuals from simple text prompts.

Nano Banana 2 Edit Developer

Google's advanced AI-powered image editing and generation model, designed to make visual transformation as intuitive as describing it in words.

Nano Banana 2 Edit

Google's advanced AI-powered image editing and generation model, designed to make visual transformation as intuitive as describing it in words.

Nano Banana Pro Text-to-image Ultra

Nano Banana Pro is the next-generation Nano Banana image model, delivering sharper detail, richer color control, and faster diffusion for production-ready visuals.

Nano Banana Pro Edit Ultra

Nano Banana Pro Edit is an image editing tool built on the Nano Banana model family, designed for precise, AI-powered visual adjustments.

Nano Banana Pro Text-to-image

Nano Banana Pro is the next-generation Nano Banana image model, delivering sharper detail, richer color control, and faster diffusion for production-ready visuals.

From

$0.14/GÖRÜNTÜ