google/nano-banana-pro/text-to-image-developer

Open and Advanced Large-Scale Image Generative Models.

TEXT-TO-IMAGENEW
Nano Banana Pro Text-to-image Developer
Metin-Görüntü
PRODEV

Open and Advanced Large-Scale Image Generative Models.

Girdi

Parametre yapılandırması yükleniyor...

Çıktı

Boşta
Oluşturulan görüntüleriniz burada görünecek
Parametreleri yapılandırın ve oluşturmaya başlamak için Çalıştır'a tıklayın

Her çalıştırma $0.084 maliyete sahip. 10$ ile yaklaşık 119 kez çalıştırabilirsiniz.

Parametreler

Kod örneği

import requests
import time

# Step 1: Start image generation
generate_url = "https://api.atlascloud.ai/api/v1/model/generateImage"
headers = {
    "Content-Type": "application/json",
    "Authorization": "Bearer $ATLASCLOUD_API_KEY"
}
data = {
    "model": "google/nano-banana-pro/text-to-image-developer",
    "prompt": "A beautiful landscape with mountains and lake",
    "width": 512,
    "height": 512,
    "steps": 20,
    "guidance_scale": 7.5,
}

generate_response = requests.post(generate_url, headers=headers, json=data)
generate_result = generate_response.json()
prediction_id = generate_result["data"]["id"]

# Step 2: Poll for result
poll_url = f"https://api.atlascloud.ai/api/v1/model/prediction/{prediction_id}"

def check_status():
    while True:
        response = requests.get(poll_url, headers={"Authorization": "Bearer $ATLASCLOUD_API_KEY"})
        result = response.json()

        if result["data"]["status"] == "completed":
            print("Generated image:", result["data"]["outputs"][0])
            return result["data"]["outputs"][0]
        elif result["data"]["status"] == "failed":
            raise Exception(result["data"]["error"] or "Generation failed")
        else:
            # Still processing, wait 2 seconds
            time.sleep(2)

image_url = check_status()

Kurulum

Programlama diliniz için gerekli paketi kurun.

bash
pip install requests

Kimlik Doğrulama

Tüm API istekleri, API anahtarı ile kimlik doğrulama gerektirir. API anahtarınızı Atlas Cloud kontrol panelinden alabilirsiniz.

bash
export ATLASCLOUD_API_KEY="your-api-key-here"

HTTP Başlıkları

python
import os

API_KEY = os.environ.get("ATLASCLOUD_API_KEY")
headers = {
    "Content-Type": "application/json",
    "Authorization": f"Bearer {API_KEY}"
}
API anahtarınızı güvende tutun

API anahtarınızı asla istemci tarafı kodunda veya herkese açık depolarda ifşa etmeyin. Bunun yerine ortam değişkenleri veya arka uç proxy kullanın.

İstek gönder

import requests

url = "https://api.atlascloud.ai/api/v1/model/generateImage"
headers = {
    "Content-Type": "application/json",
    "Authorization": "Bearer $ATLASCLOUD_API_KEY"
}
data = {
    "model": "your-model",
    "prompt": "A beautiful landscape"
}

response = requests.post(url, headers=headers, json=data)
print(response.json())

İstek Gönder

Asenkron bir oluşturma isteği gönderin. API, durumu kontrol etmek ve sonucu almak için kullanabileceğiniz bir tahmin ID'si döndürür.

POST/api/v1/model/generateImage

İstek Gövdesi

import requests

url = "https://api.atlascloud.ai/api/v1/model/generateImage"
headers = {
    "Content-Type": "application/json",
    "Authorization": "Bearer $ATLASCLOUD_API_KEY"
}

data = {
    "model": "google/nano-banana-pro/text-to-image-developer",
    "input": {
        "prompt": "A beautiful landscape with mountains and lake"
    }
}

response = requests.post(url, headers=headers, json=data)
result = response.json()

print(f"Prediction ID: {result['id']}")
print(f"Status: {result['status']}")

Yanıt

{
  "id": "pred_abc123",
  "status": "processing",
  "model": "model-name",
  "created_at": "2025-01-01T00:00:00Z"
}

Durumu Kontrol Et

İsteğinizin mevcut durumunu kontrol etmek için tahmin uç noktasını sorgulayın.

GET/api/v1/model/prediction/{prediction_id}

Sorgulama Örneği

import requests
import time

prediction_id = "pred_abc123"
url = f"https://api.atlascloud.ai/api/v1/model/prediction/{prediction_id}"
headers = { "Authorization": "Bearer $ATLASCLOUD_API_KEY" }

while True:
    response = requests.get(url, headers=headers)
    result = response.json()
    status = result["data"]["status"]
    print(f"Status: {status}")

    if status in ["completed", "succeeded"]:
        output_url = result["data"]["outputs"][0]
        print(f"Output URL: {output_url}")
        break
    elif status == "failed":
        print(f"Error: {result['data'].get('error', 'Unknown')}")
        break

    time.sleep(3)

Durum Değerleri

processingİstek hâlâ işleniyor.
completedOluşturma tamamlandı. Çıktılar kullanılabilir.
succeededOluşturma başarılı oldu. Çıktılar kullanılabilir.
failedOluşturma başarısız oldu. Hata alanını kontrol edin.

Tamamlanmış Yanıt

{
  "data": {
    "id": "pred_abc123",
    "status": "completed",
    "outputs": [
      "https://storage.atlascloud.ai/outputs/result.png"
    ],
    "metrics": {
      "predict_time": 8.3
    },
    "created_at": "2025-01-01T00:00:00Z",
    "completed_at": "2025-01-01T00:00:10Z"
  }
}

Dosya Yükle

Dosyaları Atlas Cloud depolama alanına yükleyin ve API isteklerinizde kullanabileceğiniz bir URL alın. Yüklemek için multipart/form-data kullanın.

POST/api/v1/model/uploadMedia

Yükleme Örneği

import requests

url = "https://api.atlascloud.ai/api/v1/model/uploadMedia"
headers = { "Authorization": "Bearer $ATLASCLOUD_API_KEY" }

with open("image.png", "rb") as f:
    files = {"file": ("image.png", f, "image/png")}
    response = requests.post(url, headers=headers, files=files)

result = response.json()
download_url = result["data"]["download_url"]
print(f"File URL: {download_url}")

Yanıt

{
  "data": {
    "download_url": "https://storage.atlascloud.ai/uploads/abc123/image.png",
    "file_name": "image.png",
    "content_type": "image/png",
    "size": 1024000
  }
}

Input Schema

İstek gövdesinde aşağıdaki parametreler kabul edilir.

Toplam: 0Zorunlu: 0İsteğe Bağlı: 0

Kullanılabilir parametre yok.

Örnek İstek Gövdesi

json
{
  "model": "google/nano-banana-pro/text-to-image-developer"
}

Output Schema

API, oluşturulan çıktı URL'lerini içeren bir tahmin yanıtı döndürür.

idstringrequired
Unique identifier for the prediction.
statusstringrequired
Current status of the prediction.
processingcompletedsucceededfailed
modelstringrequired
The model used for generation.
outputsarray[string]
Array of output URLs. Available when status is "completed".
errorstring
Error message if status is "failed".
metricsobject
Performance metrics.
predict_timenumber
Time taken for image generation in seconds.
created_atstringrequired
ISO 8601 timestamp when the prediction was created.
Format: date-time
completed_atstring
ISO 8601 timestamp when the prediction was completed.
Format: date-time

Örnek Yanıt

json
{
  "id": "pred_abc123",
  "status": "completed",
  "model": "model-name",
  "outputs": [
    "https://storage.atlascloud.ai/outputs/result.png"
  ],
  "metrics": {
    "predict_time": 8.3
  },
  "created_at": "2025-01-01T00:00:00Z",
  "completed_at": "2025-01-01T00:00:10Z"
}

Atlas Cloud Skills

Atlas Cloud Skills, 300'den fazla AI modelini doğrudan AI kodlama asistanınıza entegre eder. Kurmak için tek bir komut, ardından görüntü, video oluşturmak ve LLM ile sohbet etmek için doğal dil kullanın.

Desteklenen İstemciler

Claude Code
OpenAI Codex
Gemini CLI
Cursor
Windsurf
VS Code
Trae
GitHub Copilot
Cline
Roo Code
Amp
Goose
Replit
40+ desteklenen i̇stemciler

Kurulum

bash
npx skills add AtlasCloudAI/atlas-cloud-skills

API Anahtarını Ayarla

API anahtarınızı Atlas Cloud kontrol panelinden alın ve ortam değişkeni olarak ayarlayın.

bash
export ATLASCLOUD_API_KEY="your-api-key-here"

Yetenekler

Kurulduktan sonra, tüm Atlas Cloud modellerine erişmek için AI asistanınızda doğal dil kullanabilirsiniz.

Görüntü OluşturmaNano Banana 2, Z-Image ve daha fazla model ile görüntüler oluşturun.
Video OluşturmaKling, Vidu, Veo vb. ile metin veya görüntülerden videolar oluşturun.
LLM SohbetQwen, DeepSeek ve diğer büyük dil modelleri ile sohbet edin.
Medya YüklemeGörüntü düzenleme ve görüntüden videoya iş akışları için yerel dosyaları yükleyin.

MCP Server

Atlas Cloud MCP Server, IDE'nizi Model Context Protocol aracılığıyla 300'den fazla AI modeline bağlar. Herhangi bir MCP uyumlu istemci ile çalışır.

Desteklenen İstemciler

Cursor
VS Code
Windsurf
Claude Code
OpenAI Codex
Gemini CLI
Cline
Roo Code
100+ desteklenen i̇stemciler

Kurulum

bash
npx -y atlascloud-mcp

Yapılandırma

Aşağıdaki yapılandırmayı IDE'nizin MCP ayarları dosyasına ekleyin.

json
{
  "mcpServers": {
    "atlascloud": {
      "command": "npx",
      "args": [
        "-y",
        "atlascloud-mcp"
      ],
      "env": {
        "ATLASCLOUD_API_KEY": "your-api-key-here"
      }
    }
  }
}

Mevcut Araçlar

atlas_generate_imageMetin istemlerinden görüntüler oluşturun.
atlas_generate_videoMetin veya görüntülerden videolar oluşturun.
atlas_chatBüyük dil modelleri ile sohbet edin.
atlas_list_models300'den fazla mevcut AI modelini keşfedin.
atlas_quick_generateOtomatik model seçimi ile tek adımda içerik oluşturma.
atlas_upload_mediaAPI iş akışları için yerel dosyaları yükleyin.

API Şeması

Şema mevcut değil

İstek geçmişini görüntülemek için oturum açın

Model istek geçmişinize erişmek için oturum açmanız gerekir.

Oturum Aç

Seedance 1.5 Pro

YERLİ SES-GÖRSEL OLUŞTURMA

Ses ve Görüntü, Tek Çekimde Hepsi

ByteDance'in tek birleşik süreçten eş zamanlı olarak mükemmel senkronize ses ve video üreten devrim niteliğindeki yapay zeka modeli. 8'den fazla dilde milisaniye hassasiyetinde dudak senkronizasyonu ile gerçek yerli ses-görsel üretimi deneyimleyin.

Advanced Image Generation
  • Multi-image fusion technology
  • Character consistency across generations
  • Style-preserving transformations
  • High-resolution output up to 4K
Smart Editing Tools
  • Text-based intelligent editing
  • Object addition and removal
  • Background replacement
  • Style transfer and artistic effects

Prompt Examples & Templates

Explore curated prompt templates to unlock the full potential of Nano Banana AI. Click to copy any prompt and start creating immediately.

Photo to Character Figure
Transform to Figure

Photo to Character Figure

Transform any photo into a realistic character figure with packaging and display
Prompt

turn this photo into a character figure. Behind it, place a box with the character's image printed on it, and a computer showing the Blender modeling process on its screen. In front of the box, add a round plastic base with the character figure standing on it. set the scene indoors if possible

Anime to Cosplay
Anime to Real

Anime to Cosplay

Transform anime illustrations into realistic cosplay photography
Prompt

Generate a highly detailed photo of a girl cosplaying this illustration, at Comiket. Exactly replicate the same pose, body posture, hand gestures, facial expression, and camera framing as in the original illustration. Keep the same angle, perspective, and composition, without any deviation

Person to Action Figure
Photo to Action Figure

Person to Action Figure

Transform people from photos into collectible action figures with custom packaging
Prompt

Transform the the person in the photo into an action figure, styled after [CHARACTER_NAME] from [SOURCE / CONTEXT]. Next to the figure, display the accessories including [ITEM_1], [ITEM_2], and [ITEM_3]. On the top of the toy box, write "[BOX_LABEL_TOP]", and underneath it, "[BOX_LABEL_BOTTOM]". Place the box in a [BACKGROUND_SETTING] environment. Visualize this in a highly realistic way with attention to fine details.

Person to Funko Pop Figure
Photo to Funko Pop

Person to Funko Pop Figure

Transform photos into Funko Pop style collectible figures with custom packaging
Prompt

Transform the person in the photo into the style of a Funko Pop figure packaging box, presented in an isometric perspective. Label the packaging with the title 'ZHOGUE'. Inside the box, showcase the figure based on the person in the photo, accompanied by their essential items (such as cosmetics, bags, or others). Next to the box, also display the actual figure itself outside of the packaging, rendered in a realistic and lifelike style.

Product Design to Photorealistic Render
Design to Reality

Product Design to Photorealistic Render

Transform product design sketches into photorealistic renders
Prompt

turn this illustration of a perfume into a realistic version, Frosted glass bottle with a marble cap

Transform to Q-Version Character
Face Reference Control

Transform to Q-Version Character

Create cartoon characters with face shape reference control
Prompt

Transform the person from image 1 into a Q-version character design based on the face shape from image 2

Building to 3D Architecture Model
Architecture to Model

Building to 3D Architecture Model

Convert architectural photos into detailed physical models
Prompt

convert this photo into a architecture model. Behind the model, there should be a cardboard box with an image of the architecture from the photo on it. There should also be a computer, with the content on the computer screen showing the Blender modeling process of the figurine. In front of the cardboard box, place a cardstock and put the architecture model from the photo I provided on it. I hope the PVC material can be clearly presented. It would be even better if the background is indoors.

Technical Highlights

Performance
Lightning-Fast Generation

Optimized for speed with generation times under 2 seconds for most tasks, making it perfect for real-time applications and rapid prototyping workflows.

Quality
Exceptional Output Quality

Leveraging Google's advanced AI architecture to produce highly detailed, photorealistic images with accurate lighting, textures, and compositions.

Innovation
Novel View Synthesis

Revolutionary 2D-to-3D conversion capabilities enabling creation of multiple viewpoints from a single image, opening new possibilities for content creation.

Mükemmel Kullanım Alanları

📸
Product Photography
🎨
Digital Art Creation
Photo Enhancement
📊
Marketing Visuals
👤
Character Design
👔
Virtual Try-On
📱
Social Media
🔄
Photo Restoration

Why Choose Nano Banana?

🚀
No Setup Required
Start creating immediately without complex configurations or installations
🎯
Precision Control
Fine-tune every aspect of your creation with intuitive text commands
🔄
Consistent Results
Maintain character and style consistency across multiple generations

Teknik Özellikler

Model Architecture:Google AI Studio Powered
Processing Speed:< 2 seconds average generation time
Resolution Support:Up to 4096x4096 pixels
Format Support:PNG, JPEG, WebP output formats
Multi-modal Input:Text, Image, and Combined prompts
API Integration:RESTful API with comprehensive documentation

Yerli Ses-Görsel Üretimi Deneyimleyin

Seedance 1.5 Pro'nun çığır açan teknolojisi ile video içerik oluşturmayı devrimleştiren dünya çapındaki film yapımcılarına, reklamverenlere ve içerik üreticilerine katılın.

Free Credits to Start
Instant Access
🌐Works Everywhere

Nano Banana Pro : A state-of-the-art, multimodal reasoning and image generation model by Google DeepMind

Model Card Overview

FieldDescription
Model NameNano Banana Pro (also known as Gemini 3 Pro Image)
DeveloperGoogle DeepMind
Release DateNovember 20, 2025
Model TypeMultimodal Reasoning and Image Generation
Related LinksOfficial Product Page, Model Card (PDF)

Introduction

Nano Banana Pro, officially designated as Gemini 3 Pro Image, represents the next generation in Google's series of highly-capable, natively multimodal models. It is designed for professional asset production, integrating the advanced reasoning capabilities of the Gemini 3 Pro foundation model with a sophisticated image generation engine. The primary goal of Nano Banana Pro is to provide users with studio-quality precision and control, enabling the creation of complex, high-fidelity visuals from textual and image-based prompts. Its core contribution lies in its ability to understand and execute intricate instructions, maintain character and scene consistency, and render legible text directly within generated images, setting a new standard for professional creative workflows.

Key Features & Innovations

Nano Banana Pro introduces several technical breakthroughs that distinguish it from prior models:

  • Superior Text Rendering: The model excels at generating images that contain clear, accurate, and stylistically coherent text, making it ideal for creating posters, diagrams, and marketing materials.
  • Advanced Creative Controls: Users can exercise fine-grained control over image outputs, including camera angles, lighting transformations (e.g., day to night), color grading, depth of field, and localized editing.
  • High-Fidelity Consistency: It can maintain the consistency of up to 14 input images and blend up to 5 distinct characters seamlessly into complex compositions, ensuring visual coherence across a series of generated images.
  • Deep Real-World Knowledge: Built on Gemini 3 Pro, the model leverages a vast understanding of the world to generate contextually rich and factually grounded visuals, from detailed infographics to historically accurate scenes.
  • Multilingual Capabilities: The model can accurately render and translate text across multiple languages within an image, facilitating the localization of visual content.
  • Complex Composition from Multiple Inputs: Nano Banana Pro can synthesize elements from multiple source images and text prompts to create a single, cohesive scene, enabling complex creative concepts.

Model Architecture & Technical Details

Nano Banana Pro's architecture is fundamentally based on the Gemini 3 Pro model. While specific architectural details are not fully disclosed, the following technical information is available:

  • Foundation Model: Gemini 3 Pro
  • Inputs: The model accepts text strings and images as input, with a large context window of up to 1 million tokens.
  • Outputs: It generates high-resolution images (up to 4K) with a 64K token output capacity for handling complex generation tasks.
  • Training Infrastructure:
    • Hardware: The model was trained on Google's custom-designed Tensor Processing Units (TPUs), which are optimized for large-scale machine learning computations and high-bandwidth memory access.
    • Software: The training process utilized JAX and ML Pathways, Google's high-performance frameworks for machine learning research.
  • Knowledge Cutoff: The model's internal knowledge base has a cutoff date of January 2025.

Intended Use & Applications

Nano Banana Pro is intended for professional and creative applications that require a high degree of precision, control, and visual fidelity. It is well-suited for a variety of downstream tasks and application scenarios:

  • Professional Content Creation: Generating production-ready assets for marketing campaigns, advertising, and branding.
  • Design and Prototyping: Creating detailed product mockups, storyboards for film and animation, and architectural visualizations.
  • Informational Graphics: Designing complex and accurate infographics, educational diagrams, and data visualizations.
  • Artistic and Creative Expression: Enabling artists and designers to explore novel visual styles and create complex, multi-element compositions.

Performance

Nano Banana Pro's performance has been evaluated through extensive human evaluations and benchmarked against other leading image generation models. The results, measured in Elo scores, demonstrate its strong capabilities across a wide range of tasks.

A technical report also notes a performance dichotomy: while the model produces subjectively superior visual quality by hallucinating plausible details, it can lag behind specialist models in traditional quantitative metrics due to the stochastic nature of generative models.

Existing Capabilities (Elo Score Comparison)

CapabilityGemini 3 Pro ImageGemini 2.5 Flash ImageGPT-Image 1Seedream v4 4kFlux Pro Kontext Max
Text Rendering1198 ± 18997 ± 101150 ± 141019 ± 13854 ± 13
Stylization1098 ± 11933 ± 71069 ± 9991 ± 9908 ± 11
Multi-Turn1186 ± 191045 ± 241079 ± 32990 ± 32889 ± 37
General Image Editing1127 ± 13996 ± 81011 ± 13965 ± 12902 ± 13
Character Editing1176 ± 161075 ± 81016 ± 10889 ± 10843 ± 10
Object/Env. Editing1102 ± 191025 ± 9930 ± 12983 ± 13961 ± 10
General Text-to-Image1094 ± 161037 ± 81025 ± 91011 ± 9907 ± 9

New Capabilities (Elo Score Comparison)

CapabilityGemini 3 Pro ImageGemini 2.5 Flash ImageGPT-Image 1Seedream v4 4kFlux Pro Kontext Max
Multi-character Editing1213 ± 16950 ± 10997 ± 13840 ± 19-
Chart Editing1209 ± 18971 ± 10994 ± 16934 ± 16893 ± 15
Text Editing1202 ± 231001 ± 10996 ± 14860 ± 15943 ± 12
Factuality - Edu1169 ± 251050 ± 111084 ± 25969 ± 22884 ± 26
Infographics1268 ± 171162 ± 111087 ± 121049 ± 12824 ± 15
Visual Design1104 ± 161083 ± 71028 ± 111038 ± 12907 ± 11

300+ Model ile Başlayın,

Tüm modelleri keşfet

Join our Discord community

Join the Discord community for the latest model updates, prompts, and support.