google/nano-banana-pro/text-to-image

text-to-image

PRO

Nano Banana Pro Text-to-Image API by Google

Nano Banana Pro is the next-generation Nano Banana image model, delivering sharper detail, richer color control, and faster diffusion for production-ready visuals.

Each request costs $0.14 per run. With $10 you can run this model about 71 times.
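The run estimate above is a floor division of the budget by the per-run price. A minimal sketch, assuming the $0.14 price quoted on this page applies to every run; `runs_for_budget` is an illustrative name, not part of any Atlas Cloud SDK.

```python
from decimal import Decimal


def runs_for_budget(budget_usd: str, price_per_run_usd: str = "0.14") -> int:
    """Return how many whole runs a budget covers at a fixed per-run price."""
    budget = Decimal(budget_usd)
    price = Decimal(price_per_run_usd)
    if price <= 0:
        raise ValueError("price_per_run_usd must be positive")
    # Floor division: a partial run cannot be purchased.
    return int(budget // price)


print(runs_for_budget("10"))
```

Decimal arithmetic is used so that prices such as 0.14 are not subject to binary floating-point rounding.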

Code example
import os
import time

import requests

# Read the API key from the environment. A literal "$ATLASCLOUD_API_KEY"
# inside a Python string is not expanded.
API_KEY = os.environ["ATLASCLOUD_API_KEY"]

# Step 1: Start image generation
generate_url = "https://api.atlascloud.ai/api/v1/model/generateImage"
headers = {
    "Content-Type": "application/json",
    "Authorization": f"Bearer {API_KEY}",
}
data = {
    "model": "google/nano-banana-pro/text-to-image",
    "prompt": "A beautiful landscape with mountains and lake",
    "width": 512,
    "height": 512,
    "steps": 20,
    "guidance_scale": 7.5,
}

generate_response = requests.post(generate_url, headers=headers, json=data)
generate_result = generate_response.json()
prediction_id = generate_result["data"]["id"]

# Step 2: Poll for result
poll_url = f"https://api.atlascloud.ai/api/v1/model/prediction/{prediction_id}"

def check_status():
    while True:
        response = requests.get(poll_url, headers=headers)
        result = response.json()
        status = result["data"]["status"]

        # Both "completed" and "succeeded" mean the output is available.
        if status in ("completed", "succeeded"):
            print("Generated image:", result["data"]["outputs"][0])
            return result["data"]["outputs"][0]
        elif status == "failed":
            # The error field may be absent, so fall back to a generic message.
            raise Exception(result["data"].get("error") or "Generation failed")
        else:
            # Still processing, wait 2 seconds
            time.sleep(2)

image_url = check_status()

Installation

Install the required dependency.

pip install requests

Authentication

All API requests require authentication with an API key. You can get an API key from the Atlas Cloud dashboard.

export ATLASCLOUD_API_KEY="your-api-key-here"

HTTP Headers

import os

API_KEY = os.environ.get("ATLASCLOUD_API_KEY")
headers = {
    "Content-Type": "application/json",
    "Authorization": f"Bearer {API_KEY}"
}

Keep your API key secure

Never expose your API key in client-side code or public repositories. Use environment variables or a backend proxy instead.
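One way to follow the environment-variable advice is to build the headers in a single helper that fails fast when the key is missing. This is a sketch only; `build_auth_headers` is a hypothetical helper name, and the header layout simply mirrors the HTTP Headers example on this page.

```python
import os
from typing import Mapping


def build_auth_headers(env: Mapping[str, str] = os.environ) -> dict:
    """Build request headers from ATLASCLOUD_API_KEY, failing fast if it is unset."""
    api_key = env.get("ATLASCLOUD_API_KEY", "").strip()
    if not api_key:
        raise RuntimeError("ATLASCLOUD_API_KEY is not set; export it before running.")
    return {
        "Content-Type": "application/json",
        "Authorization": f"Bearer {api_key}",
    }
```

Passing the environment as a parameter keeps the key out of source code and makes the helper easy to exercise with a plain dictionary.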

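Send a request

import os

import requests

url = "https://api.atlascloud.ai/api/v1/model/generateImage"
headers = {
    "Content-Type": "application/json",
    # Read the key from the environment; Python does not expand "$ATLASCLOUD_API_KEY".
    "Authorization": f"Bearer {os.environ['ATLASCLOUD_API_KEY']}",
}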
data = {
    "model": "your-model",
    "prompt": "A beautiful landscape"
}

response = requests.post(url, headers=headers, json=data)
print(response.json())

Submit Request

Submit an asynchronous generation request. The API returns a prediction ID that you can use to check the status and retrieve the result.

POST /api/v1/model/generateImage

Request Body

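import os

import requests

url = "https://api.atlascloud.ai/api/v1/model/generateImage"
headers = {
    "Content-Type": "application/json",
    # Read the key from the environment; Python does not expand "$ATLASCLOUD_API_KEY".
    "Authorization": f"Bearer {os.environ['ATLASCLOUD_API_KEY']}",
}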

data = {
    "model": "google/nano-banana-pro/text-to-image",
    "prompt": "A beautiful landscape with mountains and lake"
}

response = requests.post(url, headers=headers, json=data)
result = response.json()

print(f"Prediction ID: {result['data']['id']}")
print(f"Status: {result['data']['status']}")

Response

{
  "code": 200,
  "data": {
    "id": "pred_abc123",
    "status": "processing",
    "model": "model-name",
    "created_at": "2025-01-01T00:00:00Z"
  }
}
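Before polling, it can help to validate the submission response rather than index into it directly. The sketch below assumes that a `code` other than 200 in the body signals an error, which this page does not state explicitly; `extract_prediction_id` is an illustrative name.

```python
def extract_prediction_id(payload: dict) -> str:
    """Return data.id from a submission response, raising on unexpected shapes."""
    code = payload.get("code")
    # Assumption: a body-level code other than 200 means the request was rejected.
    if code is not None and code != 200:
        raise RuntimeError(f"Submission rejected with code {code}")
    data = payload.get("data") or {}
    prediction_id = data.get("id")
    if not prediction_id:
        raise ValueError("Response does not contain data.id")
    return prediction_id


sample = {
    "code": 200,
    "data": {"id": "pred_abc123", "status": "processing"},
}
print(extract_prediction_id(sample))
```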

Check Status

Poll the prediction endpoint to check the current status of your request.

GET /api/v1/model/prediction/{prediction_id}

Polling Example

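import os
import time

import requests

prediction_id = "pred_abc123"
url = f"https://api.atlascloud.ai/api/v1/model/prediction/{prediction_id}"
# Read the key from the environment; Python does not expand "$ATLASCLOUD_API_KEY".
headers = {"Authorization": f"Bearer {os.environ['ATLASCLOUD_API_KEY']}"}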

while True:
    response = requests.get(url, headers=headers)
    result = response.json()
    status = result["data"]["status"]
    print(f"Status: {status}")

    if status in ["completed", "succeeded"]:
        output_url = result["data"]["outputs"][0]
        print(f"Output URL: {output_url}")
        break
    elif status == "failed":
        print(f"Error: {result['data'].get('error', 'Unknown')}")
        break

    time.sleep(3)

Status Values

processing: The request is still being processed.

completed: Generation finished. Output is available.

succeeded: Generation succeeded. Output is available.

failed: Generation failed. Check the error field.
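The four status values suggest a polling loop that treats `completed` and `succeeded` alike and gives up after a deadline. A minimal sketch under those assumptions: the timeout, the default interval, and the name `wait_for_outputs` are illustrative choices, and the HTTP call is passed in as `fetch` so the loop itself stays network-free.

```python
import time
from typing import Callable

SUCCESS_STATUSES = {"completed", "succeeded"}


def wait_for_outputs(
    fetch: Callable[[], dict],
    interval_s: float = 3.0,
    timeout_s: float = 300.0,
    sleep: Callable[[float], None] = time.sleep,
    clock: Callable[[], float] = time.monotonic,
) -> list:
    """Call fetch() until the prediction finishes, then return its outputs."""
    deadline = clock() + timeout_s
    while True:
        data = fetch()["data"]
        status = data["status"]
        if status in SUCCESS_STATUSES:
            return data.get("outputs", [])
        if status == "failed":
            raise RuntimeError(data.get("error") or "Generation failed")
        if clock() >= deadline:
            raise TimeoutError(f"Prediction still '{status}' after {timeout_s} seconds")
        sleep(interval_s)
```

With `requests`, `fetch` could be `lambda: requests.get(url, headers=headers).json()`, where `url` and `headers` are built as in the polling example.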

Completed Response

{
  "data": {
    "id": "pred_abc123",
    "status": "completed",
    "outputs": [
      "https://storage.atlascloud.ai/outputs/result.png"
    ],
    "metrics": {
      "predict_time": 8.3
    },
    "created_at": "2025-01-01T00:00:00Z",
    "completed_at": "2025-01-01T00:00:10Z"
  }
}
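The timestamps in a completed response can be compared with `metrics.predict_time` to see how much of the wall-clock time was spent outside generation. This is a sketch that assumes both timestamps are ISO 8601 in UTC with a trailing `Z`, as in the sample; the function names are illustrative.

```python
from datetime import datetime


def parse_utc(timestamp: str) -> datetime:
    """Parse an ISO 8601 timestamp, normalizing a trailing 'Z' to an explicit offset."""
    # datetime.fromisoformat() only accepts a trailing "Z" from Python 3.11 onward.
    return datetime.fromisoformat(timestamp.replace("Z", "+00:00"))


def wall_clock_seconds(data: dict) -> float:
    """Seconds between created_at and completed_at of a finished prediction."""
    delta = parse_utc(data["completed_at"]) - parse_utc(data["created_at"])
    return delta.total_seconds()


sample = {
    "created_at": "2025-01-01T00:00:00Z",
    "completed_at": "2025-01-01T00:00:10Z",
    "metrics": {"predict_time": 8.3},
}
overhead = wall_clock_seconds(sample) - sample["metrics"]["predict_time"]
print(f"wall clock: {wall_clock_seconds(sample)} s, overhead: {overhead:.1f} s")
```

In the sample, generation accounts for 8.3 of the 10 elapsed seconds; what the remainder covers (queueing, upload, or something else) is not documented on this page.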

Upload Files

Upload a file to Atlas Cloud storage and get a URL that you can use in your API requests. Use multipart/form-data for the upload.

POST /api/v1/model/uploadMedia

Upload Example

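import os

import requests

url = "https://api.atlascloud.ai/api/v1/model/uploadMedia"
# Read the key from the environment; Python does not expand "$ATLASCLOUD_API_KEY".
headers = {"Authorization": f"Bearer {os.environ['ATLASCLOUD_API_KEY']}"}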

with open("image.png", "rb") as f:
    files = {"file": ("image.png", f, "image/png")}
    response = requests.post(url, headers=headers, files=files)

result = response.json()
download_url = result["data"]["download_url"]
print(f"File URL: {download_url}")

Response

{
  "data": {
    "download_url": "https://storage.atlascloud.ai/uploads/abc123/image.png",
    "file_name": "image.png",
    "content_type": "image/png",
    "size": 1024000
  }
}
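The upload example hard-codes `image/png`. For other file types, the multipart part can be built from the file name instead. A small sketch, assuming the endpoint accepts the same `file` field for any media type, which this page does not confirm; `build_file_part` is a hypothetical helper.

```python
import mimetypes
from pathlib import Path


def build_file_part(path: str, fileobj) -> dict:
    """Build the `files` mapping for requests.post() with a guessed content type."""
    name = Path(path).name
    content_type, _ = mimetypes.guess_type(name)
    # Fall back to a generic binary type when the extension is unknown.
    return {"file": (name, fileobj, content_type or "application/octet-stream")}
```

Usage mirrors the upload example: open the file in binary mode and pass `files=build_file_part("image.png", f)` to `requests.post`.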

Input Schema

The following parameters are accepted in the request body.

Total: 0 | Required: 0 | Optional: 0

No parameters available.

Example Request Body

{
  "model": "google/nano-banana-pro/text-to-image"
}

Output Schema

The API returns a prediction response containing the generated output URLs.

id (string, required)

Unique identifier for the prediction.

status (string, required)

Current status of the prediction.

Allowed values: processing, completed, succeeded, failed

model (string, required)

The model used for generation.

outputs (array[string])

Array of output URLs. Available when status is "completed" or "succeeded".

error (string)

Error message if status is "failed".

metrics (object)

Performance metrics.

metrics.predict_time (number)

Time taken for image generation in seconds.

created_at (string, required)

ISO 8601 timestamp when the prediction was created.

Format: date-time

completed_at (string)

ISO 8601 timestamp when the prediction was completed.

Format: date-time

Example Response

{
  "id": "pred_abc123",
  "status": "completed",
  "model": "model-name",
  "outputs": [
    "https://storage.atlascloud.ai/outputs/result.png"
  ],
  "metrics": {
    "predict_time": 8.3
  },
  "created_at": "2025-01-01T00:00:00Z",
  "completed_at": "2025-01-01T00:00:10Z"
}
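The output schema maps naturally onto a small typed record. The sketch below is one possible client-side representation, not an official SDK type. It accepts either the bare object shown here or one wrapped in a `data` envelope, and it treats `model` as optional because the completed-response sample on this page omits it.

```python
from dataclasses import dataclass, field
from typing import Optional


@dataclass
class Prediction:
    id: str
    status: str
    created_at: str
    model: Optional[str] = None
    outputs: list = field(default_factory=list)
    error: Optional[str] = None
    predict_time: Optional[float] = None
    completed_at: Optional[str] = None

    @property
    def has_output(self) -> bool:
        """True when the status indicates that output URLs are available."""
        return self.status in ("completed", "succeeded")

    @classmethod
    def from_dict(cls, payload: dict) -> "Prediction":
        # Accept either the bare object or one wrapped in a "data" envelope.
        d = payload.get("data", payload)
        metrics = d.get("metrics") or {}
        return cls(
            id=d["id"],
            status=d["status"],
            created_at=d["created_at"],
            model=d.get("model"),
            outputs=list(d.get("outputs") or []),
            error=d.get("error"),
            predict_time=metrics.get("predict_time"),
            completed_at=d.get("completed_at"),
        )
```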

Atlas Cloud Skills

Atlas Cloud Skills integrates 300+ AI models directly into your AI coding assistant. Install it with one command, then use natural language to generate images and videos and to chat with LLMs.

Supported Clients

Claude Code

OpenAI Codex

Gemini CLI

Cursor

Windsurf

VS Code

Trae

GitHub Copilot

Cline

Roo Code

Amp

Goose

Replit

40+ supported clients

Installation

npx skills add AtlasCloudAI/atlas-cloud-skills

Set the API Key

Get an API key from the Atlas Cloud dashboard and set it as an environment variable.

export ATLASCLOUD_API_KEY="your-api-key-here"

Capabilities

Once installed, you can use natural language in your AI assistant to access all Atlas Cloud models.

Image Generation: Generate images with models such as Nano Banana 2, Z-Image, and others.

Video Generation: Create videos from text or images with Kling, Vidu, Veo, and others.

LLM Chat: Chat with Qwen, DeepSeek, and other large language models.

Media Upload: Upload local files for image editing and image-to-video workflows.

Learn more

github.com/AtlasCloudAI/atlas-cloud-skills

MCP Server

Atlas Cloud MCP Server connects your IDE to 300+ AI models through the Model Context Protocol. It works with any MCP-compatible client.

Supported Clients

Cursor

VS Code

Windsurf

Claude Code

OpenAI Codex

Gemini CLI

Cline

Roo Code

100+ supported clients

Installation

npx -y atlascloud-mcp

Configuration

Add the following configuration to the MCP settings file in your IDE.

{
  "mcpServers": {
    "atlascloud": {
      "command": "npx",
      "args": [
        "-y",
        "atlascloud-mcp"
      ],
      "env": {
        "ATLASCLOUD_API_KEY": "your-api-key-here"
      }
    }
  }
}
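If the IDE's MCP settings file already lists other servers, the entry can be merged in rather than pasted over the file. A minimal sketch, assuming the settings are a JSON object with a top-level `mcpServers` key as shown above; the function names are illustrative, and the location of the settings file depends on the IDE.

```python
import json


def build_atlascloud_entry(api_key: str) -> dict:
    """Return the server entry from the configuration shown on this page."""
    return {
        "command": "npx",
        "args": ["-y", "atlascloud-mcp"],
        "env": {"ATLASCLOUD_API_KEY": api_key},
    }


def merge_mcp_config(existing: dict, api_key: str) -> dict:
    """Return a copy of `existing` with the atlascloud server added or replaced."""
    merged = dict(existing)
    servers = dict(merged.get("mcpServers", {}))
    servers["atlascloud"] = build_atlascloud_entry(api_key)
    merged["mcpServers"] = servers
    return merged


current = {"mcpServers": {"other": {"command": "node", "args": ["server.js"]}}}
print(json.dumps(merge_mcp_config(current, "your-api-key-here"), indent=2))
```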

Available Tools

atlas_generate_image: Generate images from a text prompt.

atlas_generate_video: Create videos from text or images.

atlas_chat: Chat with large language models.

atlas_list_models: Browse the 300+ available AI models.

atlas_quick_generate: One-step content generation with automatic selection of the best model.

atlas_upload_media: Upload local files for API workflows.

Learn more

github.com/AtlasCloudAI/mcp-server

Advanced Image Generation

Multi-image fusion technology
Character consistency across generations
Style-preserving transformation
High-resolution output up to 4K

Smart Editing Tools

Text-based smart editing
Object addition and removal
Background replacement
Style transfer and artistic effects

Transform to Figure

Photo to Character Figure

Transform any photo into a realistic character figure with packaging and display

Prompt

turn this photo into a character figure. Behind it, place a box with the character's image printed on it, and a computer showing the Blender modeling process on its screen. In front of the box, add a round plastic base with the character figure standing on it. set the scene indoors if possible

Anime to Real

Anime to Cosplay

Transform anime illustrations into realistic cosplay photography

Prompt

Generate a highly detailed photo of a girl cosplaying this illustration, at Comiket. Exactly replicate the same pose, body posture, hand gestures, facial expression, and camera framing as in the original illustration. Keep the same angle, perspective, and composition, without any deviation

Photo to Action Figure

Person to Action Figure

Transform people from photos into collectible action figures with custom packaging

Prompt

Transform the person in the photo into an action figure, styled after [CHARACTER_NAME] from [SOURCE / CONTEXT]. Next to the figure, display the accessories including [ITEM_1], [ITEM_2], and [ITEM_3]. On the top of the toy box, write "[BOX_LABEL_TOP]", and underneath it, "[BOX_LABEL_BOTTOM]". Place the box in a [BACKGROUND_SETTING] environment. Visualize this in a highly realistic way with attention to fine details.
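The bracketed placeholders in this prompt need to be replaced before it is sent. One way to do that, sketched here with a hypothetical `fill_prompt` helper, is a regular-expression substitution that raises if any placeholder is left without a value. The sample values are made up for illustration.

```python
import re

# Matches placeholders such as [ITEM_1] or [SOURCE / CONTEXT].
PLACEHOLDER = re.compile(r"\[([A-Z0-9_ /]+)\]")


def fill_prompt(template: str, values: dict) -> str:
    """Replace every [PLACEHOLDER] in the template, raising if a value is missing."""
    def substitute(match: re.Match) -> str:
        key = match.group(1)
        if key not in values:
            raise KeyError(f"No value supplied for placeholder [{key}]")
        return values[key]

    return PLACEHOLDER.sub(substitute, template)


template = "styled after [CHARACTER_NAME] from [SOURCE / CONTEXT], holding [ITEM_1]"
print(fill_prompt(template, {
    "CHARACTER_NAME": "a space explorer",
    "SOURCE / CONTEXT": "a retro sci-fi comic",
    "ITEM_1": "a helmet",
}))
```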

Photo to Funko Pop

Person to Funko Pop Figure

Transform photos into Funko Pop style collectible figures with custom packaging

Prompt

Transform the person in the photo into the style of a Funko Pop figure packaging box, presented in an isometric perspective. Label the packaging with the title 'ZHOGUE'. Inside the box, showcase the figure based on the person in the photo, accompanied by their essential items (such as cosmetics, bags, or others). Next to the box, also display the actual figure itself outside of the packaging, rendered in a realistic and lifelike style.

Design to Reality

Product Design to Photorealistic Render

Transform product design sketches into photorealistic renders

Prompt

turn this illustration of a perfume into a realistic version, Frosted glass bottle with a marble cap

Face Reference Control

Transform to Q-Version Character

Create cartoon characters with face shape reference control

Prompt

Transform the person from image 1 into a Q-version character design based on the face shape from image 2

Architecture to Model

Building to 3D Architecture Model

Convert architectural photos into detailed physical models

Prompt

Convert this photo into an architecture model. Behind the model, there should be a cardboard box with an image of the architecture from the photo on it. There should also be a computer, with the content on the computer screen showing the Blender modeling process of the figurine. In front of the cardboard box, place a cardstock and put the architecture model from the photo I provided on it. I hope the PVC material can be clearly presented. It would be even better if the background is indoors.

Technical Advantages

Performance

Ultra-Fast Generation

Optimized for speed, with generation times under 2 seconds for most tasks, making it well suited to real-time applications and rapid prototyping workflows.

Quality

Exceptional Output Quality

Leverages Google's advanced AI architecture to produce highly detailed, photorealistic images with accurate lighting, texture, and composition.

Innovation

Novel View Synthesis

Revolutionary 2D-to-3D conversion capability that generates multiple viewpoints from a single image, opening new possibilities in content creation.

Use Cases

📸

Product Photography

🎨

Digital Art Creation

✨

Photo Enhancement

📊

Marketing Visuals

👤

Character Design

👔

Virtual Try-On

📱

Social Media

🔄

Photo Restoration

Why Choose Nano Banana?

🚀

No Setup Required

Start creating right away without complicated configuration or installation

🎯

Precise Control

Adjust every aspect of your creation with intuitive text prompts

🔄

Consistent Results

Maintain character and style consistency across generations

Technical Specifications

Model Architecture: Powered by Google AI Studio

Processing Speed: < 2 seconds average generation time

Resolution Support: Up to 4096x4096 pixels

Format Support: PNG, JPEG, and WebP output formats

Multi-modal Input: Text prompts, images, and combinations of both

API Integration: RESTful API with full documentation

Experience the Power of Nano Banana AI

Join thousands of creators and businesses that have transformed their visual content with Google's most advanced image AI technology.

✨ Free Credits to Get Started

⚡ Instant Access

🌐 Works Anywhere

Nano Banana Pro: A state-of-the-art multimodal reasoning and image generation model by Google DeepMind

Model Card Overview

Field	Description
Model Name	Nano Banana Pro (also known as Gemini 3 Pro Image)
Developer	Google DeepMind
Release Date	November 20, 2025
Model Type	Multimodal Reasoning and Image Generation
Related Links	Official Product Page, Model Card (PDF)

Introduction

Nano Banana Pro, officially designated as Gemini 3 Pro Image, represents the next generation in Google's series of highly-capable, natively multimodal models. It is designed for professional asset production, integrating the advanced reasoning capabilities of the Gemini 3 Pro foundation model with a sophisticated image generation engine. The primary goal of Nano Banana Pro is to provide users with studio-quality precision and control, enabling the creation of complex, high-fidelity visuals from textual and image-based prompts. Its core contribution lies in its ability to understand and execute intricate instructions, maintain character and scene consistency, and render legible text directly within generated images, setting a new standard for professional creative workflows.

Key Features & Innovations

Nano Banana Pro introduces several technical breakthroughs that distinguish it from prior models:

Superior Text Rendering: The model excels at generating images that contain clear, accurate, and stylistically coherent text, making it ideal for creating posters, diagrams, and marketing materials.
Advanced Creative Controls: Users can exercise fine-grained control over image outputs, including camera angles, lighting transformations (e.g., day to night), color grading, depth of field, and localized editing.
High-Fidelity Consistency: It can maintain the consistency of up to 14 input images and blend up to 5 distinct characters seamlessly into complex compositions, ensuring visual coherence across a series of generated images.
Deep Real-World Knowledge: Built on Gemini 3 Pro, the model leverages a vast understanding of the world to generate contextually rich and factually grounded visuals, from detailed infographics to historically accurate scenes.
Multilingual Capabilities: The model can accurately render and translate text across multiple languages within an image, facilitating the localization of visual content.
Complex Composition from Multiple Inputs: Nano Banana Pro can synthesize elements from multiple source images and text prompts to create a single, cohesive scene, enabling complex creative concepts.

Model Architecture & Technical Details

Nano Banana Pro's architecture is fundamentally based on the Gemini 3 Pro model. While specific architectural details are not fully disclosed, the following technical information is available:

Foundation Model: Gemini 3 Pro
Inputs: The model accepts text strings and images as input, with a large context window of up to 1 million tokens.
Outputs: It generates high-resolution images (up to 4K) with a 64K token output capacity for handling complex generation tasks.
Training Infrastructure:
- Hardware: The model was trained on Google's custom-designed Tensor Processing Units (TPUs), which are optimized for large-scale machine learning computations and high-bandwidth memory access.
- Software: The training process utilized JAX and ML Pathways, Google's high-performance frameworks for machine learning research.
Knowledge Cutoff: The model's internal knowledge base has a cutoff date of January 2025.

Intended Use & Applications

Nano Banana Pro is intended for professional and creative applications that require a high degree of precision, control, and visual fidelity. It is well-suited for a variety of downstream tasks and application scenarios:

Professional Content Creation: Generating production-ready assets for marketing campaigns, advertising, and branding.
Design and Prototyping: Creating detailed product mockups, storyboards for film and animation, and architectural visualizations.
Informational Graphics: Designing complex and accurate infographics, educational diagrams, and data visualizations.
Artistic and Creative Expression: Enabling artists and designers to explore novel visual styles and create complex, multi-element compositions.

Performance

Nano Banana Pro's performance has been evaluated through extensive human evaluations and benchmarked against other leading image generation models. The results, measured in Elo scores, demonstrate its strong capabilities across a wide range of tasks.

A technical report also notes a performance dichotomy: while the model produces subjectively superior visual quality by hallucinating plausible details, it can lag behind specialist models in traditional quantitative metrics due to the stochastic nature of generative models.

Existing Capabilities (Elo Score Comparison)

Capability	Gemini 3 Pro Image	Gemini 2.5 Flash Image	GPT-Image 1	Seedream v4 4k	Flux Pro Kontext Max
Text Rendering	1198 ± 18	997 ± 10	1150 ± 14	1019 ± 13	854 ± 13
Stylization	1098 ± 11	933 ± 7	1069 ± 9	991 ± 9	908 ± 11
Multi-Turn	1186 ± 19	1045 ± 24	1079 ± 32	990 ± 32	889 ± 37
General Image Editing	1127 ± 13	996 ± 8	1011 ± 13	965 ± 12	902 ± 13
Character Editing	1176 ± 16	1075 ± 8	1016 ± 10	889 ± 10	843 ± 10
Object/Env. Editing	1102 ± 19	1025 ± 9	930 ± 12	983 ± 13	961 ± 10
General Text-to-Image	1094 ± 16	1037 ± 8	1025 ± 9	1011 ± 9	907 ± 9

New Capabilities (Elo Score Comparison)

Capability	Gemini 3 Pro Image	Gemini 2.5 Flash Image	GPT-Image 1	Seedream v4 4k	Flux Pro Kontext Max
Multi-character Editing	1213 ± 16	950 ± 10	997 ± 13	840 ± 19	-
Chart Editing	1209 ± 18	971 ± 10	994 ± 16	934 ± 16	893 ± 15
Text Editing	1202 ± 23	1001 ± 10	996 ± 14	860 ± 15	943 ± 12
Factuality - Edu	1169 ± 25	1050 ± 11	1084 ± 25	969 ± 22	884 ± 26
Infographics	1268 ± 17	1162 ± 11	1087 ± 12	1049 ± 12	824 ± 15
Visual Design	1104 ± 16	1083 ± 7	1028 ± 11	1038 ± 12	907 ± 11
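To read the Elo gaps in the tables above as head-to-head preferences, the conventional Elo logistic formula can be applied. The source does not state which scale its evaluation used, so the 400-point scale below is an assumption, and the calculation ignores the ± intervals. With ratings $R_A$ and $R_B$, the expected score of A is $E_A = 1 / (1 + 10^{(R_B - R_A)/400})$.

```python
def expected_win_rate(rating_a: float, rating_b: float) -> float:
    """Expected score of A against B under the conventional Elo logistic model."""
    # Assumption: standard 400-point scale, where a 400-point gap means 10:1 odds.
    return 1.0 / (1.0 + 10 ** ((rating_b - rating_a) / 400.0))


# Text Rendering row: Gemini 3 Pro Image (1198) vs GPT-Image 1 (1150).
print(f"{expected_win_rate(1198, 1150):.3f}")
```

Under that assumption, the 48-point gap in Text Rendering corresponds to the higher-rated model being preferred in roughly 57% of pairwise comparisons.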

Explore Similar Models

NEW

image-to-image

Nano Banana 2 Reference to Image

Google's advanced AI-powered video-to-image generation model, designed to generate high-quality static images from video clips combined with text instructions.

Nano Banana 2 Reference to Image Developer

Google's advanced AI-powered video-to-image generation model, designed to generate high-quality static images from video clips combined with text instructions.

Nano Banana 2 Text-to-Image Developer

Google's lightweight yet powerful AI image generation model, built for creators who need fast, high-quality visuals from simple text prompts.

Nano Banana 2 Text-to-Image

Google's lightweight yet powerful AI image generation model, built for creators who need fast, high-quality visuals from simple text prompts.

Nano Banana 2 Edit Developer

Google's advanced AI-powered image editing and generation model, designed to make visual transformation as intuitive as describing it in words.

Nano Banana 2 Edit

Google's advanced AI-powered image editing and generation model, designed to make visual transformation as intuitive as describing it in words.

Nano Banana Pro Text-to-image Ultra

Nano Banana Pro is the next-generation Nano Banana image model, delivering sharper detail, richer color control, and faster diffusion for production-ready visuals.

From

$0.15/image