GPT Image API with 3 Model Tiers

GPT Image API 為開發者提供了 OpenAI 的圖像生成系列產品，包含 GPT Image 1、1.5 和 Mini 三個層級，每個層級均提供文本生成圖像和圖像編輯變體。這些模型在多種風格下均能提供準確的圖像內文本、照片級的逼真渲染以及高度的提示詞遵循能力。在 Atlas Cloud 上，您可以透過一個統一的 API 存取所有層級以及其他 300 多種模型，每張圖像低至 0.004 美元，並擁有 99.99% 的正常運行時間保證。

探索領先模型

Atlas Cloud 為您提供最新的行業領先創意模型。

NEW

文生圖

Openai GPT Image 2 Text-to-Image

GPT Image 2 text to image is OpenAI's fast, cost-efficient text-to-image generator powered by GPT-5 guidance. Create photorealistic shots, product renders, concept art, and stylized graphics from natural-language prompts (optionally conditioned with an image). Supports custom aspect ratios, seeds, negative prompts, hex color hints, and style presets. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

Openai GPT Image 2 Edit

GPT Image 2 Edit is OpenAI's image model for precise, natural-language edits. Add/remove objects, swap backgrounds, retouch faces, adjust colors/lighting, edit text/graphics, crop/resize, and apply hex color control. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

Openai GPT Image-1.5 Text-to-image

GPT Image 1.5 text to image is OpenAI’s fast, cost-efficient text-to-image generator powered by GPT-5 guidance. Create photorealistic shots, product renders, concept art, and stylized graphics from natural-language prompts (optionally conditioned with an image). Supports custom aspect ratios, seeds, negative prompts, hex color hints, and style presets. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

Openai GPT Image-1.5 Edit

GPT Image 1.5 Edit is OpenAI’s image model for precise, natural-language edits. Add/remove objects, swap backgrounds, retouch faces, adjust colors/lighting, edit text/graphics, crop/resize, and apply hex color control. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

Openai GPT Image-1 Text-to-image

OpenAI GPT Image-1 generates images from text prompts from OpenAI's latest text-to-image model, ideal for creating visual assets. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

Openai GPT Image-1 Edit

OpenAI's gpt-image-1 enables image generation and image editing via OpenAI's image API, ideal for creating and refining images. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

Openai GPT Image-1 Mini Text-to-image

GPT Image 1 Mini is a cost-efficient multimodal OpenAI model powered by GPT-5 that turns text or image prompts into high-quality images. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

Openai GPT Image-1 Mini Edit

GPT Image 1 Mini is a cost-efficient, natively multimodal OpenAI model that pairs GPT-5 language understanding with compact image editing and generation from text and image inputs to produce high-quality images. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

GPT Image 2 Developer Edit

GPT Image 2 Developer Edit applies natural-language instructions to one or more reference images, with common aspect ratios and 1k, 2k, or supported 4k output tiers. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

GPT Image 2 Developer Text-to-Image

GPT Image 2 Developer Text-to-Image generates polished visuals from natural-language prompts, with common aspect ratios and 1k, 2k, or supported 4k output tiers. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

From$0.009/張

$0.004/張

-50%

峰值速度

最低成本

模態	描述
GPT Image-1 T2I API(Text to Image)	GPT Image-1 文字生成圖像 API 賦能開發者將文字提示轉化為細節豐富、令人驚嘆的逼真視覺效果。透過將 GPT-4 Turbo 的推理能力與 DALL·E 等級的視覺合成技術相結合，它為專業級圖像製作提供了業界領先的提示詞遵循度與複雜構圖能力。
GPT Image-1 Edit API(Image to Image)	GPT Image-1 Edit API 賦能開發者，以無縫的一致性將現有圖像轉化為經過精細調整或重新構想的傑作。透過利用多模態理解能力，它能夠生成精確的風格轉換、情境構圖以及針對性的修改，以實現專業級的資產迭代。
GPT Image-1.5 T2I API(Text to Image)	GPT Image-1.5 Text to Image API 賦能開發者以最佳化的成本將文字提示詞轉化為高品質的視覺效果。透過運用基於 GPT 的架構，它提供了強大的提示詞理解能力和視覺保真度，以實現平衡的生產工作流程。
GPT Image-1.5 Edit API(Image to Image)	GPT Image-1.5 Edit API 使開發者能夠透過精確的修改來完善現有資產。透過支援 input_fidelity 控制，它能夠進行微調，同時保留臉部和標誌等關鍵元素。
GPT Image-1 Mini T2I API(Text to Image)	GPT Image-1 Mini Text to Image API 為開發者提供了該系列中性價比最高的圖像生成能力。借助 GPT-5 架構，它以最低的單張圖像成本提供專業級的生成結果，是高頻大批量內容生產的理想選擇。
GPT Image-1 Mini Edit API(Image to Image)	GPT Image-1 Mini Edit API 賦能開發者，透過精簡的編輯功能改造現有圖像。它以極低的成本提供必要的編輯功能，進而實現快速迭代和內容生產工作流程。

GPT Image API 的主要特性

探索 GPT Image API 提供的功能，從靈活的風格、相片級的逼真度、準確的影像內文字到基於遮罩的編輯、背景控制與品質分級。

使用 GPT Image API 進行靈活的風格生成

生成涵蓋逼真攝影、風格化藝術作品、概念藝術、資訊圖表、3D風格插畫等多領域的豐富視覺輸出。從電影級景觀到 UI 介面原型，該模型均能精準契合您的創作方向。

使用 GPT Image API 實現高視覺保真度

Maintains object relationships, lighting consistency, and color balance with industry-leading prompt adherence. Generated images exhibit natural textures, accurate proportions, and physically plausible compositions.

使用 GPT Image API 實現精準的文字渲染

能夠在圖像中生成清晰、易讀的排版——是海報、迷因、漫畫、品牌視覺設計以及任何需要整合文本元素的專案的理想選擇。

使用 GPT Image API 的基於知識的創造力

藉助 GPT-4/GPT-5 的世界知識，生成事實上準確且符合語境的視覺內容。該模型理解文化背景、歷史語境以及特定領域的概念。

使用 GPT Image API 進行基於遮罩的編輯

使用可選的遮罩輸入編輯特定區域，僅修改選中部分，同時保持影像其餘部分原樣不變。這使得 GPT Image API 在修圖、物件移除和精確的構圖調整方面非常可靠。

背景與透明度控制

在支援的模型上自訂背景並產生透明輸出，是標誌、產品展示圖和分層設計工作的理想選擇。您可以將主體放置在新的場景中，或匯出乾淨的去背圖，而無需手動建立遮罩。

Quality Tier Control

在每次請求時選擇低、中或高品質，以平衡您工作負載的細節與成本。較低層級可加速大批量草稿的生成，而高品質層級則能為最終資產提供最具照片級真實感的結果。

Comparisons with One Prompt

提示詞

Surrealist fashion campaign poster, quadrant layout (2x2 grid of 4 variations), extreme macro photography of a human eye filling the entire frame as background — iris colors vary across panels: blue-green teal, golden hazel, natural brown — hyperrealistic eye texture with visible pores on eyelid skin, dramatic long eyelashes in black with some purple/violet colored lash extensions spiking outward in an editorial exaggerated style, miniaturized female model composited realistically into the eye environment, appearing to sit casually on the lower eyelid or eyelash roots, model wearing streetwear/casual fashion outfits — variations include: oversized grey graphic sweatshirt + black plaid wide-leg pants + black chunky platform boots, grey long-sleeve polo shirt + sage green cargo pants + tan Timberland boots + camo backpack, bold typographic brand logo "LKNLN" stamped/tattooed directly onto the eyelid skin in dark gothic/industrial bold sans-serif font, appearing as if embossed or inked into skin, lighting: dramatic studio lighting on the eye, soft fill on model, depth of field contrast between hyper-sharp iris and soft skin surroundings, color palette: skin tones, teal/hazel iris, muted sage green, plaid grey-black, amber boots, purple accent lashes, photorealistic composite, editorial fashion photography style, small watermark "AI dsgn" in bottom left corner, ultra high resolution, cinematic color grading

GPT Image 1

GPT Image 1.5

GPT Image 2

GPT Image API Use Cases for Image Generation

探索您可以使用 GPT Image API 建構的內容，從專業攝影和 UI 原型到行銷活動、概念藝術、風格轉換以及內容在地化。

Professional Photography & Visual Art

Generate photorealistic images with cinematic lighting, precise composition, and natural textures. From product photography to editorial visuals, GPT Image models produce outputs indistinguishable from professional camera work.

UI/UX Design & Mockups

Create clean, modern design concepts including app interfaces, dashboards, websites, and product layouts. The models excel at generating structured compositions with professional aesthetics.

Marketing & Advertising Campaigns

Rapidly produce campaign-ready visuals for social media, digital ads, and brand marketing. Support for multiple quality tiers enables both rapid A/B testing and high-end final deliverables.

Creative Concept Art & Illustration

Explore styles, moodboards, and concept art at speed. Generate illustrations in diverse artistic styles — from watercolor paintings to anime, comic books to oil paintings.

Style Transfer & Artistic Transformation

Transform existing images into different artistic styles while preserving core subject matter. Convert photos to cartoons, paintings, sketches, or any aesthetic direction with natural language instructions.

Content Localization & Adaptation

Quickly adapt visual content for different markets, audiences, or platforms. Modify backgrounds, adjust colors, update styling, or re-contextualize imagery through simple text descriptions.

模型對比

查看不同廠商的模型表現 — 對比效能、價格和獨特優勢，做出明智決策。

Model	Reference Image Limit	Output Num	Resolution	Aspect Ratio
GPT Image-1	4	1~10	1024×1024, 1024×1536, 1536×1024	1:1, 3:2, 2:3
GPT Image-1.5	10	1	1024×1024, 1024×1536, 1536×1024	1:1, 3:2, 2:3
GPT Image-1 Mini	4	1~10	1024×1024, 1024×1536, 1536×1024	1:1, 3:2, 2:3
Nano Banana 2	14	1	4K, 2K, 1K	1:1 3:2 2:3 3:4 4:3 4:5 5:4 9:16 16:9 21:9
Seedream 5.0	14	1~15	2K~4K+	1:1 3:2 2:3 3:4 4:3 4:5 5:4 9:16 16:9 21:9

如何在 Atlas Cloud 上使用 GPT Image

幾分鐘即可上手 — 按照以下簡單步驟，透過 Atlas Cloud 平台整合和部署模型。

建立 Atlas Cloud 帳戶

在 atlascloud.ai 註冊並完成驗證。新用戶可獲得免費額度，用於探索平台和測試模型。

為何在 Atlas Cloud 使用 GPT Image

將先進的 GPT Image 模型與 Atlas Cloud 的 GPU 加速平台相結合，提供無與倫比的效能、可擴展性和開發體驗。

效能與靈活性

低延遲：
GPU 最佳化推理，實現即時回應。

統一 API：
一次整合，暢用 GPT Image、GPT、Gemini 和 DeepSeek。

透明定價：
按 Token 計費，支援 Serverless 模式。

企業與規模

開發者體驗：
SDK、資料分析、微調工具和模板一應俱全。

可靠性：
99.99% 可用性、RBAC 權限控制、合規日誌。

安全與合規：
SOC 2 Type II 認證、HIPAA 合規、美國資料主權。

GPT Image API FAQ

The GPT Image API offers three tiers. GPT Image-1 is the flagship for the highest quality, GPT Image-1.5 balances strong quality with lower cost, and GPT Image-1 Mini is the most cost-efficient for high-volume work. Each tier is available in both text to image and image to image variants.

Each model supports Low, Medium, and High quality settings. Higher quality produces more detailed and photorealistic results but at higher cost. For initial testing and previews, use Low quality for speed and savings. Switch to High quality for final deliverables requiring maximum fidelity.

Text-to-Image models support three output sizes: 1024×1024 (square), 1024×1536 (portrait), and 1536×1024 (landscape). Choose based on your use case — portrait for characters and vertical art, landscape for cinematic scenes and wide compositions, square for general purpose content.

Yes. The GPT Image API edit models accept an optional mask input, so you can control exactly which regions of an image are modified while the rest stays untouched. This supports precise inpainting for retouching, object removal, and localized changes.

The GPT Image API gives developers programmatic access to OpenAI's GPT Image family, a suite of multimodal image generation and editing models. It generates and edits images from text and image inputs, with accurate in-image text, photorealistic rendering, and strong prompt adherence. On Atlas Cloud you reach all three tiers through one unified API alongside 300+ models.

On Atlas Cloud the GPT Image API uses flat per-image pricing, starting at $0.004 per image on GPT Image-1 Mini, $0.008 on GPT Image-1.5, and $0.009 on GPT Image-1. Pricing is transparent with no token math, so you can predict the cost per generation before you run it.

No. OpenAI gates the GPT Image models behind organization verification in its own developer console, which can block individual developers. With the GPT Image API on Atlas Cloud you only need an Atlas Cloud account, so you can get a key and start generating without OpenAI verification.

Yes. Images you generate through the GPT Image API come with full commercial usage rights, and you retain ownership of the content you create. This makes it suitable for client work, marketing campaigns, and products you ship.

Yes. Atlas Cloud exposes an OpenAI-compatible API, so you can point the OpenAI SDK at the Atlas Cloud base URL, add your Atlas key, and call the GPT Image API with your existing code. You can make your first request in minutes without rebuilding your integration.

The GPT Image API gives you programmatic control that the chat experience does not, including quality settings, output size and format, mask-based editing, and batch generation. It is built for integrating image generation into your own apps and pipelines, rather than one-off creation in a chat window.

探索更多系列

Seedance 2.0

Seedance 2.0 API 為您提供 ByteDance 多模態影片模型的生產級存取權限——支援四模態輸入（文字、影像、影片、音訊），以及業界領先的「Universal Reference」（通用參考）系統，可在不同鏡頭間鎖定構圖、運鏡與角色動作。只需一次 API 呼叫即可整合導演級控制，固定費率為 $0.09/秒，即時取得金鑰，無需排隊——由企業級正常運行時間與合規性提供保障。Seedance 2.0 原生 4K 現已上線！

檢視系列

Grok Imagine

Grok Imagine API 為開發者提供 xAI 的圖像、影片和音訊生成一站式套件。它可以生成解析度高達 2K 且支援多語言文本渲染的圖像，以及長達 15 秒且帶有原生同步音訊和基於參考圖像編輯功能的影片。在 Atlas Cloud 上，只需一個金鑰即可執行每個 Grok Imagine 模式，因此您可以在圖像、影片和音訊之間無縫切換，無需單獨設定，每張圖像 0.02 美元起，每秒 0.05 美元起。

檢視系列

Gemini Omni Flash

Gemini Omni API 將 Google DeepMind 於 Google I/O 2026 發表的多模態影片生成與編輯模型帶進你的技術棧。Gemini Omni 將 Gemini 的推理引擎與生成式媒體融合，可接受文字、圖片、影片與音訊的任意組合輸入，產生一致且以知識為根據的輸出。透過自然對話持續打磨成果：替換物件、改寫場景、切換風格，同時維持物理規律、角色與畫面連貫性不變。Atlas Cloud 透過單一整合 API 提供完整的 Gemini Omni Flash 系列——文字生成影片、支援最多 7 張參考圖片的圖片生成影片，以及參考圖生成影片——採每秒計費、價格透明，$0.112 起，無需訂閱。立即開始打造。

檢視系列

GPT Image 2

GPT Image 2 API 為開發者提供了訪問 OpenAI 最新圖像模型的途徑，它是 GPT Image 1.5 的繼任者。該模型可生成和編輯圖像，能夠在拉丁和 CJK 文字上實現準確的文本渲染，並在海報、樣機和資訊圖表方面具備強大的排版能力。在 Atlas Cloud 上，您可以透過一個統一的 API 與 300 多個模型一起訪問它，並享受免費額度、99.99% 的正常運行時間，且無需 OpenAI 組織驗證。

檢視系列

Google

Google最強大的創意模型現已在Atlas Cloud上全面可用。Veo 3.1提供電影等級的影片生成，Nano Banana 2支援高保真圖像建立，而Gemini為每個工作流程帶來多模態智慧。透過單一API key即可存取完整的Google模型套件，提供Day-0可用性和隨用隨付（pay-as-you-go）定價。

檢視系列

Seedance 2.0 Mini

Seedance 2.0 Mini 將 ByteDance 的多模態影片生成技術引入到對速度和成本要求極高的工作流程中。它以更輕量的佔用空間提供 Seedance 2.0 的核心能力——更快的生成速度、更低的單支影片成本，並且使用您現有的同款 API 整合。對於運行高吞吐量流水線或進行大規模原型設計的團隊來說，Mini 是最實用的預設選擇。

檢視系列

ByteDance

從電影級影片生成到高保真影像建立，ByteDance 最強大的模型現已在 Atlas Cloud 上線。以最低的推論定價和零基礎設施開銷，大規模執行 Seedance 和 Seedream。

檢視系列

Alibaba

Atlas Cloud 將 Alibaba 的全系模型陣容整合至同一個 API 中：Qwen 適用於語言和圖像任務，Wan 適用於高達 1080p 的影片生成。所有模型均採用按需付費模式，無需訂閱。您可以使用現有的 OpenAI 兼容客戶端，透過單一的 base URL 存取 Alibaba API。

檢視系列

OpenAI

Atlas Cloud 為您提供存取完整 OpenAI API 產品線的權限，從用於圖像生成的 GPT Image 2 到用於影片的 Sora 2。每個模型均採用按需付費模式，無月度消費限制。使用相容 OpenAI 的 API，只需簡單替換基礎 URL 即可輕鬆接入。

檢視系列

xAI

在 Atlas Cloud 上使用 xAI API 建構完整的影像與影片處理管線。以 2K 解析度生成、使用參考影像進行編輯，並將影像動畫化為音訊同步的影片片段。

檢視系列

Kwaivgi

Kwaivgi API 價格低於標準定價 15%。Atlas Cloud 提供對最新 Kling 版本的零日（Day-0）存取權限，採用按需付費定價且無席位限制。一個帳戶，一個金鑰，暢享從標準版到大師版的所有 Kling 模型。

檢視系列

Seedream 5.0 Pro

Seedream 5.0 Pro API 為開發者在 Atlas Cloud 上提供了字節跳動的可控圖像編輯模型。它透過錨點和座標精確定位編輯，將圖像分離為可編輯圖層，融合多個參考，並精準匹配顏色和材質，支援 2K 和 3K 解析度的多語言文本。在 Atlas Cloud 上，您只需一個金鑰即可存取！

檢視系列

一個 API，暢享全模態 AI。

探索全部模型

GPT Image API with 3 Model Tiers

探索領先模型

Openai GPT Image 2 Text-to-Image

Openai GPT Image 2 Edit

Openai GPT Image-1.5 Text-to-image

Openai GPT Image-1.5 Edit

Openai GPT Image-1 Text-to-image

Openai GPT Image-1 Edit

Openai GPT Image-1 Mini Text-to-image

Openai GPT Image-1 Mini Edit

GPT Image 2 Developer Edit

GPT Image 2 Developer Text-to-Image

峰值速度

GPT Image API 的主要特性

使用 GPT Image API 進行靈活的風格生成

使用 GPT Image API 實現高視覺保真度

使用 GPT Image API 實現精準的文字渲染

使用 GPT Image API 的基於知識的創造力

使用 GPT Image API 進行基於遮罩的編輯

背景與透明度控制

Quality Tier Control

Comparisons with One Prompt

GPT Image API Use Cases for Image Generation

Professional Photography & Visual Art

UI/UX Design & Mockups

Marketing & Advertising Campaigns

Creative Concept Art & Illustration

Style Transfer & Artistic Transformation

Content Localization & Adaptation

模型對比

如何在 Atlas Cloud 上使用 GPT Image

建立 Atlas Cloud 帳戶

為何在 Atlas Cloud 使用 GPT Image

效能與靈活性

企業與規模

GPT Image API FAQ

探索更多系列

Seedance 2.0

Grok Imagine

Gemini Omni Flash

GPT Image 2

Google

Seedance 2.0 Mini

ByteDance

Alibaba

OpenAI

xAI

Kwaivgi

Seedream 5.0 Pro

一個 API，暢享全模態 AI。

Join our Discord community