GPT Image API with 3 Model Tiers

GPT Image API 为开发者提供了 OpenAI 的图像生成系列产品，包含 GPT Image 1、1.5 和 Mini 三个层级，每个层级均提供文本生成图像和图像编辑变体。这些模型在多种风格下均能提供准确的图像内文本、照片级的逼真渲染以及高度的提示词遵循能力。在 Atlas Cloud 上，您可以通过一个统一的 API 访问所有层级以及其他 300 多种模型，每张图像低至 0.004 美元，并拥有 99.99% 的正常运行时间保证。

探索领先模型

Atlas Cloud 为您提供最新的行业领先创意模型。

NEW

文生图

Openai GPT Image 2 Text-to-Image

GPT Image 2 text to image is OpenAI's fast, cost-efficient text-to-image generator powered by GPT-5 guidance. Create photorealistic shots, product renders, concept art, and stylized graphics from natural-language prompts (optionally conditioned with an image). Supports custom aspect ratios, seeds, negative prompts, hex color hints, and style presets. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

Openai GPT Image 2 Edit

GPT Image 2 Edit is OpenAI's image model for precise, natural-language edits. Add/remove objects, swap backgrounds, retouch faces, adjust colors/lighting, edit text/graphics, crop/resize, and apply hex color control. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

Openai GPT Image-1.5 Text-to-image

GPT Image 1.5 text to image is OpenAI’s fast, cost-efficient text-to-image generator powered by GPT-5 guidance. Create photorealistic shots, product renders, concept art, and stylized graphics from natural-language prompts (optionally conditioned with an image). Supports custom aspect ratios, seeds, negative prompts, hex color hints, and style presets. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

Openai GPT Image-1.5 Edit

GPT Image 1.5 Edit is OpenAI’s image model for precise, natural-language edits. Add/remove objects, swap backgrounds, retouch faces, adjust colors/lighting, edit text/graphics, crop/resize, and apply hex color control. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

Openai GPT Image-1 Text-to-image

OpenAI GPT Image-1 generates images from text prompts from OpenAI's latest text-to-image model, ideal for creating visual assets. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

Openai GPT Image-1 Edit

OpenAI's gpt-image-1 enables image generation and image editing via OpenAI's image API, ideal for creating and refining images. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

Openai GPT Image-1 Mini Text-to-image

GPT Image 1 Mini is a cost-efficient multimodal OpenAI model powered by GPT-5 that turns text or image prompts into high-quality images. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

Openai GPT Image-1 Mini Edit

GPT Image 1 Mini is a cost-efficient, natively multimodal OpenAI model that pairs GPT-5 language understanding with compact image editing and generation from text and image inputs to produce high-quality images. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

GPT Image 2 Developer Edit

GPT Image 2 Developer Edit applies natural-language instructions to one or more reference images, with common aspect ratios and 1k, 2k, or supported 4k output tiers. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

GPT Image 2 Developer Text-to-Image

GPT Image 2 Developer Text-to-Image generates polished visuals from natural-language prompts, with common aspect ratios and 1k, 2k, or supported 4k output tiers. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

From$0.009/张

$0.004/张

-50%

峰值速度

最低成本

模态	描述
GPT Image-1 T2I API(Text to Image)	GPT Image-1 文本生成图像 API 赋能开发者将文本提示转化为细节丰富、令人惊叹的逼真视觉效果。通过将 GPT-4 Turbo 的推理能力与 DALL·E 级别的视觉合成技术相结合，它为专业级图像制作提供了业界领先的提示词遵循度与复杂构图能力。
GPT Image-1 Edit API(Image to Image)	GPT Image-1 Edit API 赋能开发者，以无缝的一致性将现有图像转化为经过精细调整或重新构想的杰作。通过利用多模态理解能力，它能够生成精确的风格迁移、情境构图以及针对性的修改，以实现专业级的资产迭代。
GPT Image-1.5 T2I API(Text to Image)	GPT Image-1.5 Text to Image API 赋能开发者以优化的成本将文本提示词转化为高质量的视觉效果。通过利用基于 GPT 的架构，它提供了强大的提示词理解能力和视觉保真度，以实现平衡的生产工作流。
GPT Image-1.5 Edit API(Image to Image)	GPT Image-1.5 Edit API 使开发者能够通过精确的修改来完善现有资产。通过支持 input_fidelity 控制，它能够进行微调，同时保留面部和徽标等关键元素。
GPT Image-1 Mini T2I API(Text to Image)	GPT Image-1 Mini Text to Image API 为开发者提供了该系列中性价比最高的图像生成能力。借助 GPT-5 架构，它以最低的单张图像成本提供专业级的生成结果，是高频大批量内容生产的理想选择。
GPT Image-1 Mini Edit API(Image to Image)	GPT Image-1 Mini Edit API 赋能开发者，通过精简的编辑功能改造现有图像。它以极低的成本提供必要的编辑功能，从而实现快速迭代和内容生产工作流。

GPT Image API 的主要特性

探索 GPT Image API 提供的功能，从灵活的风格、照片级的逼真度、准确的图像内文本到基于蒙版的编辑、背景控制和质量分级。

使用 GPT Image API 进行灵活的风格生成

生成涵盖逼真摄影、风格化艺术作品、概念艺术、信息图表、3D风格插画等多领域的丰富视觉输出。从电影级景观到 UI 界面原型，该模型均能精准契合您的创作方向。

使用 GPT Image API 实现高视觉保真度

Maintains object relationships, lighting consistency, and color balance with industry-leading prompt adherence. Generated images exhibit natural textures, accurate proportions, and physically plausible compositions.

使用 GPT Image API 实现精准的文本渲染

能够在图像中生成清晰、易读的排版——是海报、梗图、漫画、品牌视觉设计以及任何需要整合文本元素的项目的理想选择。

使用 GPT Image API 的基于知识的创造力

借助 GPT-4/GPT-5 的世界知识，生成事实上准确且符合语境的视觉内容。该模型理解文化背景、历史语境以及特定领域的概念。

使用 GPT Image API 进行基于蒙版的编辑

使用可选的蒙版输入编辑特定区域，仅修改选中部分，同时保持图像其余部分原样不变。这使得 GPT Image API 在修图、物体移除和精确的构图调整方面非常可靠。

背景与透明度控制

在受支持的模型上自定义背景并生成透明输出，是徽标、产品展示图和分层设计工作的理想选择。您可以将主体放置在新的场景中，或导出干净的抠图，而无需手动创建蒙版。

Quality Tier Control

在每次请求时选择低、中或高质量，以平衡您工作负载的细节与成本。较低层级可加速大批量草稿的生成，而高质量层级则能为最终资产提供最具照片级真实感的结果。

Comparisons with One Prompt

提示词

Surrealist fashion campaign poster, quadrant layout (2x2 grid of 4 variations), extreme macro photography of a human eye filling the entire frame as background — iris colors vary across panels: blue-green teal, golden hazel, natural brown — hyperrealistic eye texture with visible pores on eyelid skin, dramatic long eyelashes in black with some purple/violet colored lash extensions spiking outward in an editorial exaggerated style, miniaturized female model composited realistically into the eye environment, appearing to sit casually on the lower eyelid or eyelash roots, model wearing streetwear/casual fashion outfits — variations include: oversized grey graphic sweatshirt + black plaid wide-leg pants + black chunky platform boots, grey long-sleeve polo shirt + sage green cargo pants + tan Timberland boots + camo backpack, bold typographic brand logo "LKNLN" stamped/tattooed directly onto the eyelid skin in dark gothic/industrial bold sans-serif font, appearing as if embossed or inked into skin, lighting: dramatic studio lighting on the eye, soft fill on model, depth of field contrast between hyper-sharp iris and soft skin surroundings, color palette: skin tones, teal/hazel iris, muted sage green, plaid grey-black, amber boots, purple accent lashes, photorealistic composite, editorial fashion photography style, small watermark "AI dsgn" in bottom left corner, ultra high resolution, cinematic color grading

GPT Image 1

GPT Image 1.5

GPT Image 2

GPT Image API Use Cases for Image Generation

探索您可以使用 GPT Image API 构建的内容，从专业摄影和 UI 原型到营销活动、概念艺术、风格迁移以及内容本地化。

Professional Photography & Visual Art

Generate photorealistic images with cinematic lighting, precise composition, and natural textures. From product photography to editorial visuals, GPT Image models produce outputs indistinguishable from professional camera work.

UI/UX Design & Mockups

Create clean, modern design concepts including app interfaces, dashboards, websites, and product layouts. The models excel at generating structured compositions with professional aesthetics.

Marketing & Advertising Campaigns

Rapidly produce campaign-ready visuals for social media, digital ads, and brand marketing. Support for multiple quality tiers enables both rapid A/B testing and high-end final deliverables.

Creative Concept Art & Illustration

Explore styles, moodboards, and concept art at speed. Generate illustrations in diverse artistic styles — from watercolor paintings to anime, comic books to oil paintings.

Style Transfer & Artistic Transformation

Transform existing images into different artistic styles while preserving core subject matter. Convert photos to cartoons, paintings, sketches, or any aesthetic direction with natural language instructions.

Content Localization & Adaptation

Quickly adapt visual content for different markets, audiences, or platforms. Modify backgrounds, adjust colors, update styling, or re-contextualize imagery through simple text descriptions.

模型对比

查看不同厂商的模型表现 — 对比性能、价格和独特优势，做出明智决策。

Model	Reference Image Limit	Output Num	Resolution	Aspect Ratio
GPT Image-1	4	1~10	1024×1024, 1024×1536, 1536×1024	1:1, 3:2, 2:3
GPT Image-1.5	10	1	1024×1024, 1024×1536, 1536×1024	1:1, 3:2, 2:3
GPT Image-1 Mini	4	1~10	1024×1024, 1024×1536, 1536×1024	1:1, 3:2, 2:3
Nano Banana 2	14	1	4K, 2K, 1K	1:1 3:2 2:3 3:4 4:3 4:5 5:4 9:16 16:9 21:9
Seedream 5.0	14	1~15	2K~4K+	1:1 3:2 2:3 3:4 4:3 4:5 5:4 9:16 16:9 21:9

如何在 Atlas Cloud 上使用 GPT Image

几分钟即可上手 — 按照以下简单步骤，通过 Atlas Cloud 平台集成和部署模型。

创建 Atlas Cloud 账户

在 atlascloud.ai 注册并完成验证。新用户可获得免费额度，用于探索平台和测试模型。

为何在 Atlas Cloud 使用 GPT Image

将先进的 GPT Image 模型与 Atlas Cloud 的 GPU 加速平台相结合，提供无与伦比的性能、可扩展性和开发体验。

性能与灵活性

低延迟：
GPU 优化推理，实现实时响应。

统一 API：
一次集成，畅用 GPT Image、GPT、Gemini 和 DeepSeek。

透明定价：
按 Token 计费，支持 Serverless 模式。

企业与规模

开发者体验：
SDK、数据分析、微调工具和模板一应俱全。

可靠性：
99.99% 可用性、RBAC 权限控制、合规日志。

安全与合规：
SOC 2 Type II 认证、HIPAA 合规、美国数据主权。

GPT Image API FAQ

The GPT Image API offers three tiers. GPT Image-1 is the flagship for the highest quality, GPT Image-1.5 balances strong quality with lower cost, and GPT Image-1 Mini is the most cost-efficient for high-volume work. Each tier is available in both text to image and image to image variants.

Each model supports Low, Medium, and High quality settings. Higher quality produces more detailed and photorealistic results but at higher cost. For initial testing and previews, use Low quality for speed and savings. Switch to High quality for final deliverables requiring maximum fidelity.

Text-to-Image models support three output sizes: 1024×1024 (square), 1024×1536 (portrait), and 1536×1024 (landscape). Choose based on your use case — portrait for characters and vertical art, landscape for cinematic scenes and wide compositions, square for general purpose content.

Yes. The GPT Image API edit models accept an optional mask input, so you can control exactly which regions of an image are modified while the rest stays untouched. This supports precise inpainting for retouching, object removal, and localized changes.

The GPT Image API gives developers programmatic access to OpenAI's GPT Image family, a suite of multimodal image generation and editing models. It generates and edits images from text and image inputs, with accurate in-image text, photorealistic rendering, and strong prompt adherence. On Atlas Cloud you reach all three tiers through one unified API alongside 300+ models.

On Atlas Cloud the GPT Image API uses flat per-image pricing, starting at $0.004 per image on GPT Image-1 Mini, $0.008 on GPT Image-1.5, and $0.009 on GPT Image-1. Pricing is transparent with no token math, so you can predict the cost per generation before you run it.

No. OpenAI gates the GPT Image models behind organization verification in its own developer console, which can block individual developers. With the GPT Image API on Atlas Cloud you only need an Atlas Cloud account, so you can get a key and start generating without OpenAI verification.

Yes. Images you generate through the GPT Image API come with full commercial usage rights, and you retain ownership of the content you create. This makes it suitable for client work, marketing campaigns, and products you ship.

Yes. Atlas Cloud exposes an OpenAI-compatible API, so you can point the OpenAI SDK at the Atlas Cloud base URL, add your Atlas key, and call the GPT Image API with your existing code. You can make your first request in minutes without rebuilding your integration.

The GPT Image API gives you programmatic control that the chat experience does not, including quality settings, output size and format, mask-based editing, and batch generation. It is built for integrating image generation into your own apps and pipelines, rather than one-off creation in a chat window.

探索更多系列

Seedance 2.0

Seedance 2.0 API 为您提供 ByteDance 多模态视频模型的生产级访问权限——支持四模态输入（文本、图像、视频、音频），以及行业领先的“Universal Reference”（通用参考）系统，可在不同镜头间锁定构图、运镜和角色动作。只需一次 API 调用即可集成导演级控制，固定费率为 $0.09/秒，即时获取密钥，无需排队——由企业级正常运行时间和合规性提供保障。Seedance 2.0 原生 4K 现已上线！

查看系列

Grok Imagine

Grok Imagine API 为开发者提供 xAI 的图像、视频和音频生成一站式套件。它可以生成分辨率高达 2K 且支持多语言文本渲染的图像，以及长达 15 秒且带有原生同步音频和基于参考图像编辑功能的视频。在 Atlas Cloud 上，只需一个密钥即可运行每个 Grok Imagine 模式，因此您可以在图像、视频和音频之间无缝切换，无需单独设置，每张图像 0.02 美元起，每秒 0.05 美元起。

查看系列

Gemini Omni Flash

Gemini Omni API 将 Google DeepMind 在 Google I/O 2026 上发布的多模态视频生成与编辑模型带入你的技术栈。Gemini Omni 将 Gemini 的推理引擎与生成式媒体融合，可接受文本、图像、视频和音频的任意组合输入，生成一致且以知识为依据的输出。通过自然对话不断打磨结果：替换物体、重写场景、切换风格，同时保持物理规律、角色形象和画面连贯性不变。Atlas Cloud 通过统一的 API 提供完整的 Gemini Omni Flash 系列——文生视频、支持最多 7 张参考图的图生视频，以及参考图生视频——按秒计费、价格透明，低至 $0.112 起，且无需订阅。立即开始构建。

查看系列

GPT Image 2

GPT Image 2 API 为开发者提供了访问 OpenAI 最新图像模型的途径，它是 GPT Image 1.5 的继任者。该模型可生成和编辑图像，能够在拉丁和 CJK 文字上实现准确的文本渲染，并在海报、样机和信息图表方面具备强大的排版能力。在 Atlas Cloud 上，您可以通过一个统一的 API 与 300 多个模型一起访问它，并享受免费额度、99.99% 的正常运行时间，且无需 OpenAI 组织验证。

查看系列

Google

Google最强大的创意模型现已在Atlas Cloud上全面可用。Veo 3.1提供电影级别的视频生成，Nano Banana 2支持高保真图像创建，而Gemini为每个工作流带来多模态智能。通过单一API key即可访问完整的Google模型套件，提供Day-0可用性和按需付费（pay-as-you-go）定价。

查看系列

Seedance 2.0 Mini

Seedance 2.0 Mini 将 ByteDance 的多模态视频生成技术引入到对速度和成本要求极高的工作流中。它以更轻量的占用空间提供 Seedance 2.0 的核心能力——更快的生成速度、更低的单条视频成本，并且使用您现有的同款 API 集成。对于运行高吞吐量流水线或进行大规模原型设计的团队来说，Mini 是最实用的默认选择。

查看系列

ByteDance

从电影级视频生成到高保真图像创建，ByteDance 最强大的模型现已在 Atlas Cloud 上线。以最低的推理定价和零基础设施开销，大规模运行 Seedance 和 Seedream。

查看系列

Alibaba

Atlas Cloud 将 Alibaba 的全系模型阵容整合至同一个 API 中：Qwen 用于语言和图像任务，Wan 用于高达 1080p 的视频生成。所有模型均采用按需付费模式，无需订阅。您可以使用现有的 OpenAI 兼容客户端，通过单一的 base URL 访问 Alibaba API。

查看系列

OpenAI

Atlas Cloud 为您提供访问完整 OpenAI API 产品线的权限，从用于图像生成的 GPT Image 2 到用于视频的 Sora 2。每个模型均采用按需付费模式，无月度消费限制。使用兼容 OpenAI 的 API，只需简单替换基础 URL 即可轻松接入。

查看系列

xAI

在 Atlas Cloud 上使用 xAI API 构建完整的图像和视频处理工作流。以 2K 分辨率生成、使用参考图像进行编辑，并将图像动画化为音画同步的视频片段。

查看系列

Kwaivgi

Kwaivgi API 价格低于标准定价 15%。Atlas Cloud 提供对最新 Kling 版本的零日（Day-0）访问权限，采用按需付费定价且无席位限制。一个账户，一个密钥，畅享从标准版到大师版的所有 Kling 模型。

查看系列

Seedream 5.0 Pro

Seedream 5.0 Pro API 为开发者在 Atlas Cloud 上提供了字节跳动的可控图像编辑模型。它通过锚点和坐标精确定位编辑，将图像分离为可编辑图层，融合多个参考，并精准匹配颜色和材质，支持 2K 和 3K 分辨率的多语言文本。在 Atlas Cloud 上，您只需一个密钥即可访问！

查看系列

一个 API，畅享全模态 AI。

探索全部模型

GPT Image API with 3 Model Tiers

探索领先模型

Openai GPT Image 2 Text-to-Image

Openai GPT Image 2 Edit

Openai GPT Image-1.5 Text-to-image

Openai GPT Image-1.5 Edit

Openai GPT Image-1 Text-to-image

Openai GPT Image-1 Edit

Openai GPT Image-1 Mini Text-to-image

Openai GPT Image-1 Mini Edit

GPT Image 2 Developer Edit

GPT Image 2 Developer Text-to-Image

峰值速度

GPT Image API 的主要特性

使用 GPT Image API 进行灵活的风格生成

使用 GPT Image API 实现高视觉保真度

使用 GPT Image API 实现精准的文本渲染

使用 GPT Image API 的基于知识的创造力

使用 GPT Image API 进行基于蒙版的编辑

背景与透明度控制

Quality Tier Control

Comparisons with One Prompt

GPT Image API Use Cases for Image Generation

Professional Photography & Visual Art

UI/UX Design & Mockups

Marketing & Advertising Campaigns

Creative Concept Art & Illustration

Style Transfer & Artistic Transformation

Content Localization & Adaptation

模型对比

如何在 Atlas Cloud 上使用 GPT Image

创建 Atlas Cloud 账户

为何在 Atlas Cloud 使用 GPT Image

性能与灵活性

企业与规模

GPT Image API FAQ

探索更多系列

Seedance 2.0

Grok Imagine

Gemini Omni Flash

GPT Image 2

Google

Seedance 2.0 Mini

ByteDance

Alibaba

OpenAI

xAI

Kwaivgi

Seedream 5.0 Pro

一个 API，畅享全模态 AI。

Join our Discord community