HiDream O1 1.5 Image API for Pixel-Native Creation

HiDream O1 1.5 Image API 將 HiDream.ai 的統一基礎模型帶入你的技術堆疊，在同一套像素級系統上執行文字轉圖片、單張圖片編輯，以及以主體為核心的個人化生成。可調整 guidance 與 inference steps，並支援六種長寬比預設，確保高度貼合提示詞。Atlas Cloud 透過單一 OpenAI 相容端點提供此服務，採透明的隨用隨付定價，每張圖片 $0.044。立即開始建置。

探索領先模型(2)

NEW

文生圖

HiDream O1 1.5 Text-to-Image

暫無描述

HiDream O1 1.5 Edit

暫無描述

From

$0.044/張

Every HiDream O1 1.5 Image API Endpoint, Side by Side

Compare what each route of the HiDream O1 1.5 Image API takes in, renders out, and charges per call.

Modality	Description
HiDream O1 1.5 Text-to-Image API (Text To Image)	Turn a written prompt of up to 2,500 characters into a fully composed image across six presets, from a 512x512 square to 16:9 landscape, with PNG, JPEG, or WebP output. Denoising steps range from 1 to 100 and guidance scale from 1.0 to 20.0, so each request can trade speed against how tightly the result follows your prompt. At $0.044 per image, it fits e-commerce mockups, advertising concepts, and game art produced at volume.
HiDream O1 1.5 Edit API (Image Editing)	Feed one reference image URL alongside your instruction and this endpoint rewrites that image, or pass several URLs for subject-driven personalization across a set. It shares the same six size presets, 1 to 100 inference steps, and 1.0 to 20.0 guidance range as the text-to-image route, returning PNG, JPEG, or WebP. Billed at $0.044 per image, it handles product retouching, background swaps, and consistent character edits.

Modality

Description

HiDream O1 1.5 Text-to-Image API (Text To Image)

Turn a written prompt of up to 2,500 characters into a fully composed image across six presets, from a 512x512 square to 16:9 landscape, with PNG, JPEG, or WebP output. Denoising steps range from 1 to 100 and guidance scale from 1.0 to 20.0, so each request can trade speed against how tightly the result follows your prompt. At $0.044 per image, it fits e-commerce mockups, advertising concepts, and game art produced at volume.

HiDream O1 1.5 Edit API (Image Editing)

Feed one reference image URL alongside your instruction and this endpoint rewrites that image, or pass several URLs for subject-driven personalization across a set. It shares the same six size presets, 1 to 100 inference steps, and 1.0 to 20.0 guidance range as the text-to-image route, returning PNG, JPEG, or WebP. Billed at $0.044 per image, it handles product retouching, background swaps, and consistent character edits.

HiDream O1 1.5 Image API 內建精準度與控制能力

HiDream O1 1.5 Image API 將文字轉圖像生成、指令式編輯，以及以主體為核心的個人化整合在同一個 pixel-native 模型中，可呈現精準的雙語文字，並讓開發者直接控制 guidance、sampling steps 與輸出格式。

使用 HiDream O1 1.5 Image API 進行文字轉圖像

送出最多 2,500 個字元的提示詞，模型會透過單一 pixel-native transformer 將其渲染成完整圖像；此 transformer 會在同一個共享空間中編碼像素、文字與任務條件。由於流程中沒有外部 VAE 或獨立文字編碼器，細節與構圖在密集、多子句描述中也能保持穩定。這讓它成為概念美術、行銷視覺與產品模型圖的可靠基礎。

雙語文字與版面渲染

很少有圖像模型能在構圖中放入清晰可讀的文字，但 HiDream O1 1.5 能乾淨呈現中文、英文、混合語言字串與數值資料，品質足以省去手動修圖。pixel-native 設計可處理多區塊版面，讓標題、說明文字與標籤保持銳利；相較之下，latent-space 模型常會讓字體模糊或亂碼。設計師可以製作文字已可直接交付的海報、包裝與社群圖像。

HiDream O1 1.5 Image API 的情境式編輯

當你傳入一個參考圖像 URL，並搭配像「移除耳機」這類自然語言指令時，edit endpoint 會套用變更，同時保留周圍構圖。生成與編輯由同一個模型完成，因此光線、風格與未修改區域能保持一致，而不是從零重建。團隊可用它在已核准的視覺素材上快速迭代，無需完整重新設計。

以主體為核心的個人化

多個參考圖像 URL 可讓模型鎖定主體，並在全新的場景、姿勢與背景中延續其識別特徵。這種 subject-driven 模式無需針對每張圖像進行 fine-tuning，就能讓角色、產品或品牌吉祥物在不同生成結果中保持可辨識。它適合用於同一形象需要反覆出現的行銷活動、分鏡腳本與遊戲素材。

一把金鑰、完整控制、用多少付多少

你實際需要多少控制權？可將 guidance_scale 從 1.0 調整到 20.0、將 inference steps 從 1 調整到 100，選擇六種長寬比預設之一，並匯出為 PNG、JPEG 或 WebP。每次呼叫都透過單一 OpenAI-compatible endpoint 執行，價格透明，每張圖像 $0.044，採用 pay-as-you-go 計費且無需訂閱。立即開始建置。

HiDream O1 1.5 Image API 對比領先模型：一個提示詞，三種渲染結果

將同一個提示詞同時送入 HiDream O1 1.5 Image API 與兩個競品影像模型，然後比較各自如何把相同文字詮釋為構圖、光線與細節。

提示詞

一座地中海港灣小鎮裡熱鬧的清晨魚市場，木製攤位一字排開，手寫粉筆價格板標示著當日新鮮漁獲；一位穿著條紋圍裙的年輕魚販正笑著做出動作，把一條銀色沙丁魚拋向空中；低角度金色側光掠過濕潤的鵝卵石路面與閃閃發亮的魚鱗；深度望遠壓縮感將攤位層層堆疊到後方柔和霧氣中的港口；色彩以青綠色百葉窗、溫暖陶土色牆面與冷冽銀色魚身構成；清晰的粉筆字跡與風化木紋；自然抓拍的紀實報導攝影，35mm，寬幅 16:9 畫面比例，滿版出血

Generated with HiDream O1 1.5 on Atlas Cloud

Generated with Nano Banana Pro on Atlas Cloud

Generated with Seedream v4.5 on Atlas Cloud

提示詞

一對緋紅金剛鸚鵡在結果的 cecropia 樹枝上爭吵的瞬間，翅膀張開成一片緋紅與鈷藍的爆發，其中一隻鳥在拍翅間倒掛翻滾；柔和陰天的叢林逆光透過半透明羽毛發亮；以 400mm telephoto 拍攝，將層層霧氣中的雨林壓縮到背景；右側三分之一留有大片淡色天空的負空間；紅色羽毛在深翡翠綠葉叢映襯下形成互補色對比；羽枝與鳥喙質感銳利清晰地呈現；自然史野生動物攝影，寬幅 16:9 畫面比例，滿版出血

Generated with HiDream O1 1.5 on Atlas Cloud

Generated with Nano Banana Pro on Atlas Cloud

Generated with Seedream v4.5 on Atlas Cloud

使用 HiDream O1 1.5 Image API，從提示詞走到正式上線

橫跨電子商務、廣告、遊戲美術與社群行銷活動，HiDream O1 1.5 Image API 可將單一提示詞或一組參考素材轉換為影像生成、編輯，以及主體一致的個人化內容，每張圖片固定 $0.044。

電子商務產品視覺素材

零售團隊可透過文字提示詞生成產品照與生活情境圖，每張圖片 $0.044，並可從六種長寬比預設中選擇。無需拍攝或等待攝影棚交件，即可完成型錄視覺素材。

以 HiDream O1 1.5 Image API 打造廣告創意

製作活動海報與橫幅，輸出構圖嚴謹、具電影感打光的版面，支援橫式、直式與方形構圖。代理商可在一次工作流程中快速迭代主視覺，並將可直接投入製作的美術素材交付給客戶。

精準照片編輯

只需一張參考圖片加上一段編輯提示詞，模型就能在保留原有結構與光線的同時，重新套用風格、修飾或重構照片。設計師無需完整影像編輯工具，也能修正背景或替換元素。

透過 HiDream O1 1.5 Image API 維持角色一致性

輸入多張參考圖片後，模型可在全新場景中維持角色、產品或吉祥物的一致性。工作室能建立可重複使用的品牌資產與行銷系列，並確保視覺始終符合設定。

遊戲美術與概念設計

當遊戲團隊需要環境、道具或角色概念時，模型可依 guidance scale 與 inference steps 調整，輸出細節豐富的美術圖。美術指導可在投入工作室製作時間前，先探索不同視覺方向。

使用 HiDream O1 1.5 Image API 製作社群行銷活動

內容排程很滿嗎？行銷人員可快速產出能吸引停留的貼文、限時動態與縮圖素材，支援方形、直式與橫式預設；每張圖片皆以固定且可預期的 $0.044 生成。

HiDream O1 1.5 Image API 與競品影像模型的比較

了解 HiDream O1 1.5 Image API 在內建推理、雙語文字、開放權重與單張圖片成本方面，相較於 Alibaba 和 ByteDance 影像模型的表現。

模型	供應商	推理提示代理	雙語文字渲染	開放權重	價格（每張圖片）
HiDream O1 1.5 Text-to-Image	HiDream.ai	√	√	√	$0.044
HiDream O1 1.5 Edit	HiDream.ai	√	√	√	$0.044
Qwen Image 2.0	Alibaba (Qwen)	-	√	-	$0.035
Seedream v4.5	ByteDance	-	√	-	$0.04

如何在 Atlas Cloud 上使用 HiDream

幾分鐘即可上手 — 按照以下簡單步驟，透過 Atlas Cloud 平台整合和部署模型。

建立 Atlas Cloud 帳戶

在 atlascloud.ai 註冊並完成驗證。新用戶可獲得免費額度，用於探索平台和測試模型。

為何在 Atlas Cloud 使用 HiDream

將先進的 HiDream 模型與 Atlas Cloud 的 GPU 加速平台相結合，提供無與倫比的效能、可擴展性和開發體驗。

效能與靈活性

低延遲：
GPU 最佳化推理，實現即時回應。

統一 API：
一次整合，暢用 HiDream、GPT、Gemini 和 DeepSeek。

透明定價：
按 Token 計費，支援 Serverless 模式。

企業與規模

開發者體驗：
SDK、資料分析、微調工具和模板一應俱全。

可靠性：
99.99% 可用性、RBAC 權限控制、合規日誌。

安全與合規：
SOC 2 Type II 認證、HIPAA 合規、美國資料主權。

HiDream O1 1.5 Image API Questions, Answered

The HiDream O1 1.5 Image API gives developers programmatic access to HiDream's unified image generation model through a single OpenAI-compatible endpoint on Atlas Cloud. Built on a pixel-level unified transformer, it delivers text-to-image, editing, and subject-driven personalization from one model instead of a stack of separate tools. Access is Day-0 with pay-as-you-go, transparent per-call pricing.

Beyond straightforward text-to-image generation, the model handles instruction-based editing, subject-driven personalization across multiple reference images, and accurate long-text rendering for posters and commercial graphics. Teams reach for it in e-commerce product visuals, advertising creative, and game art, where tight composition and legible on-image text both matter.

Yes. HiDream O1 1.5 was trained to interpret nuanced prompts in both Chinese and English, and it renders multilingual on-image text with strong accuracy. That makes it a practical fit for teams shipping localized visuals without switching between models.

You call the HiDream O1 1.5 Image API with one OpenAI-compatible key, so most existing SDKs work once you point them at the Atlas Cloud endpoint. Send a request with your prompt and any optional parameters to the hidream-o1-1.5/text-to-image model, then read back the generated image. No separate model hosting or GPU infrastructure is required on your side.

Prompts can run up to 2,500 characters, and you pick from preset sizes including square_hd at 1024x1024, square at 512x512, plus portrait and landscape options in 4:3 and 16:9. You can also tune num_inference_steps from 1 to 100 with a default of 50, set guidance_scale between 1.0 and 20.0 with a default of 5.0, and return PNG, JPEG, or WebP.

Pass a single URL in reference_image_urls to run instruction-based editing on an existing image, or supply multiple URLs to drive personalization that keeps a consistent subject across scenes. Leave the field empty for standard text-to-image generation. A dedicated hidream-o1-1.5/edit model is available for editing workflows at the same per-image rate.

The HiDream O1 1.5 Image API is priced at $0.044 per image on Atlas Cloud, and the text-to-image and edit models share that same rate. Billing is pay-as-you-go with transparent per-call pricing, so you pay only for the images you generate with no subscription. Start building today.

On Atlas Cloud you choose a preset size such as square_hd at 1024x1024, and the model synthesizes each image directly from raw pixels through its unified transformer rather than compressing into a latent space. Because detail and on-image text are generated instead of upscaled from a bottleneck, HiDream is known for clean typography and crisp edges in posters and product graphics.

探索更多系列

Seedance 2.0

Seedance 2.0 API 為您提供 ByteDance 多模態影片模型的生產級存取權限——支援四模態輸入（文字、影像、影片、音訊），以及業界領先的「Universal Reference」（通用參考）系統，可在不同鏡頭間鎖定構圖、運鏡與角色動作。只需一次 API 呼叫即可整合導演級控制，固定費率為 $0.09/秒，即時取得金鑰，無需排隊——由企業級正常運行時間與合規性提供保障。Seedance 2.0 原生 4K 現已上線！

檢視系列

Grok Imagine

Grok Imagine API 為開發者提供 xAI 的圖像、影片和音訊生成一站式套件。它可以生成解析度高達 2K 且支援多語言文本渲染的圖像，以及長達 15 秒且帶有原生同步音訊和基於參考圖像編輯功能的影片。在 Atlas Cloud 上，只需一個金鑰即可執行每個 Grok Imagine 模式，因此您可以在圖像、影片和音訊之間無縫切換，無需單獨設定，每張圖像 0.02 美元起，每秒 0.05 美元起。

檢視系列

Gemini Omni Flash

Gemini Omni API 將 Google DeepMind 於 Google I/O 2026 發表的多模態影片生成與編輯模型帶進你的技術棧。Gemini Omni 將 Gemini 的推理引擎與生成式媒體融合，可接受文字、圖片、影片與音訊的任意組合輸入，產生一致且以知識為根據的輸出。透過自然對話持續打磨成果：替換物件、改寫場景、切換風格，同時維持物理規律、角色與畫面連貫性不變。Atlas Cloud 透過單一整合 API 提供完整的 Gemini Omni Flash 系列——文字生成影片、支援最多 7 張參考圖片的圖片生成影片，以及參考圖生成影片——採每秒計費、價格透明，$0.112 起，無需訂閱。立即開始打造。

檢視系列

GPT Image 2

GPT Image 2 API 為開發者提供了訪問 OpenAI 最新圖像模型的途徑，它是 GPT Image 1.5 的繼任者。該模型可生成和編輯圖像，能夠在拉丁和 CJK 文字上實現準確的文本渲染，並在海報、樣機和資訊圖表方面具備強大的排版能力。在 Atlas Cloud 上，您可以透過一個統一的 API 與 300 多個模型一起訪問它，並享受免費額度、99.99% 的正常運行時間，且無需 OpenAI 組織驗證。

檢視系列

Google

Google最強大的創意模型現已在Atlas Cloud上全面可用。Veo 3.1提供電影等級的影片生成，Nano Banana 2支援高保真圖像建立，而Gemini為每個工作流程帶來多模態智慧。透過單一API key即可存取完整的Google模型套件，提供Day-0可用性和隨用隨付（pay-as-you-go）定價。

檢視系列

Seedance 2.0 Mini

Seedance 2.0 Mini 將 ByteDance 的多模態影片生成技術引入到對速度和成本要求極高的工作流程中。它以更輕量的佔用空間提供 Seedance 2.0 的核心能力——更快的生成速度、更低的單支影片成本，並且使用您現有的同款 API 整合。對於運行高吞吐量流水線或進行大規模原型設計的團隊來說，Mini 是最實用的預設選擇。

檢視系列

ByteDance

從電影級影片生成到高保真影像建立，ByteDance 最強大的模型現已在 Atlas Cloud 上線。以最低的推論定價和零基礎設施開銷，大規模執行 Seedance 和 Seedream。

檢視系列

Alibaba

Atlas Cloud 將 Alibaba 的全系模型陣容整合至同一個 API 中：Qwen 適用於語言和圖像任務，Wan 適用於高達 1080p 的影片生成。所有模型均採用按需付費模式，無需訂閱。您可以使用現有的 OpenAI 兼容客戶端，透過單一的 base URL 存取 Alibaba API。

檢視系列

OpenAI

Atlas Cloud 為您提供存取完整 OpenAI API 產品線的權限，從用於圖像生成的 GPT Image 2 到用於影片的 Sora 2。每個模型均採用按需付費模式，無月度消費限制。使用相容 OpenAI 的 API，只需簡單替換基礎 URL 即可輕鬆接入。

檢視系列

xAI

在 Atlas Cloud 上使用 xAI API 建構完整的影像與影片處理管線。以 2K 解析度生成、使用參考影像進行編輯，並將影像動畫化為音訊同步的影片片段。

檢視系列

Kwaivgi

Kwaivgi API 價格低於標準定價 15%。Atlas Cloud 提供對最新 Kling 版本的零日（Day-0）存取權限，採用按需付費定價且無席位限制。一個帳戶，一個金鑰，暢享從標準版到大師版的所有 Kling 模型。

檢視系列

Seedream 5.0 Pro

Seedream 5.0 Pro API 為開發者在 Atlas Cloud 上提供了字節跳動的可控圖像編輯模型。它透過錨點和座標精確定位編輯，將圖像分離為可編輯圖層，融合多個參考，並精準匹配顏色和材質，支援 2K 和 3K 解析度的多語言文本。在 Atlas Cloud 上，您只需一個金鑰即可存取！

檢視系列

一個 API，暢享全模態 AI。

探索全部模型