图生视频

Kling Video O3 4K Image-to-Video

Kling Omni Video O3 (4K) Image-to-Video transforms static images into dynamic cinematic videos using MVL technology. Supports first/last frame control and audio generation.

Nano Banana 2 Reference to Image

Seedance 2.0 Reference-to-Video

Seed3D 2.0 Image-to-3D

Wan-2.7 Text-to-video

分类

折扣模型 (125)

模型功能

模型系列

共 258 个模型，当前显示 48 个

HappyHorse-1.1 Text-to-video

Generates videos from text prompts with HappyHorse 1.1, supporting 720P or 1080P output, flexible aspect ratios, and durations from 3 to 15 seconds.

HappyHorse-1.1 Image-to-video

Animates a first-frame image into video with optional prompt guidance, 720P or 1080P output, and durations from 3 to 15 seconds.

HappyHorse-1.1 Reference-to-video

Generates videos from one to nine reference images and a text prompt, supporting 720P or 1080P output, flexible aspect ratios, and durations from 3 to 15 seconds.

Kling V3.0 Turbo Image-to-Video

Kling V3.0 Turbo Image-to-Video transforms static images into dynamic cinematic videos using MVL technology. Supports first/last frame control and audio generation.

Kling V3.0 Turbo Text-to-Video

Kling V3.0 Turbo Text-to-Video generates dynamic cinematic videos from text prompts using MVL technology. Supports first/last frame control and audio generation.

Kling Video O3 4K Image-to-Video

Kling Omni Video O3 (4K) Image-to-Video transforms static images into dynamic cinematic videos using MVL technology. Supports first/last frame control and audio generation.

Kling Video O3 4K Text-to-Video

Kling Omni Video O3 (4K) is Kuaishou advanced unified multi-modal video model with MVL (Multi-modal Visual Language) technology. Generates high-quality videos from text prompts with natural motion and audio generation support.

MAI-Image-2.5-Flash Text-to-image

Microsoft's fast, cost-optimized text-to-image generation model, creating high-quality images at lower cost using the same diffusion-based architecture as MAI-Image-2.5.

MAI-Image-2.5 Edit

Microsoft's flagship image-to-image editing model, enabling precise, controllable edits to existing images through natural language instructions.

MAI-Image-2.5 Text-to-image

Microsoft's flagship text-to-image generation model, designed to create high-quality, visually rich images from natural language prompts.

Midjourney V8.1 Remove Background

Midjourney automatically removes the background from an input image, returning one transparent-background result.

Midjourney V8.1 Style Transfer

Midjourney retexture changes the artistic style of an input image while preserving its composition, returning four restyled results.

Midjourney V8.1 Blend

Midjourney V8.1 blends two to five input images into four fused results, with an optional guiding prompt and native 2K HD.

Midjourney V8.1 Image-to-Image

Midjourney V8.1 re-imagines an input image guided by a text prompt, returning four variations. Supports native 2K HD, style reference, and aspect-ratio / stylize / chaos / weird controls.

Seed3D 2.0 Image-to-3D

ByteDance Seed3D 2.0 — generates a textured, PBR-shaded 3D model (glb/obj/usd/usdz) from a single input image. Returns a downloadable .zip archive containing the 3D file.

Midjourney V8.1 Image-to-Video

Midjourney V8.1 animates an input image into four 5-second videos at 480p or 720p.

Midjourney V8.1 Text-to-Image

Midjourney V8.1 generates four images from a text prompt, with optional native 2K HD, a style reference, and aspect-ratio / stylize / chaos / weird controls.

xAI TTS v1

xAI TTS v1 is a high-fidelity text-to-speech model that converts text into natural, expressive speech with sub-second latency, supporting 20 languages and 80+ voices with fine-grained delivery control.

Hunyuan 3D Rapid Image-to-3D

Tencent Hunyuan 3D Rapid (Express) — fast lightweight 3D mesh generation from a single image, with optional PBR materials. Outputs GLB/OBJ/USDZ/FBX/STL/MP4.

Hunyuan 3D Rapid Text-to-3D

Tencent Hunyuan 3D Rapid (Express) — fast lightweight 3D mesh generation from a text prompt, with optional PBR materials. Outputs GLB/OBJ/USDZ/FBX/STL/MP4.

Hunyuan 3D Pro Image-to-3D

Tencent Hunyuan 3D Pro (v3.1) — high-quality textured 3D mesh generation from a single image, with optional PBR materials and custom face count. Outputs GLB/OBJ/USDZ/FBX/STL.

Hunyuan 3D Pro Text-to-3D

Tencent Hunyuan 3D Pro (v3.1) — high-quality textured 3D mesh generation from a text prompt, with optional PBR materials and custom face count. Outputs GLB/OBJ/USDZ/FBX/STL.

Nano Banana 2 Reference to Image

Google's advanced AI-powered video-to-image generation model, designed to generate high-quality static images from video clips combined with text instructions.

Nano Banana 2 Reference to Image Developer

Google's advanced AI-powered video-to-image generation model, designed to generate high-quality static images from video clips combined with text instructions.

Openai GPT Image 2 Text-to-Image

GPT Image 2 text to image is OpenAI's fast, cost-efficient text-to-image generator powered by GPT-5 guidance. Create photorealistic shots, product renders, concept art, and stylized graphics from natural-language prompts (optionally conditioned with an image). Supports custom aspect ratios, seeds, negative prompts, hex color hints, and style presets. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

Openai GPT Image 2 Edit

GPT Image 2 Edit is OpenAI's image model for precise, natural-language edits. Add/remove objects, swap backgrounds, retouch faces, adjust colors/lighting, edit text/graphics, crop/resize, and apply hex color control. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

Fast video generation from first-frame image (and optional last-frame) with native audio.

Seedance 2.0 Fast Reference-to-Video

Fast multimodal video generation from reference images, videos, and audio. Supports video editing and extension.

Wan-2.7 Text-to-video

Generates videos from text prompts with multi-shot narrative, audio generation, and sound-image synchronization.

Vidu Q3-Mix Reference to Video

Vidu Q3-Mix Reference-to-Video generates videos from 1-4 reference images with consistent subjects. Offers strong visual quality with intelligent scene transitions, smooth dynamic effects, and audio support up to 1080p.

From$0.125/秒

$0.106/秒

-15%

Kling Video O3 4K Image-to-Video

HappyHorse-1.1 Text-to-video

HappyHorse-1.1 Image-to-video

HappyHorse-1.1 Reference-to-video

Kling V3.0 Turbo Image-to-Video

Kling V3.0 Turbo Text-to-Video

Kling Video O3 4K Image-to-Video

Kling Video O3 4K Text-to-Video

MAI-Image-2.5-Flash Text-to-image

MAI-Image-2.5 Edit

MAI-Image-2.5 Text-to-image

Midjourney V8.1 Remove Background

Midjourney V8.1 Style Transfer

Midjourney V8.1 Blend

Midjourney V8.1 Image-to-Image

Seed3D 2.0 Image-to-3D

Midjourney V8.1 Image-to-Video

Midjourney V8.1 Text-to-Image

xAI TTS v1

Hunyuan 3D Rapid Image-to-3D

Hunyuan 3D Rapid Text-to-3D

Hunyuan 3D Pro Image-to-3D

Hunyuan 3D Pro Text-to-3D

Nano Banana 2 Reference to Image

Nano Banana 2 Reference to Image Developer

Grok Imagine Video v1.5 Image-to-Video

Grok Imagine Image Quality Text-to-Image

Grok Imagine Image Quality Edit

HappyHorse-1.0 Text-to-video

HappyHorse-1.0 Image-to-video

HappyHorse-1.0 Reference-to-video

HappyHorse-1.0 Video-edit

Openai GPT Image 2 Text-to-Image

Openai GPT Image 2 Edit

Baidu ERNIE Image Turbo Text-to-image

Seedance 2.0 Text-to-Video

Seedance 2.0 Image-to-Video

Seedance 2.0 Reference-to-Video

Seedance 2.0 Fast Text-to-Video

Seedance 2.0 Fast Image-to-Video

Seedance 2.0 Fast Reference-to-Video

Wan-2.7 Text-to-video

Wan-2.7 Image-to-video

Wan-2.7 Reference-to-video

Wan-2.7 Video-edit

Veo 3.1 Lite Text-to-video

Veo 3.1 Lite Start-End Frame to Video

Veo 3.1 Lite Image-to-video

Vidu Q3-Mix Reference to Video

Join our Discord community