vidu/q2-pro-fast/reference-to-video-with-audio

Vidu Q2-Pro-Fast Reference to Video with Audio is a cutting-edge AI model that seamlessly converts text descriptions into high-quality videos with direct audio output, offering fast processing, smooth visuals, and synchronized sound.

IMAGE-TO-VIDEONEW
首页
探索
Vidu Video Models
vidu/q2-pro-fast/reference-to-video-with-audio
图生视频
PRO

Vidu Q2-Pro-Fast Reference to Video with Audio is a cutting-edge AI model that seamlessly converts text descriptions into high-quality videos with direct audio output, offering fast processing, smooth visuals, and synchronized sound.

输入

正在加载参数配置...

输出

空闲
生成的视频将在这里显示
配置参数后点击运行开始生成

每次运行将花费 0.011。$10 可运行约 909 次。

你可以继续:

参数

Queue

集成

Input Schema

以下参数在请求体中被接受。

总计: 0必填: 0可选: 0

暂无可用参数。

请求体示例

json
{
  "model": "vidu/q2-pro-fast/reference-to-video-with-audio"
}

请登录以查看请求历史

您需要登录才能访问模型请求历史记录。

登录

Vidu Q2-Pro-Fast Reference-to-Video-with-Audio

Vidu Q2-Pro-Fast Reference-to-Video-with-Audio is an advanced AI video generation model that brings static images to life. Upload a reference image and describe the motion you want — the model generates high-quality video with smooth animation, optional audio, and cinematic quality up to 1080p.

Why Choose This?

  • Faster speed Significantly reduced generation time compared to Q3-Pro.

  • Image-driven generation Transform any image into dynamic video with natural motion.

  • High resolution output Generate videos in 540p, 720p, or 1080p quality.

  • Flexible duration Create videos from 1 to 16 seconds in length.

  • Audio generation Optional synchronized audio and background music.

  • Motion control Adjust movement amplitude for subtle or dynamic animations.

  • Prompt Enhancer Built-in tool to automatically improve your motion descriptions.

Parameters

ParameterRequiredDescription
promptYesText description of the desired motion and action
subjectsYesInformation about the subjects in the images
resolutionNoOutput quality: 720p (default), 1080p
durationNoVideo length in seconds (1-10, default: 5)
generate_audioNoGenerate synchronized audio (default: enabled)
audio_typeNoAudio type, required when generate_audio is true.
aspect_ratioNoThe aspect ratio of the output video
bgmNoAdd background music (default: enabled)
seedNoRandom seed for reproducibility

How to Use

  1. Upload your image — provide the reference image to animate.
  2. Write your prompt — describe the motion, camera movement, and action.
  3. Set resolution — higher resolution for better quality, lower for faster processing.
  4. Adjust duration — set video length up to 16 seconds.
  5. Configure audio (optional) — enable/disable audio generation and background music.
  6. Set motion intensity (optional) — control how dynamic the movement is.
  7. Run — submit and download your video.

Pricing

ResolutionCost per secondExtend Cost
720pStarts at 0.0750,+0.0750, +0.0125/sec+$0.075 when open audio
1080pStarts at 0.2125,+0.2125, +0.0250/sec+$0.075 when open audio

Best Use Cases

  • Photo Animation — Bring portraits, landscapes, and product images to life.
  • Social Media Content — Create engaging video content from static images.
  • Marketing & Ads — Generate dynamic promotional videos from product photos.
  • Storytelling — Animate illustrations and artwork for narratives.
  • Creative Projects — Explore motion concepts from reference images.

Pro Tips

  • Use the Prompt Enhancer to refine your motion descriptions.
  • Be specific about movement direction, speed, and camera angles.
  • Set movement_amplitude to "small" for subtle, cinematic motion or "large" for dramatic action.
  • Enable generate_audio for realistic sound effects matching the scene.
  • Use high-quality source images for better video results.
  • Describe environmental effects (wind, smoke, dust) for more immersive results.

Notes

  • Both prompt and image are required fields.
  • Maximum video duration is 16 seconds.
  • Audio generation adds synchronized sound effects and ambient audio.
  • BGM adds background music appropriate to the scene mood.
  • Ensure uploaded image URLs are publicly accessible.

详细规格

概览:

模型提供商:VIDU
模型类型:image-to-video
部署方式:推理 API;Playground
定价:$0.011

关键参数:

尺寸上限:最大宽度 × 高度(用户可配置)
LoRA 支持:
种子选项:N/A

创作你的下一件杰作

300+ 模型,即刻开启,

探索全部模型