vidu/q2-pro-fast/reference-to-video-with-audio

Vidu Q2-Pro-Fast Reference to Video with Audio is an AI model that converts reference images and a text description into video with direct audio output, offering fast processing, smooth visuals, and synchronized sound.


Each run costs $0.011. $10 covers approximately 909 runs.

Input Schema

The following parameters can be used in the request body.

Total: 0, Required: 0, Optional: 0

No parameters available.

Example request body

json
{
  "model": "vidu/q2-pro-fast/reference-to-video-with-audio"
}

Vidu Q2-Pro-Fast Reference-to-Video-with-Audio

Vidu Q2-Pro-Fast Reference-to-Video-with-Audio is an advanced AI video generation model that brings static images to life. Upload a reference image and describe the motion you want — the model generates high-quality video with smooth animation, optional audio, and cinematic quality up to 1080p.

Why Choose This?

  • Faster speed: Significantly reduced generation time compared to Q3-Pro.

  • Image-driven generation: Transform any image into dynamic video with natural motion.

  • High resolution output: Generate videos in 540p, 720p, or 1080p quality.

  • Flexible duration: Create videos from 1 to 16 seconds in length.

  • Audio generation: Optional synchronized audio and background music.

  • Motion control: Adjust movement amplitude for subtle or dynamic animations.

  • Prompt Enhancer: Built-in tool to automatically improve your motion descriptions.

Parameters

Parameter | Required | Description
prompt | Yes | Text description of the desired motion and action
subjects | Yes | Information about the subjects in the images
resolution | No | Output quality: 720p (default), 1080p
duration | No | Video length in seconds (1-10, default: 5)
generate_audio | No | Generate synchronized audio (default: enabled)
audio_type | No | Audio type, required when generate_audio is true
aspect_ratio | No | The aspect ratio of the output video
bgm | No | Add background music (default: enabled)
seed | No | Random seed for reproducibility
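
The parameter list above can be turned into a small request-body builder. The following is a minimal sketch in Python, not the provider's SDK: the function name `build_request_body` and the `max_duration` argument are hypothetical, the shape of `subjects` and the accepted values of `audio_type` and `aspect_ratio` are not documented on this page and are passed through unchecked, and the "model" key is taken from the example request body.

```python
from typing import Optional

MODEL_ID = "vidu/q2-pro-fast/reference-to-video-with-audio"
ALLOWED_RESOLUTIONS = ("720p", "1080p")  # values listed in the parameter table


def build_request_body(
    prompt: str,
    subjects: list,
    resolution: str = "720p",
    duration: int = 5,
    generate_audio: bool = True,
    audio_type: Optional[str] = None,
    aspect_ratio: Optional[str] = None,
    bgm: bool = True,
    seed: Optional[int] = None,
    max_duration: int = 10,  # table says 1-10; adjust if your limit differs
) -> dict:
    """Validate the documented constraints and return a JSON-ready dict."""
    if not prompt.strip():
        raise ValueError("prompt is required")
    if not subjects:
        raise ValueError("subjects is required")
    if resolution not in ALLOWED_RESOLUTIONS:
        raise ValueError(f"resolution must be one of {ALLOWED_RESOLUTIONS}")
    if not 1 <= duration <= max_duration:
        raise ValueError(f"duration must be between 1 and {max_duration}")
    if generate_audio and audio_type is None:
        raise ValueError("audio_type is required when generate_audio is true")

    body = {
        "model": MODEL_ID,
        "prompt": prompt,
        "subjects": subjects,  # structure assumed; passed through as given
        "resolution": resolution,
        "duration": duration,
        "generate_audio": generate_audio,
        "bgm": bgm,
    }
    # Only include optional fields the caller actually set.
    if audio_type is not None:
        body["audio_type"] = audio_type
    if aspect_ratio is not None:
        body["aspect_ratio"] = aspect_ratio
    if seed is not None:
        body["seed"] = seed
    return body
```

Validating locally before submitting avoids paying for a run that the service would reject, for example one that enables generate_audio without an audio_type.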

How to Use

  1. Upload your image — provide the reference image to animate.
  2. Write your prompt — describe the motion, camera movement, and action.
  3. Set resolution — higher resolution for better quality, lower for faster processing.
  4. Adjust duration — set video length up to 16 seconds.
  5. Configure audio (optional) — enable/disable audio generation and background music.
  6. Set motion intensity (optional) — control how dynamic the movement is.
  7. Run — submit and download your video.
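
For API users, the steps above roughly correspond to building one JSON POST request. The sketch below uses only the Python standard library and constructs the request without sending it. The endpoint URL, the Bearer authorization scheme, and the placeholder contents of `subjects` are assumptions for illustration; none of them are documented on this page.

```python
import json
import urllib.request

# Hypothetical endpoint: the page does not state the real API URL.
API_URL = "https://api.example.com/v1/generate"


def make_request(api_url: str, api_key: str, body: dict) -> urllib.request.Request:
    """Build (but do not send) a JSON POST request for one generation job."""
    return urllib.request.Request(
        api_url,
        data=json.dumps(body).encode("utf-8"),
        headers={
            "Content-Type": "application/json",
            "Authorization": f"Bearer {api_key}",  # assumed auth scheme
        },
        method="POST",
    )


body = {
    "model": "vidu/q2-pro-fast/reference-to-video-with-audio",
    "prompt": "The subject turns toward the camera while wind moves the trees",
    # Placeholder only: the real structure of `subjects` is not shown here.
    "subjects": [{"note": "replace with the documented subject structure"}],
    "resolution": "720p",
    "duration": 5,
    "generate_audio": False,
    "bgm": False,
}
request = make_request(API_URL, "YOUR_API_KEY", body)
# To actually submit the job: urllib.request.urlopen(request)
```

Keeping request construction separate from sending makes the payload easy to inspect or log before any cost is incurred.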

Pricing

Resolution | Cost per second | Extend Cost
720p | Starts at $0.0750, +$0.0125/sec | +$0.075 when audio is enabled
1080p | Starts at $0.2125, +$0.0250/sec | +$0.075 when audio is enabled
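
To make the pricing concrete, here is a rough estimator in Python. It rests on one reading of the table, which the page does not confirm: the "starts at" price ($0.0750 for 720p, $0.2125 for 1080p) covers the first second, each additional second adds the per-second rate ($0.0125 or $0.0250), and enabling audio adds a flat $0.075. The function name `estimate_cost` is hypothetical.

```python
from decimal import Decimal

# (starting price, price per additional second), as read from the table.
PRICING = {
    "720p": (Decimal("0.0750"), Decimal("0.0125")),
    "1080p": (Decimal("0.2125"), Decimal("0.0250")),
}
AUDIO_SURCHARGE = Decimal("0.075")  # assumed to be a flat fee per run


def estimate_cost(resolution: str, duration_s: int, audio: bool = True) -> Decimal:
    """Estimate the cost of one run in USD under the assumptions above."""
    if resolution not in PRICING:
        raise ValueError(f"unknown resolution: {resolution}")
    if duration_s < 1:
        raise ValueError("duration must be at least 1 second")
    base, per_second = PRICING[resolution]
    # Assumption: the starting price already includes the first second.
    cost = base + per_second * (duration_s - 1)
    if audio:
        cost += AUDIO_SURCHARGE
    return cost


print(estimate_cost("720p", 5, audio=False))  # 5-second 720p clip, no audio
print(estimate_cost("1080p", 5, audio=True))  # 5-second 1080p clip with audio
```

Decimal is used instead of float so that small per-second amounts add up without rounding drift. Treat the result as an estimate and check the billed amount for a real run.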

Best Use Cases

  • Photo Animation — Bring portraits, landscapes, and product images to life.
  • Social Media Content — Create engaging video content from static images.
  • Marketing & Ads — Generate dynamic promotional videos from product photos.
  • Storytelling — Animate illustrations and artwork for narratives.
  • Creative Projects — Explore motion concepts from reference images.

Pro Tips

  • Use the Prompt Enhancer to refine your motion descriptions.
  • Be specific about movement direction, speed, and camera angles.
  • Set movement_amplitude to "small" for subtle, cinematic motion or "large" for dramatic action.
  • Enable generate_audio for realistic sound effects matching the scene.
  • Use high-quality source images for better video results.
  • Describe environmental effects (wind, smoke, dust) for more immersive results.

Notes

  • Both prompt and subjects are required fields, per the parameter table.
  • Maximum video duration is 16 seconds.
  • Audio generation adds synchronized sound effects and ambient audio.
  • BGM adds background music appropriate to the scene mood.
  • Ensure uploaded image URLs are publicly accessible.

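Detailed Specifications

Overview:

Model provider: VIDU
Model type: image-to-video
Deployment: Inference API; Playground
Pricing: $0.011

Key parameters:

Size limit: maximum width × height (user-configurable)
LoRA support:
Seed options: N/A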
