
Vidu Q1 Start-End-to-Video API by Vidu
Vidu Q1 Start-End-to-Video is an AI video generation model that creates a smooth video transition between two images. Provide a start frame, an end frame, and a prompt describing the transition; the model generates a coherent 5-second 1080p video that bridges the two frames with fluid motion and optional audio.
Why Choose This?
- Frame-to-frame control: Define exactly where the video begins and ends with two reference images.
- Smooth transitions: Generate natural, coherent motion that connects the start and end frames.
- Fast generation: Optimized for quick turnaround with minimal wait time.
- 1080p output: Generate videos in full 1080p high definition.
- 5-second videos: Produces fixed-length 5-second videos ready to share.
- Audio generation: Optional synchronized audio and background music.
- Prompt Enhancer: Built-in tool to automatically improve your transition descriptions.
Parameters
| Parameter | Required | Description |
|---|---|---|
| prompt | Yes | Text description of the transition, motion, and action between frames |
| start_image | Yes | The first frame image (URL or upload) |
| end_image | Yes | The last frame image (URL or upload) |
| resolution | No | Output quality: 1080p |
| duration | No | Fixed video length of 5 seconds |
| generate_audio | No | Generate synchronized audio (default: enabled) |
| bgm | No | Add background music (default: enabled) |
| seed | No | Random seed for reproducibility |
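The table above can be turned into a small payload builder. This is a minimal sketch, assuming the API accepts a JSON object keyed by the parameter names listed here; the helper name `build_payload`, the value encodings for `resolution` and `duration`, and the example URLs are illustrative assumptions, not part of the documented API.

```python
from typing import Any, Optional


def build_payload(
    prompt: str,
    start_image: str,
    end_image: str,
    *,
    generate_audio: bool = True,  # documented default: enabled
    bgm: bool = True,             # documented default: enabled
    seed: Optional[int] = None,
) -> dict[str, Any]:
    """Assemble a request body from the documented parameters (hypothetical helper)."""
    required = {"prompt": prompt, "start_image": start_image, "end_image": end_image}
    # All three required fields must be non-empty strings.
    missing = [name for name, value in required.items() if not value or not value.strip()]
    if missing:
        raise ValueError(f"missing required field(s): {', '.join(missing)}")

    payload: dict[str, Any] = {
        **required,
        "resolution": "1080p",  # the only documented value
        "duration": 5,          # fixed length; the numeric encoding is an assumption
        "generate_audio": generate_audio,
        "bgm": bgm,
    }
    if seed is not None:
        payload["seed"] = seed  # only sent when reproducibility is wanted
    return payload


example = build_payload(
    "The closed bud slowly opens into a full bloom",
    "https://example.com/start.png",
    "https://example.com/end.png",
    seed=7,
)
```

Leaving `seed` out of the payload unless it is set keeps the request close to the service defaults; passing the same seed with the same inputs is how the table suggests reproducing a result.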
How to Use
- Upload your start image — provide the image representing the first frame of the video.
- Upload your end image — provide the image representing the last frame of the video.
- Write your prompt — describe the transition, motion, and how the scene evolves between frames.
- Configure audio (optional) — enable/disable audio generation and background music.
- Run — submit and download your video.
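For programmatic use, the same steps could be expressed as an HTTP request. The sketch below only prepares the request and does not send it. The endpoint URL, the bearer-token header, and the API key placeholder are all hypothetical; the real values must come from your provider's API reference.

```python
import json
import urllib.request

# Hypothetical endpoint and key: neither appears in this page.
API_URL = "https://api.example.com/v1/vidu-q1/start-end-to-video"
API_KEY = "YOUR_API_KEY"


def make_request(payload: dict) -> urllib.request.Request:
    """Prepare (but do not send) a JSON POST request for one generation job."""
    return urllib.request.Request(
        API_URL,
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "Content-Type": "application/json",
            "Authorization": f"Bearer {API_KEY}",  # auth scheme is an assumption
        },
        method="POST",
    )


request = make_request(
    {
        "prompt": "Winter landscape gradually thaws into spring",
        "start_image": "https://example.com/winter.png",
        "end_image": "https://example.com/spring.png",
        "generate_audio": True,
        "bgm": False,
    }
)
# Passing `request` to urllib.request.urlopen would submit the job.
```

How the finished video is returned (directly in the response, or through a separate result lookup) is not described here, so that step is left out of the sketch.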
Pricing
| Resolution | Cost |
|---|---|
| 1080p | $0.40 |
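For budgeting a batch, the listed price can be multiplied out. This is a rough sketch that assumes the price applies per generated video, which the table does not state explicitly; `estimate_cost` is a hypothetical helper.

```python
from decimal import Decimal

# 1080p price from the table; per-video billing is an assumption.
PRICE_PER_VIDEO = Decimal("0.40")


def estimate_cost(video_count: int) -> Decimal:
    """Return the estimated cost in USD for a number of generations."""
    if video_count < 0:
        raise ValueError("video_count must be non-negative")
    return PRICE_PER_VIDEO * video_count


batch_cost = estimate_cost(25)
```

`Decimal` is used instead of `float` so that currency totals stay exact.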
Best Use Cases
- Before & After Videos — Showcase transformations such as renovations, makeovers, or seasonal changes.
- Scene Transitions — Create smooth visual transitions between two scenes for film or storytelling.
- Product Reveals — Animate the transformation from a product packshot to a lifestyle shot.
- Visual Morphing — Generate morphing effects between two related subjects or compositions.
- Social Media Content — Produce eye-catching transition clips for Reels, TikTok, and Stories.
Pro Tips
- Use the Prompt Enhancer to refine your transition descriptions.
- Ensure start and end images share a compatible composition or subject for the most natural transition.
- Be specific in your prompt about how the motion or transformation should unfold between the two frames.
- Keep subject framing consistent between start and end images for smoother results.
- Enable generate_audio for synchronized sound effects that complement the transition.
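One simple pre-flight check for the framing tips above is to compare the aspect ratios of the two images before uploading. The sketch below is a heuristic only: matching aspect ratios are not a documented requirement of the model, and the function name and the 2% tolerance are arbitrary choices for illustration. Image sizes are passed in as `(width, height)` tuples, however you obtain them.

```python
def aspect_ratios_match(
    start_size: tuple[int, int],
    end_size: tuple[int, int],
    tolerance: float = 0.02,
) -> bool:
    """Return True when two (width, height) pairs have nearly the same aspect ratio."""
    start_w, start_h = start_size
    end_w, end_h = end_size
    if min(start_w, start_h, end_w, end_h) <= 0:
        raise ValueError("image dimensions must be positive")
    start_ratio = start_w / start_h
    end_ratio = end_w / end_h
    # Relative difference, so the check behaves the same for wide and tall images.
    return abs(start_ratio - end_ratio) / start_ratio <= tolerance


same_shape = aspect_ratios_match((1920, 1080), (1280, 720))
rotated = aspect_ratios_match((1920, 1080), (1080, 1920))
```

A landscape start frame paired with a portrait end frame fails this check, which is a cheap signal that the subject framing probably differs too much for a natural transition.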
Notes
- All three fields — prompt, start_image, and end_image — are required.
- Video duration is fixed at 5 seconds.
- The model interpolates motion and scene content between the two provided frames.
- Audio generation adds synchronized sound effects and ambient audio.
- BGM adds background music appropriate to the scene mood.
- Ensure uploaded image URLs are publicly accessible.
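The last note can be partly checked offline before submitting. The following sketch rejects URLs that clearly cannot be fetched from outside your machine (non-HTTP schemes, `localhost`, private or loopback IP addresses). It is a heuristic with a hypothetical function name: it makes no network call, so a URL that passes may still require authentication or be otherwise unreachable.

```python
import ipaddress
from urllib.parse import urlparse


def looks_publicly_reachable(url: str) -> bool:
    """Heuristic check that a URL is not obviously local or private."""
    parsed = urlparse(url)
    if parsed.scheme not in ("http", "https") or not parsed.hostname:
        return False
    host = parsed.hostname
    if host == "localhost" or host.endswith(".local"):
        return False
    try:
        address = ipaddress.ip_address(host)
    except ValueError:
        # A DNS name: nothing more can be judged without a network call.
        return True
    # Literal IPs must be globally routable (excludes private and loopback ranges).
    return address.is_global


checks = {
    url: looks_publicly_reachable(url)
    for url in (
        "https://example.com/start.png",
        "http://localhost:8000/start.png",
        "http://192.168.1.5/start.png",
        "file:///tmp/start.png",
    )
}
```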
Related Models
- Vidu Q1 Image-to-Video — Animate a single reference image with text-guided motion.
- Vidu Q1 Reference-to-Video — Generate video using one or more reference images as visual anchors.
