
PixVerse C1 Reference-to-Video API by PIXVERSE
PixVerse C1 Reference-to-Video model. High-quality video generation from image prompts.
PixVerse C1 Reference-to-Video
PixVerse C1 Reference-to-Video enables subject-consistent video generation by accepting up to 7 image references. Specify reference images as subjects or backgrounds, and the model generates a video that faithfully preserves the visual identity of your referenced elements.
Why Choose This?
- Subject consistency: Maintain character and object identity across the generated video.
- Multiple references: Use up to 7 image references for rich, multi-subject scenes.
- Reference typing: Tag each reference as a "subject" or "background" for precise control.
- High resolution output: Generate videos in 360p, 540p, 720p, or 1080p quality.
- Flexible duration: Create videos from 1 to 15 seconds in length.
- Audio generation: Optional synchronized audio that matches your scene.
Parameters
| Parameter | Required | Description |
|---|---|---|
| prompt | Yes | Text description of the desired scene and motion |
| images | Yes | Array of image reference objects (1–7 items) |
| images[].image | Yes | Reference image (public URL or Base64, max 20MB) |
| images[].type | No | Reference type: "subject" or "background" (v5.5+) |
| images[].ref_name | No | Name for this reference, used in prompt mentions (max 30 bytes, v5.5+) |
| model | Yes | Model name (default: c1) |
| duration | No | Video length in seconds (1-15, default: 5) |
| quality | No | Output resolution: 360p, 540p, 720p (default), 1080p |
| aspect_ratio | No | Video aspect ratio (default: 16:9) |
| sound | No | Generate synchronized audio (default: enabled) |
| seed | No | Random seed for reproducibility |
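
The table above can be turned into a small payload builder. This is a minimal sketch, not the official client: the function name `build_payload` is hypothetical, and it assumes that `sound` is a boolean and `duration` is a whole number of seconds, neither of which the table states explicitly. Only the field names, defaults, and ranges come from the table.

```python
ALLOWED_QUALITIES = {"360p", "540p", "720p", "1080p"}
ALLOWED_TYPES = {"subject", "background"}


def build_payload(prompt, images, model="c1", duration=5, quality="720p",
                  aspect_ratio="16:9", sound=True, seed=None):
    """Assemble a request body from the documented parameters.

    Raises ValueError when a value falls outside a documented range.
    """
    if not prompt:
        raise ValueError("prompt is required")
    if not 1 <= len(images) <= 7:
        raise ValueError("images must contain 1 to 7 items")
    for ref in images:
        if "image" not in ref:
            raise ValueError("each reference needs an 'image' value")
        if "type" in ref and ref["type"] not in ALLOWED_TYPES:
            raise ValueError(f"unknown reference type: {ref['type']!r}")
    if not 1 <= duration <= 15:
        raise ValueError("duration must be between 1 and 15 seconds")
    if quality not in ALLOWED_QUALITIES:
        raise ValueError(f"unsupported quality: {quality!r}")

    payload = {
        "prompt": prompt,
        "images": images,
        "model": model,
        "duration": duration,
        "quality": quality,
        "aspect_ratio": aspect_ratio,
        "sound": sound,  # assumed to be a boolean flag
    }
    if seed is not None:  # seed is optional, so omit it unless provided
        payload["seed"] = seed
    return payload
```

Checking ranges before submission is a design choice of this sketch; the service may apply its own validation and error messages.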
How to Use
- Prepare your reference images — gather the subject and/or background images you want to use.
- Write your prompt — describe the scene and motion; mention subjects by `ref_name` if assigned.
- Build the images array — provide each image with its type and optional `ref_name`.
- Set quality — choose resolution based on your quality and speed requirements.
- Adjust duration — set video length up to 15 seconds.
- Configure audio (optional) — enable or disable synchronized audio generation.
- Run — submit and download your video.
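
The steps above might look like this when assembled into a request body. It is an illustrative sketch only: the image URLs are placeholders, the way `ref_name` values are written into the prompt (as plain words) is an assumption because the mention syntax is not shown here, and `sound` is assumed to be a boolean. The submission call itself is left out, since the endpoint and authentication are not described in this section.

```python
import json

# Steps 1 and 3: reference images with a type and an optional ref_name.
# The URLs are placeholders, not real assets.
images = [
    {"image": "https://example.com/hero.png", "type": "subject", "ref_name": "hero"},
    {"image": "https://example.com/street.jpg", "type": "background", "ref_name": "street"},
]

# Step 2: describe the scene and motion, mentioning references by ref_name.
prompt = "hero walks down street at dusk, camera tracking from behind"

# Steps 4 to 6: quality, duration, and audio.
request_body = {
    "model": "c1",
    "prompt": prompt,
    "images": images,
    "quality": "720p",
    "duration": 8,
    "sound": False,  # assumed boolean; disables synchronized audio
}

# Step 7: this serialized body is what would be submitted.
body_json = json.dumps(request_body, indent=2)
print(body_json)
```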
Best Use Cases
- Character Animation — Generate videos featuring specific people or characters with visual consistency.
- Product Videos — Keep product appearance consistent throughout the generated video.
- Brand Content — Maintain brand identity elements across video generations.
- Multi-character Scenes — Include multiple distinct subjects in a single video.
- Custom Backgrounds — Fix a specific environment as the video backdrop.
Pro Tips
- Assign `ref_name` to each reference and mention them in your prompt for better accuracy.
- Use `type: "subject"` for characters and objects, `type: "background"` for environments.
- Use clean, high-quality images with clear subjects for the best consistency results.
- Limit references to the most essential subjects — too many may reduce quality.
- Ensure image URLs are publicly accessible; base64 is supported for private assets.
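
Two of these tips lend themselves to small helpers, sketched below under stated assumptions. Both function names are hypothetical. `encode_image_bytes` produces a plain Base64 string; whether the service expects a raw string or a data-URI prefix is not stated here, so treat that as an open question. `unmentioned_refs` uses a simple substring check, which assumes references are mentioned in the prompt by their bare name.

```python
import base64


def encode_image_bytes(data: bytes) -> str:
    """Return raw image bytes as a Base64 string for the `image` field."""
    return base64.b64encode(data).decode("ascii")


def unmentioned_refs(prompt, images):
    """List ref_name values that never appear in the prompt text."""
    names = [ref["ref_name"] for ref in images if ref.get("ref_name")]
    return [name for name in names if name not in prompt]


images = [
    # Placeholder bytes stand in for a private image read from disk.
    {"image": encode_image_bytes(b"placeholder bytes"), "type": "subject", "ref_name": "robot"},
    {"image": "https://example.com/lab.png", "type": "background", "ref_name": "lab"},
]
print(unmentioned_refs("robot waves at the camera", images))  # ['lab']
```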
Pricing
| Quality | Billing Standard | Without Audio | With Audio |
|---|---|---|---|
| 360p | per second | $0.030 | $0.040 |
| 540p | per second | $0.040 | $0.050 |
| 720p | per second | $0.050 | $0.065 |
| 1080p | per second | $0.095 | $0.120 |
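
As a worked example of the table, the sketch below multiplies the per-second rate by the duration. The function name `estimate_cost` is hypothetical, and it assumes billing is exactly rate times seconds with no rounding, minimum charge, or other fees, which this section does not confirm.

```python
from decimal import Decimal

# Per-second rates in USD from the pricing table: (without audio, with audio).
RATES = {
    "360p": (Decimal("0.030"), Decimal("0.040")),
    "540p": (Decimal("0.040"), Decimal("0.050")),
    "720p": (Decimal("0.050"), Decimal("0.065")),
    "1080p": (Decimal("0.095"), Decimal("0.120")),
}


def estimate_cost(quality: str, duration: int, sound: bool = True) -> Decimal:
    """Estimate the cost in USD as per-second rate times duration."""
    without_audio, with_audio = RATES[quality]
    rate = with_audio if sound else without_audio
    return rate * duration


print(estimate_cost("720p", 5))           # 0.325
print(estimate_cost("1080p", 15, False))  # 1.425
```

`Decimal` is used instead of floats so that the three-decimal rates multiply without binary rounding error.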
Notes
- `prompt` and `images` are required fields.
- Up to 7 image references supported for C1 and v5.5+ models.
- `ref_name` max length is 30 bytes (UTF-8).
- Supported image formats: PNG, JPEG, JPG, WebP.
- Image aspect ratio should be between 1:2.5 and 2.5:1; minimum dimension 300px.
- Maximum video duration is 15 seconds.
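
The image and naming limits above can be checked locally before a request is sent. This is a rough sketch with a hypothetical function name, `check_reference`. It assumes the format can be judged from the file extension and that "minimum dimension" refers to the shorter side of the image; both are interpretations rather than documented behavior. Width and height are passed in so that no image library is required.

```python
import os

SUPPORTED_EXTENSIONS = {".png", ".jpeg", ".jpg", ".webp"}


def check_reference(filename, width, height, ref_name=None):
    """Return a list of problems found for one reference image."""
    problems = []
    extension = os.path.splitext(filename)[1].lower()
    if extension not in SUPPORTED_EXTENSIONS:
        problems.append(f"unsupported format: {extension or 'none'}")
    if min(width, height) < 300:  # assumes the limit applies to the shorter side
        problems.append("shorter side is under 300px")
    if not 1 / 2.5 <= width / height <= 2.5:
        problems.append("aspect ratio is outside 1:2.5 to 2.5:1")
    # The limit is in bytes, so multi-byte characters count more than once.
    if ref_name is not None and len(ref_name.encode("utf-8")) > 30:
        problems.append("ref_name exceeds 30 bytes in UTF-8")
    return problems


print(check_reference("hero.png", 1024, 768, "hero"))  # []
print(len(check_reference("wide.gif", 3000, 1000)))    # 2
```

Note that a 12-character name made of 3-byte characters is 36 bytes in UTF-8 and would fail the check even though it is under 30 characters.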
Related Models
- PixVerse C1 Text-to-Video — Generate video from text without reference images.
- PixVerse C1 Image-to-Video — Animate a single reference image.
