Vidu Q1 Reference to Video

Create videos with consistent characters using up to 7 reference images

"A young woman and a monkey inside a colorful house"

Image 1

Image 1
1

Image 2

Image 2
2

Image 3

Image 3
3

Generated Result

Generated

Describe your scene and generate a video in seconds

8,500+ videos generated this month

📄 About Vidu Q1 Reference to Video
Key Features
Generates video clips with consistent subject appearance using up to 7 reference images.
Supports detailed text prompts (up to 1,500 characters) for customized video generation.
Offers three aspect ratios: landscape (16:9), portrait (9:16), and square (1:1) for various platforms.
Adjustable movement amplitude lets you control the amount of motion in your video.
Optional background music integration enhances the mood and engagement of your videos.
Fast generation of high-quality 1080p videos, typically within 80–120 seconds.
Random seed option enables reproducible results for iterative creative processes.
💡 Use Cases
Creating consistent animated character videos for social media campaigns.
Developing branded promotional clips with specific subject likeness.
Producing explainer or storytelling videos that require subject fidelity.
Generating video content for advertising with tailored backgrounds and music.
Visualizing concepts or storyboards for film and animation projects.
Crafting personalized video messages or greetings with recognizable characters.
Rapid prototyping of video ideas with iterative design using reference images.
🎯 Best For
🎯 Professional designers, marketers, content creators, animators, and anyone needing consistent character videos.
👍 Pros
Maintains subject consistency throughout the video using multiple reference images.
Highly customizable with flexible prompts, aspect ratios, and movement controls.
Quick turnaround for high-resolution video generation.
Optional background music for enhanced audience engagement.
Suitable for a wide range of creative and commercial applications.
⚠️ Considerations
Requires at least one reference image and accepts a maximum of seven.
Video duration and complexity may be limited by generation time.
Customization may be constrained by prompt and reference image quality.
Background music options may be limited compared to dedicated audio tools.
📚 How to Use Vidu Q1 Reference to Video
1
Prepare 1 to 7 high-quality reference images of your subject and upload them.
2
Enter a detailed text prompt (up to 1,500 characters) describing the desired video scene.
3
Select your preferred aspect ratio: 16:9 (landscape), 9:16 (portrait), or 1:1 (square).
4
Choose the movement amplitude (auto, small, medium, large) to control animation style.
5
Optionally, enable background music to add audio to your video.
6
Submit your request and download the generated video clip once processing is complete.
💡 Pro Tips for Vidu Q1 Reference to Video
Upload Multiple Angles for Best Consistency Vidu Q1 supports up to 7 reference images, and using at least 3-5 photos from different angles dramatically improves character fidelity. Include front-facing, side profile, and three-quarter views with consistent lighting. This helps the model understand facial structure and body proportions, reducing morphing artifacts. If you only have one photo, consider Vidu Reference to Video for simpler workflows.
Balance Prompt Detail with Movement Amplitude When writing your prompt, match the level of action described to your chosen movement amplitude setting. For subtle scenes like a character reading or standing still, select 'small' amplitude. For dynamic actions like running or dancing, use 'medium' or 'large'. The 'auto' setting works well for general use, but manual control prevents the AI from adding unwanted motion or restricting intentional action sequences.
Test Aspect Ratios for Platform Optimization Vidu Q1 offers 16:9, 9:16, and 1:1 aspect ratios. Choose 9:16 for Instagram Reels, TikTok, and YouTube Shorts; 16:9 for YouTube standard videos and presentations; and 1:1 for Instagram feed posts. If you need faster generation for vertical content, compare with Seedance 2.0 Fast Reference to Video, which prioritizes speed over extended duration.
Use Seed Values for Iterative Refinement When experimenting with prompts or movement settings, save the seed value from a generation you like. Reusing the same seed with slight prompt adjustments lets you iterate on a concept while maintaining the core visual style and motion patterns. This is especially useful for client revisions or A/B testing different narrative angles without starting from scratch each time.
Leverage Background Music for Social Engagement Enabling the BGM option adds a professionally mixed audio layer that significantly boosts viewer retention on social platforms. The background music is algorithmically matched to the pacing and mood of your video. For projects requiring custom audio, generate the video without BGM and add your own soundtrack in post-production. Compare with Kling O1 Reference to Video if you need more advanced audio control.
Match Reference Image Quality to Output Expectations Vidu Q1 generates 1080p video, so reference images should be at least 1080px on the shortest side for optimal detail retention. Avoid low-resolution, heavily compressed, or blurry photos. Consistent lighting across all reference images is critical—mixing indoor and outdoor shots with different color temperatures can confuse the model and reduce character consistency throughout the final video.
Frequently Asked Questions
You can upload between one and seven reference images. Using multiple images helps the model maintain subject consistency from different angles or poses.
Yes, you can select from three aspect ratios—16:9 (landscape), 9:16 (portrait), and 1:1 (square)—to match your intended use or platform requirements.
Absolutely! Vidu Q1 Reference to Video allows you to add optional background music, making your videos more engaging and professional.
Most videos are generated within approximately 80 to 120 seconds, depending on the complexity of your prompt and reference images.
Pricing varies by model and is based on a pay-as-you-go credit system, allowing you to pay only for the resources you use.
Vidu Q1 operates on JAI Portal's pay-as-you-go credit system, with pricing determined by video resolution, duration, and the number of reference images processed. Typically, generating a 1080p video with 5-7 reference images costs more credits than simpler models like Seedance 2.0 Fast Reference to Video, which prioritizes speed over multi-image consistency. However, Vidu Q1's superior character fidelity often justifies the cost for professional projects requiring brand consistency or complex character work. You only pay for successful generations, and you can preview credit costs before submitting. Check the model's pricing page or compare side-by-side with Wan v2.6 Reference-to-Video to find the best value for your workflow.
Yes, all videos generated with paid credits on JAI Portal come with full commercial-use rights, including Vidu Q1 Reference to Video outputs. You can use the videos in advertising campaigns, social media content, client deliverables, product demos, and monetized YouTube channels without additional licensing fees. This applies whether you're a freelancer, agency, or in-house creator. The commercial license covers the AI-generated content, but you remain responsible for ensuring your reference images and prompts don't infringe on third-party copyrights or trademarks. For example, avoid uploading celebrity photos or copyrighted characters unless you have explicit permission. Free trial generations may have usage restrictions, so always use paid credits for commercial work.
Vidu Q1 Reference to Video generates videos at 1080p resolution (1920×1080 for 16:9, 1080×1920 for 9:16, and 1080×1080 for 1:1) in MP4 format with H.264 encoding, which is compatible with all major editing software and social platforms. The typical output duration is 4-8 seconds depending on prompt complexity and movement amplitude settings. MP4 is the standard delivery format, optimized for fast streaming and broad compatibility. If you need higher resolutions or alternative codecs, you can upscale or transcode the MP4 using external tools. For projects requiring 4K output, consider models like Google Veo 3.1 Reference-to-Video, though generation times and credit costs will increase accordingly.
Currently, Vidu Q1 Reference to Video processes one video generation per request through the JAI Portal web interface. However, you can manually queue multiple generations by submitting separate requests with the same reference images and different prompts or settings. For users needing high-volume or automated workflows, JAI Portal offers API access that allows programmatic batch submissions, enabling you to generate dozens of variations efficiently. The API supports the same parameters as the web interface, including reference image uploads, prompts, aspect ratios, and movement amplitude. Contact JAI Portal support to enable API access and review documentation. For faster iteration on similar concepts, save your reference images locally and reuse them across sessions, adjusting only the prompt or seed value each time.
Inconsistent character appearance usually stems from low-quality or mismatched reference images. First, ensure all reference photos show the same person or subject under similar lighting conditions—avoid mixing indoor/outdoor shots or photos taken years apart. Upload at least 3-5 images from multiple angles (front, side, three-quarter) at 1080px minimum resolution. Second, review your text prompt for conflicting descriptions; avoid phrases like 'changing appearance' or 'transforming into' unless intentional. Third, try reducing movement amplitude to 'small' or 'medium' to minimize frame-to-frame variation. If issues persist, test with Kling O1 Reference to Video or Seedance 2.0 Reference to Video, which may handle certain face types or lighting conditions differently. Finally, use the seed parameter to lock in successful generations and iterate from there.
⚖️ How Vidu Q1 Reference to Video Compares
Vidu Q1 Reference to Video stands out among JAI Portal's reference-to-video models for its ability to process up to 7 reference images simultaneously, making it ideal for projects requiring exceptional character consistency across complex scenes. Compared to Vidu Reference to Video, the Q1 version offers enhanced multi-image processing and more granular movement amplitude controls, though both share similar generation speeds of 80-120 seconds. For users prioritizing speed over multi-angle consistency, Seedance 2.0 Fast Reference to Video delivers faster turnaround but accepts fewer reference images. If you need longer video durations or higher resolution outputs, Google Veo 3.1 Reference-to-Video supports 4K generation, though at higher credit costs and slower processing. Vidu Q1 excels in the sweet spot of quality, consistency, and reasonable generation time, making it the go-to choice for social media creators, marketers, and animators who need reliable character fidelity without sacrificing creative flexibility. The optional background music feature also sets it apart for quick, polished social content. Choose Vidu Q1 when your project demands recognizable, consistent subjects across multiple frames and you're willing to invest slightly more credits for superior output. Explore side-by-side comparisons on JAI Portal's model comparison tool or start experimenting with a free trial at jaiportal.com/auth/signup.

More Video Generation Models