Create videos with consistent characters and environments using up to 7 reference images.
"Take @Image1 as start frame. Camera reveals @Element1 standing. Show @Element2 glowing. Keep style of @Image2"
Fill in the parameters below and click "Generate" to try this model
Use @Element1-7 for characters/objects, @Image1-7 for references. Describe camera movements. Max 7 total references
Reference images for style/appearance. Reference as @Image1, @Image2, etc. Max 7 total (elements + images)
Video duration in seconds
Video aspect ratio
Your inputs will be saved and ready after sign in
Generate video with audio from text using LTX-2 19B with custom LoRA support. Advanced text-to-video with style customization through LoRA weights
Create videos using multiple reference images for consistent subject appearance.
Bring your images to life with cinematic motion and animation.
Create videos up to 1080p from text descriptions in Chinese or English.
Create 5-second videos from text in 720p or 1080p with 7 aspect ratio options.
Generate video with audio from text using LTX-2 19B. Advanced text-to-video generation with multi-scale support and audio synthesis
Create premium talking avatar videos with higher quality than Standard.
Generate video with audio from images using LTX-2 19B. Advanced image-to-video generation with multi-scale support and audio synthesis
Apply creative effects to images and generate videos. 40+ effects including Kiss Me AI, Zombie Mode, Dragon Evoker, 3D Figurine, and more
Transforms up to 7 total images and elements into consistent, high-quality video scenes.
Ensures stable character identity, object details, and environmental coherence throughout the video.
Supports detailed prompts including camera movements and stylistic references for precise scene direction.
Accepts both frontal and multiple reference images for each character or object to maintain visual consistency.
Offers flexible video durations (5 or 10 seconds) and aspect ratios (16:9, 9:16, 1:1) for diverse creative needs.
Quick video generation, typically delivering results in 60-120 seconds depending on complexity.
Easy-to-use interface with intuitive reference tagging and element builder options.
Prototyping animated storyboards for film, animation, or advertising projects.
Creating branded marketing videos that maintain strict visual and stylistic consistency.
Generating dynamic social media content tailored to specific visual guidelines.
Visualizing game characters, environments, or assets in motion for concept development.
Producing educational or instructional videos with consistent characters and objects.
Rapidly iterating video concepts for client presentations and review.
Enhancing presentations and digital media with custom, AI-generated video scenes.
Professional designers, marketers, content creators, animators, and educators seeking consistent, high-quality AI-generated videos.
Prepare and upload up to 7 reference images and/or elements, ensuring each element has a clear frontal view.
Use the prompt field to describe your desired scene, referencing your images and elements (e.g., '@Image1', '@Element2') and including camera movement details.
Select your preferred video duration (5 or 10 seconds) and aspect ratio (16:9, 9:16, or 1:1) to match your intended use.
Double-check references and prompt for clarity and accuracy before submitting.
Submit your request and wait for the AI to generate your video, typically within 60-120 seconds.
Download and review the generated video, making adjustments to references or prompts as needed for further iterations.
You can use any clear, high-quality images of characters, objects, or backgrounds as references. For best results, ensure each element has a frontal view and, if possible, additional angles for greater consistency.
You can include up to seven references in total, which can be a mix of elements (characters/objects) and standalone images. This allows for detailed and customized scene creation.
Pricing varies by model and is based on a pay-as-you-go credit system. You only pay for the resources you use, allowing flexibility and scalability for different project sizes.
Kling O1 Reference to Video supports video durations of either 5 or 10 seconds, and you can choose from 16:9 (landscape), 9:16 (portrait), or 1:1 (square) aspect ratios to fit your needs.
Video generation typically takes between 60 to 120 seconds, depending on the complexity of your prompt and the number of references provided. More detailed scenes may require slightly longer processing times.
Hey! Need help? 👋
Click to chat with us