LIMITED OFFER New Year Sale: 50% OFF Best AI Tools
🎥 Video Generation

LongCat Single Avatar (Image + Audio)

Audio-driven avatar with custom image. Creates super-realistic, lip-synchronized videos with natural dynamics using your own portrait image

Example Output

Inputs

Input Image

Input Image
Image

Input Audio

Output

Generated

Try LongCat Single Avatar (Image + Audio)

Fill in the parameters below and click "Generate" to try this model

Image to animate

Audio file to drive the avatar

Text prompt to guide video generation

Negative prompt to avoid unwanted elements

Video resolution (480p=1 unit/sec, 720p=4 units/sec)

Video segments (1st=~5.8s, additional=5s each)

Number of inference steps

Text guidance scale for classifier-free guidance

Audio guidance scale (higher=exaggerated mouth)

Your inputs will be saved and ready after sign in

More Video Generation Models

Kling 1.6 Pro Elements

Turn up to 4 images into video clips with enhanced quality

MiniMax Hailuo 2.3 Pro Image to Video

Animate images into 1080p HD videos with professional-quality motion.

Pika v2.2 PikaScenes

Combine multiple images into a single 5-second video with creative or precise blending.

ByteDance Seedance v1 Lite Reference-to-Video

Generate videos with consistent characters using 1 to 4 reference images.

Kling AI Avatar v2 Standard

Sync any image with audio to create talking avatar videos with humans, animals, or cartoon characters.

Ovi Image-to-Video

Turn images into talking avatars with natural lip-sync and immersive audio from text prompts.

Vidu Q1 Image to Video

Turn images into 1080p videos with adjustable motion intensity.

Kling Video v2.6 Pro Image to Video

Animate images into cinematic videos with dialogue and sound effects.

VEED Fabric 1.0 Text

VEED Fabric 1.0 Text

Turn text and images into talking avatar videos with auto lip-sync and natural voice generation.