LongCat Avatar 1.5
Audio-driven avatar video from a single photo with sharper lip sync, natural head and body motion, and strong identity preservation (up to 64s).
Overview
LongCat Avatar 1.5 is a vidutils model available on the GenVR platform. Audio-driven avatar video from a single photo with sharper lip sync, natural head and body motion, and strong identity preservation (up to 64s).
Pricing
Billed through GenVR credits
20 credits per 5 seconds at 480p, 40 credits per 5 seconds at 720p (min 5s, max 64s)
Properties
Customizable parameters available for this model.
Required
Source portrait photo of the person to animate (clear face, front-facing works best)
Voice or singing track that drives lip sync and performance (trimmed to 64s max)
Optional
Guide expression, pose, style, or motion (e.g. natural speaking with subtle head movement)
Output resolution: 480p or 720p
Random seed for reproducibility (-1 for random)
GenVR Visual App
Experience the power of LongCat Avatar 1.5 through our intuitive visual interface. Experiment with prompts, adjust parameters in real-time, and download your results instantly.
Launch AppDeveloper API Docs
Integrate this model into your own applications. Access enterprise-grade performance, scalable infrastructure, and detailed documentation for rapid deployment.
Explore APIMore in vidutils
Discover other high-performance models in the same category as LongCat Avatar 1.5.