LongCat Avatar 1.5
vidutils Model

Audio-driven avatar video from a single photo with sharper lip sync, natural head and body motion, and strong identity preservation (up to 64s).

Overview

LongCat Avatar 1.5 is a vidutils model available on the GenVR platform. It generates an audio-driven avatar video from a single photo, with sharper lip sync, natural head and body motion, and strong identity preservation, for clips up to 64 seconds long.

Pricing

Billed through GenVR credits

20 credits per 5 seconds at 480p, 40 credits per 5 seconds at 720p (min 5s, max 64s)

Credits: 40
Approx. INR: ₹40.00
Approx. USD: $0.4200
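
The per-5-second rates above can be turned into a rough cost estimate. The sketch below is illustrative only: the function name `estimate_credits` is hypothetical, and it assumes that partial 5-second blocks are rounded up, that clips shorter than 5 seconds are billed as 5 seconds, and that anything beyond 64 seconds is trimmed rather than billed. None of these rounding rules is confirmed by the listing.

```python
import math

# Rates taken from the pricing line: credits per 5-second block.
CREDITS_PER_BLOCK = {"480p": 20, "720p": 40}
BLOCK_SECONDS = 5
MIN_SECONDS = 5    # stated minimum
MAX_SECONDS = 64   # stated maximum (audio is trimmed to this length)


def estimate_credits(duration_s: float, resolution: str = "720p") -> int:
    """Estimate GenVR credits for one clip.

    Assumption (not confirmed by the listing): partial 5-second
    blocks are rounded up to a whole block.
    """
    if resolution not in CREDITS_PER_BLOCK:
        raise ValueError(f"resolution must be one of {sorted(CREDITS_PER_BLOCK)}")
    if duration_s <= 0:
        raise ValueError("duration_s must be positive")
    # Clamp to the stated 5s..64s range before counting blocks.
    billable = min(max(duration_s, MIN_SECONDS), MAX_SECONDS)
    blocks = math.ceil(billable / BLOCK_SECONDS)
    return blocks * CREDITS_PER_BLOCK[resolution]


if __name__ == "__main__":
    for seconds, res in [(5, "720p"), (12, "720p"), (64, "480p")]:
        print(f"{seconds}s at {res}: {estimate_credits(seconds, res)} credits")
```

Under these assumptions, a 5-second clip at 720p comes to 40 credits, which matches the figure shown in the listing.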

Properties

Customizable parameters available for this model.

Required

image_url
string

Source portrait photo of the person to animate (clear face, front-facing works best)

audio_url
string

Voice or singing track that drives lip sync and performance (trimmed to 64s max)

Optional

prompt
string

Text prompt that guides expression, pose, style, or motion (e.g. "natural speaking with subtle head movement")

resolution
enum, default: 720p

Output resolution: 480p or 720p

seed
integer, default: -1

Random seed for reproducibility (-1 for random)
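
To show how the required and optional parameters fit together, here is a minimal sketch that assembles and validates a parameter set. The helper `build_params` is hypothetical and only mirrors the parameter names, allowed values, and defaults listed above. The actual endpoint, authentication, and request envelope are not described in this listing and are deliberately left out, and the example URLs are placeholders.

```python
import json
from typing import Any, Dict, Optional

ALLOWED_RESOLUTIONS = ("480p", "720p")


def build_params(
    image_url: str,
    audio_url: str,
    prompt: Optional[str] = None,
    resolution: str = "720p",
    seed: int = -1,
) -> Dict[str, Any]:
    """Assemble the documented parameters into a plain dict.

    Hypothetical helper: it checks only what the listing states
    (required fields, resolution enum, integer seed).
    """
    if not image_url or not audio_url:
        raise ValueError("image_url and audio_url are required")
    if resolution not in ALLOWED_RESOLUTIONS:
        raise ValueError(f"resolution must be one of {ALLOWED_RESOLUTIONS}")
    # bool is a subclass of int in Python, so exclude it explicitly.
    if not isinstance(seed, int) or isinstance(seed, bool):
        raise ValueError("seed must be an integer (-1 for random)")

    params: Dict[str, Any] = {
        "image_url": image_url,
        "audio_url": audio_url,
        "resolution": resolution,
        "seed": seed,
    }
    # prompt is optional, so include it only when provided.
    if prompt:
        params["prompt"] = prompt
    return params


if __name__ == "__main__":
    example = build_params(
        image_url="https://example.com/portrait.jpg",  # placeholder URL
        audio_url="https://example.com/voice.mp3",     # placeholder URL
        prompt="natural speaking with subtle head movement",
        resolution="480p",
        seed=1234,
    )
    print(json.dumps(example, indent=2))
```

Fixing `seed` to a non-negative integer, rather than leaving it at -1, is what the listing describes as the route to reproducible output.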

Model Info
Category: vidutils

GenVR Visual App

Experience the power of LongCat Avatar 1.5 through our intuitive visual interface. Experiment with prompts, adjust parameters in real-time, and download your results instantly.

Developer API Docs

Integrate this model into your own applications. Access enterprise-grade performance, scalable infrastructure, and detailed documentation for rapid deployment.
