Generate talking-head avatar videos from one image with script or uploaded audio.
Inference
6-12s
Per run
$0.025/s
Output
720p / 1080p
Typical API runs; actual latency and pricing may vary.
Image (required)
Audio (optional)