vmodel/talking-photo-turbo
API to convert photos into realistic talking avatars in seconds.
Output: $0.016 / second or 62 seconds / $1
Input
avatar * image
Image url address
input image
speech * audio
Audio url address
Audio File
disable_safety_checker boolean
Note: The website version of this model always runs with safety checks enabled. For details,see VModel's platform safety guidelines..
Disable safety checker for generated images
Reset
Output
{
  "task_id": "d9oo2z1s89lobg8oz5",
  "user_id": 1,
  "version": "11fee5368eda61d569f53f1b24ce1c53b06c867157cd833e9a0a97b66096f974",
  "error": null,
  "total_time": 37,
  "predict_time": 37,
  "logs": null,
  "output": [
    "https://vmodel.ai/data/model/vmodel/talking-photo-turbo/result.mp4"
  ],
  "status": "succeeded",
  "create_at": 1746492954,
  "completed_at": 1746493015,
  "input": {
    "avatar": "https://vmodel.ai/data/model/vmodel/talking-photo-turbo/demo.png",
    "speech": "https://vmodel.ai/data/model/vmodel/talking-photo-turbo/examples_wav_talk_male_law_10s.wav",
    "disable_safety_checker": false
  }
}
Generated in: 37 seconds
Download
Examples
Pricing
Model pricing for vmodel/talking-photo-turbo. Looking for volume pricing? Get in touch.
When
āš™ using this model
$0.016
per second of input audio
or 62 seconds for $1
Readme

Loading...