InfiniteTalk is an audio-driven conversational AI video generation model. With a single image and audio input, you can create dialogue or singing videos. Our pricing is 0.15per5secondsofvideogeneration(480p)or0.3 (720p), and we support video generation up to 10 minutes in length.
Request
Header Params
Body Params application/json
Example
{"image":"https://d1q70pf5vjeyhc.cloudfront.net/media/92ecf66930134a49a5a425b9def0c266/images/1759599101378698906_HKRNJFBy.jpeg","prompt":"Bright evenly lit laboratory room with metallic walls and soft white light reflections. \nA human man in a suit stands face-to-face with a humanoid robot, both in perfect focus. \nCamera: static medium close-up, centered framing, high exposure with clear details on both faces. \nMood: tense, thoughtful, futuristic. \n\n<S>We built you to understand us.<E> \n\nA Sign\n\n<S>But sometimes I wonder if you understand us too well.<E> \n\nThe robot tilts its head slightly, eyes glowing faint blue, voice calm and precise. \n\n<S>Understanding is not the same as becoming.<E> \n\n<AUDCAP>Soft ambient hum of electronics, faint mechanical servo sounds, two clear voices — human and synthetic, calm and steady<ENDAUDCAP>\n","seed":-1}
Request Code Samples
Shell
JavaScript
Java
Swift
Go
PHP
Python
HTTP
C
C#
Objective-C
Ruby
OCaml
Dart
R
Request Request Example
Shell
JavaScript
Java
Swift
curl--location--request POST 'https://api.302.ai/ws/api/v3/character-ai/ovi/image-to-video' \
--header'Authorization: Bearer ' \
--header'Content-Type: application/json' \
--data-raw'{
"image": "https://d1q70pf5vjeyhc.cloudfront.net/media/92ecf66930134a49a5a425b9def0c266/images/1759599101378698906_HKRNJFBy.jpeg",
"prompt": "Bright evenly lit laboratory room with metallic walls and soft white light reflections. \nA human man in a suit stands face-to-face with a humanoid robot, both in perfect focus. \nCamera: static medium close-up, centered framing, high exposure with clear details on both faces. \nMood: tense, thoughtful, futuristic. \n\n<S>We built you to understand us.<E> \n\nA Sign\n\n<S>But sometimes I wonder if you understand us too well.<E> \n\nThe robot tilts its head slightly, eyes glowing faint blue, voice calm and precise. \n\n<S>Understanding is not the same as becoming.<E> \n\n<AUDCAP>Soft ambient hum of electronics, faint mechanical servo sounds, two clear voices — human and synthetic, calm and steady<ENDAUDCAP>\n",
"seed": -1
}'