Use our own optimized whisper model to transcribe audio into word-level text dataPrice for voice separation only: 0.001PTC/min Price for separation + transcription + alignment: 0.003PTC/min
Request
Header Params
Authorization
string
optional
Example:
Bearer {{YOUR_API_KEY}}
Body Params application/json
audio_url
string
音频链接
required
Audio Link
language
string
音频的语言
required
Audio language eg: zh, en
demucs
boolean
是否开启人声分 离
optional
Whether to enable vocal separation
Default:
true
is_only_demucs
boolean
是否只分离人声不执行转录
optional
Whether to separate vocals without performing transcription