AI Video Real-Time Translation Audio vocal separation and transcription
Run in Apifox
Use our own optimized whisper model to transcribe audio into word-level text data Price for voice separation only: 0.001PTC/min Price for separation + transcription + alignment: 0.003PTC/min
Request Body Params application/json
{
"audio_url" : "https://file.302ai.cn/gpt/imgs/tts_demo_minimax_male-qn-qingse_en.mp3" ,
"language" : "en" ,
"demucs" : true
}
Request Code Samples
curl --location --request POST 'https://api.302.ai/302/vt/subtitle/extract' \
--header 'Authorization: Bearer sk-jls4AaVBGoe1GwZD64qZA1qyKTN1MPHa4NmvH1cT68z7K1Zz' \
--header 'Content-Type: application/json' \
--data-raw '{
"audio_url": "https://file.302ai.cn/gpt/imgs/tts_demo_minimax_male-qn-qingse_en.mp3",
"language": "en",
"demucs": true
}'
Responses application/json Generate Code
Modified at 2025-01-19 07:34:45