Audio vocal separation and transcription

POST

/302/vt/subtitle/extract

Transcribes audio into word-level text data using our own optimized Whisper model.

Price for voice separation only: 0.001 PTC/min
Price for separation + transcription + alignment: 0.003 PTC/min
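
As a rough illustration of the pricing above, the sketch below estimates the cost of one request from the audio duration. It assumes billing is pro-rated by the second with no rounding or minimum charge, which this page does not state. The function name and the `transcribe` flag are hypothetical and are not part of the API.

```python
from decimal import Decimal

# Rates quoted on this page, in PTC per minute of audio.
RATE_SEPARATION_ONLY = Decimal("0.001")
RATE_FULL_PIPELINE = Decimal("0.003")  # separation + transcription + alignment


def estimate_cost(duration_seconds, transcribe=True):
    """Estimate the cost in PTC for one audio file.

    Assumption: cost scales linearly with duration (no per-minute rounding).
    `transcribe` is a hypothetical switch used only to pick the rate here.
    """
    rate = RATE_FULL_PIPELINE if transcribe else RATE_SEPARATION_ONLY
    minutes = Decimal(str(duration_seconds)) / Decimal(60)
    return rate * minutes
```

Under these assumptions, a 10-minute file run through the full pipeline would come to 0.03 PTC. Check the actual billing rules before relying on such an estimate.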

Request

Header Params

Body Params application/json

Example

{
  "audio_url": "https://file.302ai.cn/gpt/imgs/tts_demo_minimax_male-qn-qingse_en.mp3",
  "language": "en",
  "demucs": true
}

Request Code Samples

Shell

curl --location 'https://api.302.ai/302/vt/subtitle/extract' \
--header 'Authorization: Bearer ' \
--header 'Content-Type: application/json' \
--data '{
  "audio_url": "https://file.302ai.cn/gpt/imgs/tts_demo_minimax_male-qn-qingse_en.mp3",
  "language": "en",
  "demucs": true
}'
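
The same request can be sketched in Python using only the standard library. This is a minimal example, not an official client: the function names `build_extract_request` and `submit` are hypothetical, the API key is supplied by the caller, and the response is assumed to contain the `task_id` field shown in the response example on this page.

```python
import json
import urllib.request

API_URL = "https://api.302.ai/302/vt/subtitle/extract"


def build_extract_request(api_key, audio_url, language="en", demucs=True):
    """Build (but do not send) the POST request for the extract endpoint."""
    payload = {
        "audio_url": audio_url,
        "language": language,
        "demucs": demucs,
    }
    return urllib.request.Request(
        API_URL,
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "Authorization": f"Bearer {api_key}",
            "Content-Type": "application/json",
        },
        method="POST",
    )


def submit(request, timeout=30):
    """Send the request and return the task ID from the JSON response.

    Assumption: a successful response body looks like {"task_id": "..."}.
    urlopen raises urllib.error.HTTPError on non-2xx status codes.
    """
    with urllib.request.urlopen(request, timeout=timeout) as resp:
        body = json.load(resp)
    return body["task_id"]
```

Keeping request construction separate from sending makes it possible to inspect the payload and headers without making a network call. A typical call would be `submit(build_extract_request(api_key, audio_url))`.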

Responses

🟢 200 Success

Body application/json

Example

{
    "task_id": "string"
}

Modified at 2025-01-19 07:34:45
