T2A (voice generation-synchronization)

POST

/minimaxi/v1/t2a_v2

Text-generated timbre frequencies from Minimax
Official Documentation: https://platform.minimaxi.com/document/VoiceGeneration?key=669f5af198ff2c57eeb9a0f0
Price: 30 PTC/million words

Request

Authorization

Header Params

Authorization

string

optional

Example:

Bearer {{YOUR_API_KEY}}

Content-Type

string

optional

Example:

application/json

Body Params application/json

model

string

required

The model version called. Currently supported: speech-01-turbo, speech-01-240228, speech-01-turbo-240228, speech-01-hd

pronunciation_dict

object

optional

tone

array[string]

required

替换需要特殊标注的文字、符号及对应的注音。功能类型1，替换发音（调整音调/替换其他字符发音）：["燕少飞/(yan4)(shao3)(fei1)","达菲/(da2)(fei1)"，"omg/oh my god"，]。
1.声调用数字代替，一声（阴平）为1，二声（阳平）为2，三声（上声）为3，四声（去声）为4），轻声为5。

text

string

required

Listen to the audio text.

voice_setting

object

optional

The age of the sound. The values can be selected: 1.child, 2.teenager, 3.young, 4.middle-aged, 5.old.

speed

number

required

生成声音的语速，可选，取值越大，语速越快。

vol

number

required

生成声音的音量，可选，取值越大，音量越高。

pitch

number

required

生成声音的语调，可选，0为原音色输出，取值需为整数。

voice_id

string

required

请求的音色编号，支持系统音色(id)以及复刻音色（id）两种类型，其中系统音色（ID）如下：
青涩青年音色：male-qn-qingse
精英青年音色：male-qn-jingying
霸道青年音色：male-qn-badao
青年大学生音色：male-qn-daxuesheng
少女音色：female-shaonv
御姐音色：female-yujie
成熟女性音色：female-chengshu
甜美女性音色：female-tianmei
男性主持人：presenter_male
女性主持人：presenter_female
男性有声书1：audiobook_male_1
男性有声书2：audiobook_male_2
女性有声书1：audiobook_female_1
女性有声书2：audiobook_female_2
青涩青年音色-beta：male-qn-qingse-jingpin
精英青年音色-beta：male-qn-jingying-jingpin
霸道青年音色-beta：male-qn-badao-jingpin
青年大学生音色-beta：male-qn-daxuesheng-jingpin
少女音色-beta：female-shaonv-jingpin
御姐音色-beta：female-yujie-jingpin
成熟女性音色-beta：female-chengshu-jingpin
甜美女性音色-beta：female-tianmei-jingpin
聪明男童：clever_boy
可爱男童：cute_boy
萌萌女童：lovely_girl
卡通猪小琪：cartoon_pig

audio_setting

object

optional

sample_rate

number

optional

生成声音的采样率，可选，默认为32000

bitrate

number

optional

生成声音的比特率。可选，，默认值为128000。该参数仅对mp3格式的音频生效。

format

string

optional

生成的音频格式。默认mp3，范围[mp3,pcm,flac]。

channel

number

required

生成音频的声道数。1：单声道；2：双声道。

Example

{
  "model": "speech-01-turbo",
  "text": "302.AI是一个汇集全球顶级品牌的AI超市，汇集全球各类顶尖AI模型，提供多种AI机器人，各种AI工具的使用和AI API接入。",
  "voice_setting": {
    "voice_id": "audiobook_male_1",
    "speed": 1,
    "vol": 1,
    "pitch": 1
  },
  "pronunciation_dict": {
    "tone": [
      "草地/(cao3)(di1)"
    ]
  },
  "audio_setting": {
    "audio_sample_rate": 32000,
    "bitrate": 128000,
    "format": "mp3",
    "channel": 2
  }
}

Request samples

Shell

JavaScript

Java

Swift

PHP

Python

HTTP

Objective-C

Ruby

OCaml

Dart

curl --location --request POST 'https://api.302.ai/minimaxi/v1/t2a_v2' \
--header 'Authorization: Bearer sk-jls4AaVBGoe1GwZD64qZA1qyKTN1MPHa4NmvH1cT68z7K1Zz' \
--header 'Content-Type: application/json' \
--data-raw '{
  "model": "speech-01-turbo",
  "text": "302.AI是一个汇集全球顶级品牌的AI超市，汇集全球各类顶尖AI模型，提供多种AI机器人，各种AI工具的使用和AI API接入。",
  "voice_setting": {
    "voice_id": "audiobook_male_1",
    "speed": 1,
    "vol": 1,
    "pitch": 1
  },
  "pronunciation_dict": {
    "tone": [
      "草地/(cao3)(di1)"
    ]
  },
  "audio_setting": {
    "audio_sample_rate": 32000,
    "bitrate": 128000,
    "format": "mp3",
    "channel": 2
  }
}'

Responses

🟢200OK

application/json

Body

object {0}

Example

{}

Modified at 2025-02-28 08:29:29

Music Generation API

TTS（Text to Speech）