T2A (Voice Generation, Synchronous)
POST /minimaxi/v1/t2a_v2
Official Documentation: https://platform.minimaxi.com/document/VoiceGeneration?key=669f5af198ff2c57eeb9a0f0
Price: 30 PTC/million words
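As a rough illustration of the price above, the cost of a request can be estimated from the amount of text sent. This is only a sketch: it assumes billing is strictly proportional at 30 PTC per million words with no minimum charge or rounding, and it does not know how the service counts "words". The helper name `estimate_cost_ptc` is hypothetical.

```python
PRICE_PTC_PER_MILLION_WORDS = 30  # taken from the "Price" line above


def estimate_cost_ptc(word_count):
    """Estimate the cost in PTC for `word_count` billable words.

    Assumes purely linear pricing; the real billing rules may differ.
    """
    if word_count < 0:
        raise ValueError("word_count must be non-negative")
    return word_count * PRICE_PTC_PER_MILLION_WORDS / 1_000_000


print(estimate_cost_ptc(50_000))
```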
Request
Authorization
Header Params
Authorization (string, optional)
    Example: Bearer {{YOUR_API_KEY}}
Content-Type (string, optional)
    Example: application/json
Body Params application/json
model (string, required)
pronunciation_dict (object, optional)
    tone (array[string], required)
        Tones are written as digits: the first tone (yin ping) is 1, the second tone (yang ping) is 2, the third tone (shang sheng) is 3, the fourth tone (qu sheng) is 4, and the neutral tone is 5.
text (string, required)
voice_setting (object, optional)
    speed (number, required)
    vol (number, required)
    pitch (number, required)
    voice_id (string, required)
        Youthful young man: male-qn-qingse
        Elite young man: male-qn-jingying
        Domineering young man: male-qn-badao
        Young college student: male-qn-daxuesheng
        Young girl: female-shaonv
        Mature, confident woman (yujie): female-yujie
        Mature woman: female-chengshu
        Sweet woman: female-tianmei
        Male presenter: presenter_male
        Female presenter: presenter_female
        Male audiobook 1: audiobook_male_1
        Male audiobook 2: audiobook_male_2
        Female audiobook 1: audiobook_female_1
        Female audiobook 2: audiobook_female_2
        Youthful young man (beta): male-qn-qingse-jingpin
        Elite young man (beta): male-qn-jingying-jingpin
        Domineering young man (beta): male-qn-badao-jingpin
        Young college student (beta): male-qn-daxuesheng-jingpin
        Young girl (beta): female-shaonv-jingpin
        Mature, confident woman (yujie, beta): female-yujie-jingpin
        Mature woman (beta): female-chengshu-jingpin
        Sweet woman (beta): female-tianmei-jingpin
        Clever boy: clever_boy
        Cute boy: cute_boy
        Lovely girl: lovely_girl
        Cartoon pig Xiaoqi: cartoon_pig
audio_setting (object, optional)
    sample_rate (number, optional)
    bitrate (number, optional)
    format (string, optional)
    channel (number, required)
Example
{
    "model": "speech-01-turbo",
    "text": "302.AI是一个汇集全球顶级品牌的AI超市,汇集全球各类顶尖AI模型,提供多种AI机器人,各种AI工具的使用和AI API接入。",
    "voice_setting": {
        "voice_id": "audiobook_male_1",
        "speed": 1,
        "vol": 1,
        "pitch": 1
    },
    "pronunciation_dict": {
        "tone": [
            "草地/(cao3)(di1)"
        ]
    },
    "audio_setting": {
        "sample_rate": 32000,
        "bitrate": 128000,
        "format": "mp3",
        "channel": 2
    }
}
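The `tone` entries in `pronunciation_dict` appear to follow the pattern `word/(pinyin+tone)(pinyin+tone)...`, as in `草地/(cao3)(di1)`, with tone digits 1 to 5. The following is a minimal sketch of building such an entry and a request body in Python. The helper names `tone_entry` and `build_payload` are hypothetical, the entry format is inferred from the single example on this page, and the default `speed`, `vol`, and `pitch` values are simply copied from that example.

```python
import json

VALID_TONES = {1, 2, 3, 4, 5}  # 1-4 are the four tones, 5 is the neutral tone


def tone_entry(word, syllables):
    """Format one pronunciation_dict.tone entry, e.g. "草地/(cao3)(di1)".

    `syllables` is a list of (pinyin, tone) pairs, one per syllable.
    """
    parts = []
    for pinyin, tone in syllables:
        if tone not in VALID_TONES:
            raise ValueError(f"tone must be 1-5, got {tone!r}")
        parts.append(f"({pinyin}{tone})")
    return f"{word}/{''.join(parts)}"


def build_payload(text, voice_id, tone_entries=()):
    """Assemble a request body with the fields listed under Body Params."""
    payload = {
        "model": "speech-01-turbo",
        "text": text,
        "voice_setting": {"voice_id": voice_id, "speed": 1, "vol": 1, "pitch": 1},
    }
    if tone_entries:
        # pronunciation_dict is optional, so only include it when needed
        payload["pronunciation_dict"] = {"tone": list(tone_entries)}
    return payload


entry = tone_entry("草地", [("cao", 3), ("di", 1)])
payload = build_payload("草地上有一只小狗。", "audiobook_male_1", [entry])
print(json.dumps(payload, ensure_ascii=False, indent=2))
```

Validating the tone digit locally is a design choice for catching typos early; the service may apply its own, stricter checks.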
Request samples
curl --location --request POST 'https://api.302.ai/minimaxi/v1/t2a_v2' \
--header 'Authorization: Bearer {{YOUR_API_KEY}}' \
--header 'Content-Type: application/json' \
--data-raw '{
    "model": "speech-01-turbo",
    "text": "302.AI是一个汇集全球顶级品牌的AI超市,汇集全球各类顶尖AI模型,提供多种AI机器人,各种AI工具的使用和AI API接入。",
    "voice_setting": {
        "voice_id": "audiobook_male_1",
        "speed": 1,
        "vol": 1,
        "pitch": 1
    },
    "pronunciation_dict": {
        "tone": [
            "草地/(cao3)(di1)"
        ]
    },
    "audio_setting": {
        "sample_rate": 32000,
        "bitrate": 128000,
        "format": "mp3",
        "channel": 2
    }
}'
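The same request can be sketched in Python using only the standard library. The endpoint URL and headers are taken from the curl command above; the function names `build_request` and `synthesize` are hypothetical, and because this page documents the response body only as an empty object, the sketch returns the parsed JSON without assuming any fields.

```python
import json
import urllib.request

API_URL = "https://api.302.ai/minimaxi/v1/t2a_v2"


def build_request(api_key, payload):
    """Build (but do not send) the POST request for the T2A endpoint."""
    body = json.dumps(payload, ensure_ascii=False).encode("utf-8")
    return urllib.request.Request(
        API_URL,
        data=body,
        method="POST",
        headers={
            "Authorization": f"Bearer {api_key}",
            "Content-Type": "application/json",
        },
    )


def synthesize(api_key, payload, timeout=60):
    """Send the request and return the parsed JSON response.

    The response schema is not documented on this page, so no fields are assumed.
    """
    request = build_request(api_key, payload)
    with urllib.request.urlopen(request, timeout=timeout) as resp:
        return json.loads(resp.read().decode("utf-8"))
```

A call would look like `synthesize(api_key, {"model": "speech-01-turbo", "text": "..."})`, with the key read from a secret store or environment variable rather than hard-coded in source.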
Responses
200 OK
application/json
Body
object {0}
Example
{}
Modified at 2025-02-28 08:29:29