- Large Language Model
- API Migration Guide
- Exclusive Feature
- Model Support
- OpenAI
- Chat(Talk)
- Responses(Talk)
- Chat(Streamed return.)
- Chat (gpt-4o Image Analysis)
- Chat (gpt-4o Structured Output)
- Chat (gpt-4o function call)
- Chat (gpt-4-plus image analysis)
- Chat (gpt-4-plus image generation)
- Chat(gpt-4o-image-generation modify image)
- Chat (gpts model)
- Chat (chatgpt-4o-latest)
- Chat (o1 Series Model)
- Chat (o3 Series Model)
- Chat(o4 Series)
- Chat(gpt-4o audio model)
- Anthropic
- Gemini
- China Model
- Chat (Baidu ERNIE)
- Chat (Tongyi Qianwen)
- Chat (Tongyi Qianwen-VL)
- Chat(Tongyi Qianwen-OCR)
- Chat (Zhipu GLM-4)
- Chat (Zhipu GLM-4V)
- Chat (Baichuan AI)
- Chat (Moonshot AI)
- Chat (Moonshot AI-Vision)
- Chat (01.AI)
- Chat (01.AI-VL)
- Chat (DeepSeek)
- Chat (DeepSeek-VL2)
- Chat (ByteDance Doubao)
- Chat (ByteDance Doubao-Vision)
- Chat(ByteDance Doubao Image Generation)
- Chat (Stepfun)
- Chat (Stepfun Multimodal)
- Chat (iFLYTEK Spark)
- Chat (SenseTime)
- Chat(Minimax)
- Chat (Tencent Hunyuan)
- SiliconFlow
- PPIO
- Open Source Model
- Expert Model
- Other Models
- Image Generation
- Unified interface
- GPT-Image-1
- DALL.E
- Stability.ai
- Text-to-image (Image Generation-V1)
- Generate (Image Generation-SD2)
- Generate (Image Generation-SD3-Ultra)
- Generate (Image Generation-SD3)
- Generate(Image Generation-SD3.5-Large)
- Generate(Image Generation-SD3.5-Medium)
- Generate(Image to Image-SD3)
- Generate(Image to Image-SD3.5-Large)
- Generate(Image to Image-SD3.5-Medium)
- Midjourney
- Midjourney-Relax
- 302.AI
- Glif
- Flux
- Ideogram
- Recraft
- Luma
- Doubao
- Minimax
- ZHIPU
- Baidu
- Hidream
- Bagel
- SiliconFlow
- Image Processing
- 302.AI
- Upscale
- Upscale-V2
- Upscale-V3
- Upscale-V4
- Super-Upscale
- Super-Upscale-V2
- Face-upscale
- Colorize
- Colorize-V2
- Removebg
- Removebg-V2
- Removebg-V3
- Inpaint
- Erase
- Face-to-many
- Llava
- Relight
- Relight-background
- Relight-V2
- Face-swap-V2
- Fetch
- HtmltoPng
- SvgToPng
- image-translate
- image-translate-query
- image-translate-redo
- Flux-selfie
- Trellis(Image to 3D model)
- Pose-Transfer(Human Pose Transformation)
- Pose-Transfer(Human Pose Transformation Result)
- Virtual-Tryon
- Virtual-Tryon(Fetch Result)
- Denoise(AI Denoising)
- Deblur(AI Deblurring)
- 302.AI-ComfyUI
- Create Outfit Change Task
- Create Outfit Change Task (Upload Mask)
- Query Outfit Change Task Status
- Create Face Swap Task
- Query Face Swap Task Status
- Create a Task to Replace Any Item
- Create Object Replacement Task (Upload Mask)
- Check the Status of Any Object Replacement Task
- Create a Task to Transform Cartoon Characters into Real People
- Query the status of the task to turn a manga character into a real person
- Create Style Transfer Task
- Query the status of the style transfer task
- Create Image Removal Task
- Query Image Removal Task Status
- Create Video Face Swap Task
- Query Video Face Swap Task Status
- Vectorizer
- Stability.ai
- Glif
- Clipdrop
- Recraft
- BRIA
- Flux
- 官方API
- Flux-V1.1-Ultra-Redux(Image-to-image generation-Ultra)
- Flux-V1.1-Pro-Redux(Image-to-image generation-Pro)
- Flux-Dev-Redux(Image-to-image generation-Dev)
- Flux-Schnell-Redux(Image-to-image generation-Schnell)
- Flux-V1-Pro-Canny(Object consistency)
- Flux-V1-Pro-Depth(Depth consistency)
- Flux-V1-Pro-Fill(Partial repainting)
- Flux-Kontext-Pro(Image Edit)
- Flux-Kontext-Max(Image Edit)
- Hyper3D
- Tripo3D
- FASHN
- Ideogram
- Doubao
- Kling
- StepFun
- Bagel
- 302.AI
- Video Generation
- Unified Interface
- 302.AI
- Stable Diffusion
- Luma AI
- Runway
- Kling
- 302 format
- Txt2Video(Text to Video 1.0 Rapid-5s)
- Txt2Video_HQ(Text to Video 1.5 HQ-5s)
- Txt2Video_HQ(Text to Video 1.5 HQ-10s)
- Image2Video(Image to Video 1.0 Rapid-5s)
- Image2Video(Image to Video 1.0 Rapid-10s)
- Image2Video(Image to Video 1.5 Rapid-5s)
- Image2Video(Image to Video 1.5 Rapid-10s)
- Image2Video_HQ(Image to Video 1.5 HQ-5s)
- Image2Video_HQ(Image to Video 1.5 HQ-10s)
- Txt2Video(Text to Video 1.6 Standard-5s)
- Txt2Video(Text to Video 1.6 Standard-10s)
- Txt2Video(Text to Video 1.6 HQ-5s)
- Txt2Video(Text to Video 1.6 HQ-10s)
- Image2Video(Image to Video 1.6 Standard-5s)
- Image2Video(Image to Video 1.6 Standard-10s)
- Image2Video(Image to Video 1.6 HQ-5s)
- Image2Video(Image to Video 1.6 HQ-10s)
- Txt2Video(Text-to-Video 2.0 – HD – 5s)
- Image2Video(Image-to-Video 2.0 – HD – 5s)
- Image2Video(Image-to-Video 2.0 – HD – 10s)
- Image2Video (Multiple pictures for reference)
- Image2Video(Multiple pictures for reference)
- Extend_Video
- Image2Video(Image video 2.1-5 seconds)
- Image2Video(Image video 2.1-10 seconds)
- Image2Video(Image Video 2.1-HD-10 seconds)
- Image2Video(Image Video 2.1-HD-5 seconds)
- Fetch
- Official format
- 302 format
- CogVideoX
- Minimax
- Pika
- 1.5 pikaffects(Image-to-Video Generation)
- Turbo Generate(Text-to-Video Generation)
- Turbo Generate(Text-to-Video Generation)
- 2.1 Generate(Text-to-Video Generation)
- 2.1 Generate(Image-to-Video Generation)
- 2.2 Generate(Text-to-Video Generation)
- 2.2 Generate(Image-to-Video Generation)
- 2.2 Pikascenes(Generate scene videos)
- Fetch(Result)
- PixVerse
- Genmo
- Hedra
- Haiper
- Sync.
- Lightricks
- Hunyuan
- Vidu
- Vidu(Text-to-Video)
- Vidu(Image to Video)
- Vidu(Generate video from the first and last frames)
- Vidu(Reference-based video generation)
- Vidu(Generate scene video)
- Vidu(Smart Ultra HD)
- Fetch(Retrieve Task Results)
- Vidu V2(Text-to-Video Generation)
- Vidu V2(Image-to-Video)
- Vidu V2(Start-and-End Frame Video Generation)
- Vidu V2(Subject-Driven Video Generation)
- Vidu(Scene Video Generation V2)
- Vidu V2(AI Ultra HD – Premium)
- Fetch V2(Retrieve Task Result)
- Tongyi Wanxiang
- Jimeng
- SiliconFlow
- Kunlun Tech
- Higgsfield
- Chanjing
- Audio/Video Processing
- Unified interface
- 302.AI
- Stable-Audio(instrumental generation)
- Transcript (Audio/Video to Text)
- Transcriptions(Speech to Text)
- Alignments(Subtitle Timing)
- WhisperX
- F5-TTS(Text to Speech)
- F5-TTS (Asynchronous Text-to-Speech)
- F5-TTS (Asynchronously Retrieve Results)
- mmaudio(Text-to-Speech)
- mmaudio(AI Video Voiceover)
- mmaudio (Asynchronous Result Retrieval)
- Diffrhythm(Song Generation)
- OpenAI
- Azure
- Suno
- Doubao
- Fish Audio
- Minimax
- Dubbingx
- Udio
- Elevenlabs
- Mureka
- SiliconFlow
- Google
- Chanjing
- Information Processing
- Unified Search API
- 302.AI
- Admin Dashboard
- Information search
- Xiaohongshu_Search
- Xiaohongshu_Note
- Get_Home_Recommend
- Tiktok_Search
- Douyin_Search
- Twitter_Search
- Twitter_Post(X_Post)
- Twitter_User(X_User)
- Weibo_Post
- Search_Video
- Youtube_Info
- Youtube_Subtitles(Youtube Obtain Subtitles)
- Bilibili_Info(Bilibili Obtain Video Information)
- MP_Article_List(Get the list of WeChat official account articles)
- MP_Article(Retrieve WeChat Official Account articles)
- File processing
- Code execution
- Remote Browser
- Tavily
- SearchAPI
- Search1API
- Exa
- Bocha AI
- Doc2x
- Glif
- Jina
- DeepL
- RSSHub
- Firefly card
- Youdao
- Mistral
- Firecrawl
- RAG-related
- Tools API
- AI Video Creation Hub
- AI Paper Writing
- AI Podcast Production
- AI Writing Assistant
- AI Video Real-Time Translation
- AI Document Editor
- Web Data Extraction Tool
- AI Prompt Expert
- AI 3D Modeling
- AI Search Master 3.0
- AI Vector Graphics Generation
- Al Answer Machine
- AI PPT Generator
- Generate PPT interface with one click
- File parsing
- Generate an outline
- Generate outline content
- Get template options
- Generate PPT interface (synchronous interface)
- Load PPT data
- Generate PPT interface (asynchronous interface)
- Asynchronous query generates PPT status
- Download PPT
- Add/update custom PPT templates
- Pagination query PPT template
- AI Academic Paper Search
- One-Click Website Deployment
- AI Avatar Maker
- AI Card Generation
- AI Image Creative Station API
- Help Center
Retrieve speech synthesis results
POST
/chanjing/open/v1/audio_task_state
Primary Field | Secondary Field | Tertiary Field | Description |
---|---|---|---|
code | Response status code | ||
msg | Response message | ||
data | id | Video ID | |
type | Voice type | ||
status | Status: 10 Generating, 9 Generated (includes success and failure) | ||
text | Voice text | ||
full | url | Standard frequency link | |
path | Standard frequency address | ||
duration | Standard frequency duration | ||
slice | Slice | ||
errMsg | Error message | ||
errReason | Error reason | ||
subtitles (array type) | key | Subtitle key value | |
start_time | Subtitle start time | ||
end_time | Subtitle end time | ||
subtitle | Subtitle text |
Request
Header Params
Authorization
string
optional
Example:
Bearer {{YOUR_API_KEY}}
Body Params application/json
task_id
string
required
Example
{
"task_id": "e2afea2439264340a9c7e9cbd091459e"
}
Request samples
Shell
JavaScript
Java
Swift
Go
PHP
Python
HTTP
C
C#
Objective-C
Ruby
OCaml
Dart
R
Request Request Example
Shell
JavaScript
Java
Swift
curl --location --request POST 'https://api.302.ai/chanjing/open/v1/audio_task_state' \
--header 'Authorization: Bearer sk-jls4AaVBGoe1GwZD64qZA1qyKTN1MPHa4NmvH1cT68z7K1Zz' \
--header 'Content-Type: application/json' \
--data-raw '{
"task_id": "e2afea2439264340a9c7e9cbd091459e"
}'
Responses
🟢200成功
application/json
Body
code
integer
required
data
object
required
errMsg
string
required
errReason
string
required
focus_titles
null
required
full
object
required
id
string
required
slice
null
required
status
integer
required
subtitles
array [object {5}]
required
text
array[string]
required
type
string
required
msg
string
required
trace_id
string
required
Example
{
"code": 0,
"data": {
"errMsg": "",
"errReason": "",
"focus_titles": null,
"full": {
"duration": 25.76,
"path": "fe9d6f5c1ce6713adaaab4aca3b1e13a.wav",
"url": "https://file.302ai.cn/gpt/imgs/fe9d6f5c1ce6713adaaab4aca3b1e13a.wav"
},
"id": "858e67f4862a414ba18dbfe464e6062f",
"slice": null,
"status": 9,
"subtitles": [
{
"end_time": 0.44,
"key": "2ddf93da3a83ae96b0ffa988846307fbe37852f5",
"keywords": null,
"start_time": 0,
"subtitle": "大家好"
},
{
"end_time": 2,
"key": "fc630b799ef142922ff02bb66c74f54afc994987",
"keywords": null,
"start_time": 0.44,
"subtitle": "欢迎收听本期内容"
},
{
"end_time": 4.68,
"key": "c1d6a7a8156401a6529f0632e3c49568f8b2dcdf",
"keywords": null,
"start_time": 2,
"subtitle": "今天我们将为大家介绍一项有趣的"
},
{
"end_time": 7.12,
"key": "8546dfae93693f38e2480c7f3aa841f3e0c34a94",
"keywords": null,
"start_time": 4.68,
"subtitle": "科学现象——猫在跌落时能够"
},
{
"end_time": 8.56,
"key": "6cbc62c732a017b5fe47553696e6a092a55b16ef",
"keywords": null,
"start_time": 7.12,
"subtitle": "在空中调整身体"
},
{
"end_time": 10.44,
"key": "7730609b73cc7562f365941815ce8d8a398a65d1",
"keywords": null,
"start_time": 8.56,
"subtitle": "通常能够四脚着地"
},
{
"end_time": 13.24,
"key": "122f34239f25bddd993d4d27b32de18bd024521b",
"keywords": null,
"start_time": 10.44,
"subtitle": "这种“猫右自己”反射显示了它们"
},
{
"end_time": 15.88,
"key": "b1a4ce8cc4fd6cfeb65f8557263f133f995481bb",
"keywords": null,
"start_time": 13.24,
"subtitle": "惊人的身体协调能力和灵活性"
},
{
"end_time": 19.08,
"key": "2f4f00f17c195755c7c222d5d0797ae7a667da82",
"keywords": null,
"start_time": 15.88,
"subtitle": "核磁共振成像技术通过利用人体"
},
{
"end_time": 21.52,
"key": "8a27008620c1c4577224e3cda68dc549098eddb2",
"keywords": null,
"start_time": 19.08,
"subtitle": "细胞中氢原子的磁性来生成"
},
{
"end_time": 22.84,
"key": "2f8369adea2699eb1227eb32fe16f2bac0563cb8",
"keywords": null,
"start_time": 21.52,
"subtitle": "详细的内部图像"
},
{
"end_time": 25.4,
"key": "6c6dfd1d8601cfa5bc793ea27d56ccde25c47ad9",
"keywords": null,
"start_time": 22.84,
"subtitle": "为医学诊断提供了重要工具"
}
],
"text": [
"大家好,欢迎收听本期内容。今天我们将为大家介绍一项有趣的科学现象——猫在跌落时能够在空中调整身体,通常能够四脚着地,这种“猫右自己”反射显示了它们惊人的身体协调能力和灵活性。核磁共振成像技术通过利用人体细胞中氢原子的磁性来生成详细的内部图像,为医学诊断提供了重要工具。"
],
"type": "tts"
},
"msg": "success",
"trace_id": "44bcf49a4620858ef59691221a5ba51d"
}
Modified at 2025-06-23 07:22:25