Model(Create Voice)
POST
/fish-audio/modelSound cloning from Fish Audio: submit audio files for cloning.
Requirements for incoming audio:
Audio Length: 30-45 seconds of high-quality audio
Speaker: Single speaker only.
Other details: Consistent volume, pitch, and emotional expression and Short pauses (approximately 0.5 seconds recommended)
Ideal state:
no background noise
professional recording quality
no room echo
Fish Audio:https://docs.fish.audio/api-reference/endpoint/model/create-model
Price:0 PTC/call
请求参数
Available options: public, unlist, private
Model type, tts is for text to speech
Model title or name
Model description
Model cover image, this is required if the model is public
Available options: fast
Upload voices files that will be used to tune the model
Texts corresponding to the voices, if unspecified, ASR will be performed on the voices
Model tags
Enhance audio quality
示例代码
Responses
{}