Voice cloning with Fish Audio: submit audio files for cloning.
Requirements for incoming audio:
Audio length: 30-45 seconds of high-quality audio
Speaker: single speaker only
Other details: consistent volume, pitch, and emotional expression; short pauses (approximately 0.5 seconds recommended)
Ideal state: no background noise, professional recording quality, no room echo
Fish Audio: https://docs.fish.audio/api-reference/endpoint/model/create-model
Price: 0 PTC/call
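The 30-45 second length recommendation can be checked locally before uploading. The following is a minimal sketch using Python's standard `wave` module; it assumes the recording is an uncompressed WAV file, and the function and constant names are illustrative only, not part of the API.

```python
import wave

# Recommended clip length from the requirements above, in seconds.
MIN_SECONDS = 30.0
MAX_SECONDS = 45.0


def wav_duration_seconds(path_or_file):
    """Return the duration of a WAV file (path or binary file object) in seconds."""
    with wave.open(path_or_file, "rb") as wav:
        return wav.getnframes() / wav.getframerate()


def is_recommended_length(path_or_file):
    """True if the clip falls inside the recommended 30-45 second window."""
    return MIN_SECONDS <= wav_duration_seconds(path_or_file) <= MAX_SECONDS
```

A check like this only covers duration; background noise, echo, and the number of speakers still need to be verified by listening to the recording.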
Request
Header Params
Authorization
string
optional
Example:
Bearer {{YOUR_API_KEY}}
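As an illustration of the header format shown above, the sketch below builds the `Authorization` value from an API key. The helper name `auth_headers` is hypothetical and is not part of the API.

```python
def auth_headers(api_key: str) -> dict:
    """Build the Authorization header in the 'Bearer <key>' form shown above."""
    key = (api_key or "").strip()
    if not key:
        raise ValueError("api_key must not be empty")
    return {"Authorization": f"Bearer {key}"}
```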
Body Params multipart/form-data
visibility
string
optional
Available options: public, unlist, private
Example:
private
type
string
required
Model type; tts is for text-to-speech
Example:
tts
title
string
required
Model title or name
description
string
optional
Model description
cover_image
file
optional
Model cover image; required if the model is public
train_mode
string
required
Available options: fast
Example:
fast
voices
file
required
Voice audio files that will be used to tune the model
texts
string
optional
Texts corresponding to the voice files. If unspecified, ASR (automatic speech recognition) will be performed on the voice files.
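The body parameters above can be assembled into a multipart/form-data request roughly as follows. This is a sketch under stated assumptions, not a definitive client: the function names are hypothetical, the endpoint URL is not given on this page and must be supplied by the caller, sending one `texts` field per voice file is an assumption, and the third-party `requests` package is assumed to be installed for the sending step.

```python
from typing import Optional, Sequence, Tuple

# Allowed values as listed in the parameter descriptions above.
VISIBILITY_OPTIONS = {"public", "unlist", "private"}
TRAIN_MODES = {"fast"}


def build_create_model_form(
    title: str,
    voices: Sequence[Tuple[str, bytes]],
    texts: Optional[Sequence[str]] = None,
    visibility: Optional[str] = None,
    description: Optional[str] = None,
    cover_image: Optional[Tuple[str, bytes]] = None,
    model_type: str = "tts",
    train_mode: str = "fast",
):
    """Validate the inputs and return (data, files) for a multipart request.

    `voices` and `cover_image` are (filename, content) pairs.
    """
    if not title:
        raise ValueError("title is required")
    if not voices:
        raise ValueError("at least one voices file is required")
    if train_mode not in TRAIN_MODES:
        raise ValueError(f"train_mode must be one of {sorted(TRAIN_MODES)}")
    if visibility is not None and visibility not in VISIBILITY_OPTIONS:
        raise ValueError(f"visibility must be one of {sorted(VISIBILITY_OPTIONS)}")
    if visibility == "public" and cover_image is None:
        raise ValueError("cover_image is required if the model is public")

    # Plain form fields; optional ones are included only when provided.
    data = [("type", model_type), ("title", title), ("train_mode", train_mode)]
    if visibility is not None:
        data.append(("visibility", visibility))
    if description is not None:
        data.append(("description", description))
    # Assumption: one `texts` field per voice file, in the same order.
    for text in texts or []:
        data.append(("texts", text))

    # File fields, in the (field, (filename, content)) shape accepted by requests.
    files = [("voices", (name, content)) for name, content in voices]
    if cover_image is not None:
        files.append(("cover_image", cover_image))
    return data, files


def submit(url: str, api_key: str, data, files):
    """Send the form. `url` is a placeholder the caller must supply."""
    import requests  # third-party dependency, imported lazily

    response = requests.post(
        url,
        headers={"Authorization": f"Bearer {api_key}"},
        data=data,
        files=files,
        timeout=60,
    )
    response.raise_for_status()
    return response
```

Keeping validation separate from the network call lets the required fields and the public/cover_image rule be checked before any upload happens.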