Create a Custom Voice
Creates a new voice clone using by uploading an audio file reference.
This endpoint enables users to generate a custom cloned voice based on a provided sample, which is processed to replicate the unique characteristics of the reference voice. The resulting cloned voice can be used for various tasks such as text-to-speech, dubbing, and more.
Supported Files
The file
property accepts the following file formats:
- AAC
- FLAC
- MP3
- WAV
Python Example
Authorizations
The x-api-key
is a custom header required for authenticating requests to our API. Include this header in your request with the appropriate API key value to securely access our endpoints. You can find your API key(s) in the 'API' section of our studio website.
Body
The name or label to be assigned to the voice.
Represents the gender of the speaker in the provided audio. Values are encoded as integers.
0
, 1
, 2
, 9
The reference audio file that will be used to create the custom voice. The file should have clear speech to ensure optimal cloning accuracy. Supported formats include .aac
, .flac
, .mp3
and .wav
.
The estimated or actual age of the speaker in the reference audio.
x > 1
If set to true
, the system will apply audio enhancement techniques such as noise reduction and volume normalization to improve voice clarity.