POST
/
create-custom-voice
curl --request POST \
  --url https://client.camb.ai/apis/create-custom-voice \
  --header 'Content-Type: multipart/form-data' \
  --header 'x-api-key: <api-key>' \
  --form 'voice_name=<string>' \
  --form gender=0 \
  --form age=30 \
  --form enhance_audio=false
{
  "voice_id": 123
}
The audio file should clearly represent the desired voice for optimal cloning accuracy.

This endpoint enables users to generate a custom cloned voice based on a provided sample, which is processed to replicate the unique characteristics of the reference voice. The resulting cloned voice can be used for various tasks such as text-to-speech, dubbing, and more.

Supported Files

The file property accepts the following file formats:

  • AAC
  • FLAC
  • MP3
  • WAV

Python Example

import requests

files = {'file': open('file.mp3', 'rb')}
data = {
    'voice_name': 'Voice Name',
    'gender': 1,
    'age': 30
}
response = requests.post(
    "https://client.camb.ai/apis/create-custom-voice",
    files=files,
    data=data,
    headers={
        "x-api-key": API_KEY
    }
)
print(response.json())

Authorizations

x-api-key
string
header
required

The x-api-key is a custom header required for authenticating requests to our API. Include this header in your request with the appropriate API key value to securely access our endpoints. You can find your API key(s) in the 'API' section of our studio website.

Body

multipart/form-data
voice_name
string
required

The name or label to be assigned to the voice.

gender
enum<integer>
required

Represents the gender of the speaker in the provided audio. Values are encoded as integers.

Available options:
0,
1,
2,
9
file
file
required

The reference audio file that will be used to create the custom voice. The file should have clear speech to ensure optimal cloning accuracy. Supported formats include .aac, .flac, .mp3 and .wav.

age
integer
default:
30

The estimated or actual age of the speaker in the reference audio.

Required range: x > 1
enhance_audio
boolean
default:
false

If set to true, the system will apply audio enhancement techniques such as noise reduction and volume normalization to improve voice clarity.

Response

200
application/json
Successful Response
voice_id
integer