Overview
Create a custom voice clone from a reference audio file and use it to generate speech. The cloned voice captures the unique characteristics of the original speaker.Requirements
- Reference audio file (10-30 seconds of clear speech)
- Supported formats: WAV, MP3, FLAC, OGG
- Clean audio with minimal background noise works best
Prerequisites
Create an account
Sign up at CAMB.AI Studio if you haven’t already.
Get your API key
Go to Settings → API Keys in Studio and copy your key. See Authentication for details.
Install the SDK
Code
Parameters
| Parameter | Description | Values |
|---|---|---|
gender | Voice gender | 1 = male, 2 = female |
language | Voice language | Use Languages enum (e.g., Languages.EN_US, Languages.ES_ES, Languages.FR_FR) |
Tips
- Use high-quality reference audio for best results
- 15-20 seconds of speech is ideal
- Avoid background music or noise in reference audio
- The cloned voice is saved to your account for future use
Next Steps
Emotional Voice Control
Add emotional expression to your cloned voices with mars-instruct.
Text to Speech
Generate speech with any voice using the SDK.
TTS with Accents
Speak in 140+ language accents with the same voice.
API Reference
Full voice cloning API specification.
Text to Sound Effects
Generate sound effects and music from text.