Create Audio Separation
Creates a task to separate an audio file into distinct background and foreground components
Transform your mixed content into clean, isolated components with our powerful audio separation system. This innovative endpoint enables you to extract distinct foreground elements (such as vocals or lead instruments) from background elements (like accompaniment or ambient sounds). Whether you’re producing music, cleaning up recordings, creating remixes, or developing audio-focused applications, this capability allows you to deconstruct complex media files into their fundamental components for enhanced control and creative flexibility.Documentation Index
Fetch the complete documentation index at: https://docs.camb.ai/llms.txt
Use this file to discover all available pages before exploring further.
The Audio Separation Process
When you submit an audio separation request, our system begins a sophisticated workflow:Task Creation
task_id) that you’ll use to track and retrieve your separated audio components.Audio Analysis
/audio-separation/{task_id} endpoint with the task_id provided in your initial response.
Creating Your First Audio Separation Request
Let’s examine how to initiate an audio separation task using Python:Monitoring Your Separation Progress
After submission, your audio separation task enters our processing pipeline. You can monitor the progress by polling the status endpoint:Supported Audio Formats
Our audio separation system supports a variety of common audio formats to accommodate your workflow needs:- FLAC (Free Lossless Audio Codec) - Ideal for high-quality lossless audio
- MP3 (MPEG Audio Layer III) - Common compressed format widely supported across platforms
- WAV (Waveform Audio File Format) - Uncompressed audio format with excellent quality
- AAC (Advanced Audio Coding) - High-quality compressed format commonly used in mobile devices
Best Practices for Audio Separation
To achieve the highest quality separation results, consider these professional recommendations:- Use High-Quality Sources: Whenever possible, use uncompressed (WAV) or lossless (FLAC) audio for separation.
- Optimal Length: While the system can handle various durations, files between 10 seconds and 10 minutes typically yield the best results.
- Clear Recording: Audio with minimal background noise, distortion, or heavy effects will separate more cleanly.
- Balanced Mix: Ensure that both foreground and background elements are audible in the original mix for best separation.
- Consider Preprocessing: For challenging audio, consider using noise reduction or equalization before submission to improve separation quality.
Authorizations
The x-api-key is a custom header required for authenticating requests to our API. Include this header in your request with the appropriate API key value to securely access our endpoints. You can find your API key(s) in the 'API' section of our studio website.
Body
Enter a distinctive name for your project that reflects its purpose or content. This name will be displayed in your CAMB.AI workspace dashboard and used to organize related assets, transcriptions, etc.. . Choose something memorable that helps you quickly identify this specific project among your other voice, audio and localization tasks.
3 - 255Provide details about your project's goals and specifications. Include information such as the target languages for translation or dubbing, desired voice characteristics, emotional tones to capture, or specific audio processing requirements, outlining the workflow here can serve as valuable documentation for organizational purposes.
3 - 5000Media file to processed. AAC, FLAC, MP3 and WAV formats are supported.
Response
Successful Response
A JSON that contains unique identifier for the task. This is used to query the status of the audio separation task that is running. It is returned when a create request is made to separate audio into background and foreground.