Create Audio Separation
Creates a task to separate an audio file into distinct background and foreground components
Transform your mixed content into clean, isolated components with our powerful audio separation system. This innovative endpoint enables you to extract distinct foreground elements (such as vocals or lead instruments) from background elements (like accompaniment or ambient sounds). Whether you’re producing music, cleaning up recordings, creating remixes, or developing audio-focused applications, this capability allows you to deconstruct complex media files into their fundamental components for enhanced control and creative flexibility.
The Audio Separation Process
When you submit an audio separation request, our system begins a sophisticated workflow:
Task Creation
The system registers your request and establishes a dedicated processing task, returning a unique task identifier (
) that you’ll use to track and retrieve your separated audio components.
Audio Analysis
Our advanced audio processing algorithms analyze the spectral and temporal characteristics of your audio file, identifying patterns that distinguish foreground from background elements.
Component Separation
Using specialized neural networks, the system carefully separates the audio into distinct foreground and background stems while preserving the acoustic quality of each component.
Throughout this process, you can monitor the status of your separation task by polling the /audio-separation/{task_id}
endpoint with the task_id
provided in your initial response.
Creating Your First Audio Separation Request
Let’s examine how to initiate an audio separation task using Python:
Monitoring Your Separation Progress
After submission, your audio separation task enters our processing pipeline. You can monitor the progress by polling the status endpoint:
Supported Audio Formats
Our audio separation system supports a variety of common audio formats to accommodate your workflow needs:
- FLAC (Free Lossless Audio Codec) - Ideal for high-quality lossless audio
- MP3 (MPEG Audio Layer III) - Common compressed format widely supported across platforms
- WAV (Waveform Audio File Format) - Uncompressed audio format with excellent quality
- AAC (Advanced Audio Coding) - High-quality compressed format commonly used in mobile devices
Best Practices for Audio Separation
To achieve the highest quality separation results, consider these professional recommendations:
-
Use High-Quality Sources: Whenever possible, use uncompressed (WAV) or lossless (FLAC) audio for separation.
-
Optimal Length: While the system can handle various durations, files between 10 seconds and 10 minutes typically yield the best results.
-
Clear Recording: Audio with minimal background noise, distortion, or heavy effects will separate more cleanly.
-
Balanced Mix: Ensure that both foreground and background elements are audible in the original mix for best separation.
-
Consider Preprocessing: For challenging audio, consider using noise reduction or equalization before submission to improve separation quality.
Authorizations
The x-api-key
is a custom header required for authenticating requests to our API. Include this header in your request with the appropriate API key value to securely access our endpoints. You can find your API key(s) in the 'API' section of our studio website.
Body
Response
Successful Response
A JSON that contains unique identifier for the task. This is used to query the status of the audio separation task that is running. It is returned when a create request is made to separate audio into background and foreground.