Thin, hollow vocals
Happens most often on dense or over-compressed masters where the AI strips harmonic content along with instruments. Less busy mixes produce fuller-sounding acapellas.
Online acapella extractor
AI-extracted acapellas can sound thin or hollow, especially on dense mixes. Test your track here first so you know if the vocal is usable before exporting or paying for another tool.
Drop a song here — or tap to try it on your track
Free, in your browser. No signup. MP3, WAV, FLAC, M4A, OGG, or video.
Choose a fileThese show up with every tool, not just Unmix. Knowing them helps you pick better source tracks.
Happens most often on dense or over-compressed masters where the AI strips harmonic content along with instruments. Less busy mixes produce fuller-sounding acapellas.
Room ambience and effect tails tend to bleed into the vocal stem, especially on wet mixes. Dry, close-miked vocals separate much cleaner.
One track can sound great while another from the same album sounds terrible. Genre and mix style matter more than the tool you use.
Most producers don't use raw extracted vocals directly. Here is the typical chain.
Run the full track, then listen to verses vs choruses separately. Choruses with backing vocals are usually harder than clean verses.
Use EQ to roll off sub-bass mud and high-frequency hiss. Spectral editing tools like iZotope RX can remove remaining artifacts.
Extracted vocals often need reverb, compression, and pitch correction to sit right in a new instrumental. Treat them like a raw recording.
The AI extracts all vocal content together — lead and backing. You cannot isolate just the lead vocal separately from harmonies.
Tight harmonies that share frequency space with the lead tend to come through. Wide-panned backing vocals may partially end up in the instrumental stem.
Tracks with a single prominent vocal and minimal harmonies give the cleanest extraction. Layered vocal production is inherently harder.
A raw extracted vocal almost never ships. Here's the chain most producers run to get a remix-ready stem.
Load the vocal into iZotope RX (Spectral De-noise + De-click + Mouth De-click) or FabFilter Pro-Q 3. Target frequencies: high-pass at 80–120 Hz to remove kick bleed, notch cuts at 250 Hz for muddiness, gentle shelf down above 10 kHz for warbling shimmer. A 2–3 dB cut usually beats a 10 dB cut — over-EQ'd vocals sound processed.
Separation flattens dynamics unpredictably — some phrases sound compressed, others peaky. Put a fast compressor (3:1 ratio, 5 ms attack, 40 ms release) on the stem to re-glue them, then a de-esser targeting 6–8 kHz for sibilance the AI may have amplified. Waves Renaissance Vox or stock Ableton Compressor work fine here.
Re-add the space the AI stripped: a short plate reverb (1.2s, 20% wet) gives it body without dragging it backwards. Match the tonal balance of the new instrumental using a reference track. Most extracted vocals need a subtle saturator (Decapitator, Soundtoys FlipTape) to re-introduce harmonic richness that separation removes.
Upload any song and hear the separated stems in seconds. Free, no account needed.
WAV or high-bitrate audio gives the AI more harmonic detail to work with when separating vocal from instrumental content.
Layered choruses with backing vocals reveal artifacts fast. If that section works, the rest probably will too.
Almost every extracted acapella benefits from EQ and spectral editing. Budget time for post-processing in your remix workflow.
Not perfectly. Separating layered vocals is still one of the hardest problems in stem separation. You will get the combined vocal, not individual parts.
Yes. Upload a video file or use the audio-from-video page if your source is MP4 or another video format.
The AI removes some harmonic content along with the instruments. Higher-quality sources and less compressed masters reduce this effect.
Yes, that is one of the most common use cases. Most producers chain extraction with DAW cleanup to get remix-ready vocals.
Budget 15-30 minutes per track for EQ, spectral editing, and artifact removal. Short vocal chops need less work than full acapellas.
Not without a license. Separating a vocal doesn't change its copyright — you still need permission from the composition and master rights holders for any commercial release. Remix competitions run by labels usually pre-clear the license. Independent flips and bootlegs rely on SoundCloud/Bandcamp's DMCA tolerance, but expect takedowns. For legal sample clearance, services like Tracklib license stems directly.
No single tool wins every track. Most professional producers A/B 2–3 separators on a given song: Unmix for browser speed, MVSep or Demucs for desktop quality, and LALAL.AI or Moises for paid accuracy. Spleeter-family models (what Unmix uses) are fastest but leave more artifacts than MDX-Net models. If one tool gives a thin vocal, another might give cleaner dynamics — compare before committing to cleanup.
Drum bleed usually lives below 300 Hz and in the 2–4 kHz transient range. Use a surgical high-pass at 100–200 Hz, then spectral gating in iZotope RX (Spectral De-noise with a 'learn' on a clean vocal passage). For stubborn snare bleed, manually paint it out with RX Spectral Repair. Budget 15–30 minutes per track — full drum removal from a vocal stem is never clean, only 'good enough.'
Remove vocals from any song and keep the instrumental.
Create backing tracks for singing and practice.
Split a song into individual instrument stems.
Remove drums, bass, or piano from a track.
Extract and separate audio from video files.
Full 5-stem separation on iOS, Android, and Mac.