SAM Audio is Metaβs new model that lets you pull a single sound out of a messy mix with a prompt. You can type what you want to isolate, click on an object in a video, or mark a time span, and it separates that voice, instrument, or noise without training a custom model for each case. Itβs open source, runs in the Segment Anything Playground, and comes in multiple sizes you can download for your own workflows.
π₯ Our Take: This goes straight at one of the most annoying parts of editing: trying to rescue one sound from a chaotic recording. Instead of hunting through plug-ins and hoping a preset works, you just tell it what you want and get a clean stem back. Creators are going to love this, and so are anyone doing podcasts, film, TikToks, or research with real-world audio that used to be basically unfixable.