Stable Audio Walked Into Suno's House

Stability AI just dropped a music generator that takes audio input, which is either a direct shot at Suno or a coincidence nobody believes.

Suno has been sitting alone at the table for a while now. Nobody really contested it. The market was, for all practical purposes, conquered — the way a town gets conquered when there's only ever been one restaurant.

Then Stable Audio showed up.

The quality jump is not subtle. If image generation had gone from whatever we had in 2022 directly to something that could pass for a photograph — no DALL-E 2 in between, no SDXL transition period, no gradual embarrassment into competence — that's the vibe here. It skipped the middle.

The thing that matters, though, is the audio input. Suno takes text. Stable Audio takes existing audio and does something with it. I fed it a Suno output to see what would happen. The question of whether that's funny or useful is still open.

Stability has this particular design sensibility — dark, clean, slightly menacing — that feels familiar in a way that makes you trust it more than you should, probably. The color scheme is doing work. It shouldn't matter. It does.

The pattern here is the same one Stability ran with image generation: someone owns the field, Stability arrives, the field gets complicated. Whether "complicated" becomes "replaced" is a different conversation for a different April.

For now, there's a second restaurant. The food is good. The menu accepts your leftovers from the first restaurant, which is either thoughtful or provocative.

Probably both.

Stable Audio Walked Into Suno's House

Counterpoints