Harmonai

Open-source generative audio lab from Stability AI building diffusion models for music production.

Free · Free open-source models and code; no hosted product on this site · Audio · Dance Diffusion / Stable Audio family

Best for

Pick Harmonai if you're a producer or ML engineer who wants to run open-source audio diffusion models locally and fine-tune them on your own samples.

Skip if

Skip it if you want a polished, hosted text-to-music app with a UI and account; use Stable Audio or Suno instead.

Harmonai is a Stability AI research lab that develops and releases open-source generative audio models aimed at music producers and sound designers. The group is best known for shipping Dance Diffusion, an early diffusion-based music generator, and for contributing to the Stable Audio family of text-to-audio models. Its work spans raw waveform generation, custom sample libraries, and experimental tools for producing an open-ended supply of royalty-free sound material.

It's not a polished SaaS product with a billing page; it's a lab. The website is a portal pointing to a GitHub org and a Discord community where models, training code, and Colab notebooks are released. That makes Harmonai a fit for technically inclined musicians, ML researchers, and audio tool builders who want to run or fine-tune models locally, rather than for consumers looking for a one-click music generator.

If you want a hosted product layer on top of similar tech, Stability AI's Stable Audio service is the commercial sibling. Harmonai itself is the upstream research and open-weights side of that pipeline.

Editor's take

Harmonai is the research wellspring behind a lot of modern open audio generation, and the open weights matter. Just don't expect a product; expect a GitHub org and a Discord. For the right user that's the appeal, not a flaw.

— The AI Tool Bible editorial team