📖 The AI Tool Bible

Best AI tools for noise reduction

48 tools in the Audio category, filtered to noise reduction.

All Audio

ElevenLabs

Featured
Audio · ElevenLabs Multilingual v2
9.4

The gold standard for AI voice cloning and TTS.

Freemium· Free 10k chars/mo; from $5/mo Starter; up to $1320/mo ScaleTTSvoice cloning

Suno

Featured
Audio · Suno v4
9.2

Text-to-song AI — full vocal tracks from a prompt.

Freemium· Free credits; Pro $10/mo; Premier $30/mosongwritingdemos

Udio

Audio · Udio (proprietary)
8.8

Suno's main rival for AI-generated full songs.

Freemium· Free; Standard $10/mo; Pro $30/mofull songsmusic demos

AssemblyAI

Audio · Universal / Slam-1
8.7

Speech-to-text API with diarisation, summarisation, and topic detection.

Freemium· Free credits; pay-per-use from $0.37/hrtranscriptiondiarisation

Whisper

Audio · Whisper large-v3
8.6

OpenAI's open-source speech-to-text — the de-facto baseline.

Free· Free open weights; $0.006/min via OpenAI APItranscriptionself-hosted

Resemble.ai

Audio · Resemble v2 / Localize
8.0

Enterprise voice cloning with deepfake-detection layer.

Paid· From $19/mo Creator; enterprise customenterprise voice cloningcompliance

Murf

Audio · Murf Gen2
7.8

TTS aimed at corporate voiceover and e-learning.

Freemium· Free preview; from $19/mo Creator; $66/mo Businessvoiceovere-learning

AI Song Maker

Audio · Multi-model (ACE-Step, MusicGen, DiffRhythm, Riffusion)

Browser-based song generator that wraps multiple open music models behind a single freemium UI.

Freemium· Free: 4 songs/day anon, 20 credits/day signed-in; paid tier for moretext-to-songlyrics-generation

AIVA

Audio · Proprietary (AIVA)

AI music composition tool that generates royalty-friendly tracks in 250+ styles with editable MIDI output.

Freemium· Free; Standard ~€11/mo, Pro ~€33/mo (billed yearly)music-generationsoundtrack-composition

AInterview

Audio

AI host that interviews you and turns the conversation into a finished podcast.

Freemium· Free 10 min/mo; Premium $19/mo (2 hrs); pay-as-you-go $0.13-$0.20/minai-podcast-hostsolo-podcasting

Audify AI

Audio · OpenAI TTS (tts-1, tts-1-hd, gpt-4o-mini-tts)

Pay-as-you-go web wrapper around OpenAI's text-to-speech voices.

Freemium· BYO OpenAI key (free); or top up from $2 pay-per-usetext-to-speechvoiceover

AudioCraft

Audio · MusicGen, AudioGen, EnCodec

Meta's open-source research toolkit for generating music and sound effects from text via a single autoregressive language model.

Free· Free and open source; self-hostedtext-to-musicsound-effects

Azure AI Speech (Neural TTS)

Audio · Azure Neural TTS (plus HD and Azure OpenAI voices)

Microsoft's enterprise-grade neural text-to-speech with 100+ languages, custom brand voices, and SSML control.

Freemium· Free tier (0.5M chars/mo neural); pay-as-you-go per character thereaftertext-to-speechvoice-cloning

Beatoven.ai

Audio · Maestro (proprietary)

Text-to-music generator that spits out royalty-free background tracks with a clean licensing story.

Freemium· Pay-per-track or subscription for download minutesbackground-musicsound-effects

Boomy

Audio · Proprietary (undisclosed)

Generative AI music maker that lets anyone produce a song in under a minute and push it to Spotify.

Freemium· Free tier; Creator ~$9.99/mo; Pro ~$29.99/moai-music-generationsong-creation

CustomPod

Audio

Turns your chosen news sources, RSS feeds, and inboxes into a personalized daily AI podcast.

Freemium· Free tier (manual generation); Pro $4.99/mopersonal podcastnews briefing

Deepgram

Audio · Nova, Flux, Speak (proprietary)

Production-grade speech-to-text, text-to-speech, and voice-agent APIs for real-time and batch audio.

Freemium· Free credits on signup; usage-based pricing; enterprise contracts availablespeech-to-texttext-to-speech

Dia

Audio · Dia-1.6B

Open-weights 1.6B text-to-dialogue model that generates ultra-realistic multi-speaker conversations in one pass.

Free· Free, open weights (Apache 2.0); hosted larger version waitlisteddialogue-generationvoice-cloning

EKHOS AI

Audio · Proprietary local models

Offline Windows transcription app with speaker diarization, GPU acceleration, and 98-language support.

Freemium· Free tier; Premium $9/motranscriptionspeaker-diarization

Ecrett Music

Audio

AI background-music generator that spits out royalty-free instrumental tracks by scene, mood, and genre.

Freemium· Free preview tier; Individual $4.99/mo annual ($7.99 monthly); Business $14.99/mo annual ($24.99 monthly)background-musicyoutube-soundtracks

ElevenLabs Conversational AI

Audio · ElevenLabs Scribe (ASR) + pluggable LLM + ElevenLabs TTS

Production-grade voice agent platform layering ElevenLabs TTS, ASR, and LLM orchestration into a single deployable stack.

Paid· Usage-based; contact sales for enterprisevoice-agentsivr-replacement

Fireflies.ai

Audio · Multi-model (proprietary ASR + LLM layer)

AI meeting assistant that joins calls, transcribes them, and turns the talk into searchable notes and action items.

Freemium· Free tier; paid plans roughly $10-$39/user/mo, Enterprise on requestmeeting transcriptioncall summaries

Harmonai

Audio · Dance Diffusion / Stable Audio family

Open-source generative audio lab from Stability AI building diffusion models for music production.

Free· Free open-source models and code; no hosted product on this sitemusic-generationsound-design

Hume AI

Audio · Octave, EVI, TADA

Emotionally intelligent voice AI with expressive TTS, speech-to-speech, and human-feedback evaluation APIs.

Freemiumexpressive-ttsvoice-cloning

LOVO AI

Audio · Proprietary (LOVO Pro V2 voices)

Text-to-speech and voice cloning platform with 500+ voices, an integrated video editor, and a developer API.

Freemium· 14-day free Pro trial, no credit card; paid subscription tierstext-to-speechvoice-cloning

Loudly

Audio · Proprietary Loudly AI

AI music generator with royalty-free output, stem splitting, and distribution to Spotify and friends.

Freemium· Free tier; paid plans on /music/pricingtext-to-musicroyalty-free background music

MockingBird

Audio · GE2E + Tacotron + HiFi-GAN/WaveRNN/Fre-GAN

Open-source Mandarin-first voice cloning that mimics a speaker from a 5-second sample.

Free· Free, open source (MIT)voice-cloningtext-to-speech

Mubert

Audio · Proprietary sample-based generative engine

AI music generator that spits out royalty-free background tracks for video, podcast, and app use.

Freemium· Free tier; paid plans for commercial use; API via sales demobackground-musicroyalty-free-soundtracks

Murf AI

Audio · Murf Gen2 / Murf Falcon

Studio-grade text-to-speech and real-time voice agents with 200+ voices across 35+ languages.

Freemium· Free Studio (10 min/mo); paid plans + API at ~$0.01/min (Falcon)text-to-speechvoice-cloning

Otter.ai

Audio · Proprietary speech + LLM stack

AI meeting notetaker that transcribes calls, summarizes them, and pulls out action items in real time.

Freemium· Free Basic; Business $19.99/user/mo; Enterprise custommeeting-transcriptionmeeting-summaries

Read AI

Audio · Multi-model

AI meeting copilot that transcribes, summarizes, and surfaces action items across Zoom, Meet, and Teams.

Freemium· Free (5 meetings/mo); paid tiers + Enterprisemeeting-transcriptionmeeting-summaries

Remusic

Audio · Remusic V4 Pro (proprietary)

All-in-one AI music studio that bundles song generation, voice cloning, stem splitting, and karaoke tools.

Freemium· Free daily credits; Starter $49/yr, Basic $94/yr, Pro $249/yrtext-to-musicvoice-cloning

Respeecher

Audio · Proprietary Respeecher voice models

Studio-grade AI voice cloning and TTS used by Hollywood productions for speech-to-speech and dubbing work.

Freemium· Free trial; TTS API $2/hour pay-as-you-go; custom enterprise pricing for voice cloningvoice-cloningtext-to-speech

Scribbl

Audio

Bot-free AI meeting recorder, transcriber, and summarizer for Google Meet.

Freemium· Free: 15 meetings/month; paid team plans for shared libraries and CRM integrationsmeeting-transcriptionmeeting-summaries

Sesame

Audio · Sesame CSM (1B / 3B / 8B)

Conversational voice AI aiming to cross the uncanny valley with context-aware, emotionally aware speech.

Free· Free research preview; consumer product pricing not announcedconversational-voicetext-to-speech

Soundful

Audio · Proprietary (human-aided AI)

Template-driven AI music generator that spits out royalty-free, commercially licensable tracks in seconds.

Freemium· Free tier; Plus/Pro/Business monthly per-user; Enterprise on requestbackground-musiccontent-creator-audio

Soundraw

Audio · Proprietary in-house model

AI music generator that spits out royalty-free, customizable tracks by genre and mood.

Freemium· Free trial; Creator $5.99/mo; Artist Pro $12.59/mo; Unlimited $17.49/mo; Enterprise custombackground musicvideo soundtracks

Stable Audio

Audio · Stable Audio 3.0 (Large/Medium/Small/Small SFX)

Stability AI's generative audio model family for music and sound effects, with open weights for the smaller variants.

Freemium· Free web app tier; API metered; enterprise licensing for Large modelmusic-generationsound-effects

Transgate

Audio · Multi-model speech-to-text

Pay-as-you-go AI transcription and translation with summaries, highlights, and chat over your audio.

Freemium· Free 20-minute trial; pay-as-you-go credit packstranscriptiontranslation

Veritone Voice

Audio · Proprietary (Veritone aiWARE)

Enterprise-grade voice cloning and synthesis platform built for broadcasters, studios, and large media operations.

Enterprise· Contact sales / demo onlyvoice-cloningtext-to-speech

Vibe

Audio · OpenAI Whisper (via whisper.cpp)

Offline desktop transcription app powered by Whisper, with diarization, batch processing, and an HTTP API.

Free· Free and open-source (MIT)transcriptionsubtitles

Voicebox

Audio · Multi-model (Chatterbox, Qwen TTS, Whisper, etc.)

Open-source desktop voice studio for local cloning, dictation, and giving MCP agents a voice.

Free· Free and open source; optional $VOICEBOX token donationsvoice-cloningtext-to-speech

WellSaid

Audio · Proprietary WellSaid TTS

Enterprise-grade AI text-to-speech built on licensed voice actor recordings.

Freemium· Free trial; paid plans for teams and enterprise (contact sales for API)text-to-speeche-learning narration

WellSaid Labs

Audio · Proprietary WellSaid TTS (closed model)

Enterprise AI text-to-speech studio built on licensed voice-actor recordings, with a director-style editor for pacing and pronunciation.

Paid· Subscription plans (Maker/Team/Enterprise); free trial availablee-learning narrationcorporate training

WhisperAPI

Audio · OpenAI Whisper

Hosted OpenAI Whisper transcription with a pay-as-you-go API and drop-in web dashboard.

Paid· Pay-as-you-go credits; $5 for 20 credits, down to ~$0.10/credit in bulkaudio-transcriptionvideo-subtitles

Wispr Flow

Audio

System-wide voice-to-text dictation that auto-edits filler words and learns your jargon.

Freemium· 14-day Pro trial; paid individual and enterprise plansdictationvoice-to-text

ZenMic

Audio

Text-to-podcast generator with multi-speaker AI voices and RSS publishing.

Freemium· Free 10 min trial; $19/mo or $99/yr (100 min/mo)text-to-podcastcontent-repurposing

iSpeech

Audio

Veteran cloud TTS and speech recognition API with broad SDK and language coverage.

Freemium· Free mobile SDK for non-revenue apps; ~$0.0001-$0.05 per word/transactiontext-to-speechspeech-recognition