Best AI tools for noise reduction
48 tools in the Audio category, filtered to noise reduction.
ElevenLabs
FeaturedThe gold standard for AI voice cloning and TTS.
Suno
FeaturedText-to-song AI — full vocal tracks from a prompt.
Udio
Suno's main rival for AI-generated full songs.
AssemblyAI
Speech-to-text API with diarisation, summarisation, and topic detection.
Whisper
OpenAI's open-source speech-to-text — the de-facto baseline.
Resemble.ai
Enterprise voice cloning with deepfake-detection layer.
Murf
TTS aimed at corporate voiceover and e-learning.
AI Song Maker
Browser-based song generator that wraps multiple open music models behind a single freemium UI.
AIVA
AI music composition tool that generates royalty-friendly tracks in 250+ styles with editable MIDI output.
AInterview
AI host that interviews you and turns the conversation into a finished podcast.
Audify AI
Pay-as-you-go web wrapper around OpenAI's text-to-speech voices.
AudioCraft
Meta's open-source research toolkit for generating music and sound effects from text via a single autoregressive language model.
Azure AI Speech (Neural TTS)
Microsoft's enterprise-grade neural text-to-speech with 100+ languages, custom brand voices, and SSML control.
Beatoven.ai
Text-to-music generator that spits out royalty-free background tracks with a clean licensing story.
Boomy
Generative AI music maker that lets anyone produce a song in under a minute and push it to Spotify.
CustomPod
Turns your chosen news sources, RSS feeds, and inboxes into a personalized daily AI podcast.
Deepgram
Production-grade speech-to-text, text-to-speech, and voice-agent APIs for real-time and batch audio.
Dia
Open-weights 1.6B text-to-dialogue model that generates ultra-realistic multi-speaker conversations in one pass.
EKHOS AI
Offline Windows transcription app with speaker diarization, GPU acceleration, and 98-language support.
Ecrett Music
AI background-music generator that spits out royalty-free instrumental tracks by scene, mood, and genre.
ElevenLabs Conversational AI
Production-grade voice agent platform layering ElevenLabs TTS, ASR, and LLM orchestration into a single deployable stack.
Fireflies.ai
AI meeting assistant that joins calls, transcribes them, and turns the talk into searchable notes and action items.
Harmonai
Open-source generative audio lab from Stability AI building diffusion models for music production.
Hume AI
Emotionally intelligent voice AI with expressive TTS, speech-to-speech, and human-feedback evaluation APIs.
LOVO AI
Text-to-speech and voice cloning platform with 500+ voices, an integrated video editor, and a developer API.
Loudly
AI music generator with royalty-free output, stem splitting, and distribution to Spotify and friends.
MockingBird
Open-source Mandarin-first voice cloning that mimics a speaker from a 5-second sample.
Mubert
AI music generator that spits out royalty-free background tracks for video, podcast, and app use.
Murf AI
Studio-grade text-to-speech and real-time voice agents with 200+ voices across 35+ languages.
Otter.ai
AI meeting notetaker that transcribes calls, summarizes them, and pulls out action items in real time.
Read AI
AI meeting copilot that transcribes, summarizes, and surfaces action items across Zoom, Meet, and Teams.
Remusic
All-in-one AI music studio that bundles song generation, voice cloning, stem splitting, and karaoke tools.
Respeecher
Studio-grade AI voice cloning and TTS used by Hollywood productions for speech-to-speech and dubbing work.
Scribbl
Bot-free AI meeting recorder, transcriber, and summarizer for Google Meet.
Sesame
Conversational voice AI aiming to cross the uncanny valley with context-aware, emotionally aware speech.
Soundful
Template-driven AI music generator that spits out royalty-free, commercially licensable tracks in seconds.
Soundraw
AI music generator that spits out royalty-free, customizable tracks by genre and mood.
Stable Audio
Stability AI's generative audio model family for music and sound effects, with open weights for the smaller variants.
Transgate
Pay-as-you-go AI transcription and translation with summaries, highlights, and chat over your audio.
Veritone Voice
Enterprise-grade voice cloning and synthesis platform built for broadcasters, studios, and large media operations.
Vibe
Offline desktop transcription app powered by Whisper, with diarization, batch processing, and an HTTP API.
Voicebox
Open-source desktop voice studio for local cloning, dictation, and giving MCP agents a voice.
WellSaid
Enterprise-grade AI text-to-speech built on licensed voice actor recordings.
WellSaid Labs
Enterprise AI text-to-speech studio built on licensed voice-actor recordings, with a director-style editor for pacing and pronunciation.
WhisperAPI
Hosted OpenAI Whisper transcription with a pay-as-you-go API and drop-in web dashboard.
Wispr Flow
System-wide voice-to-text dictation that auto-edits filler words and learns your jargon.
ZenMic
Text-to-podcast generator with multi-speaker AI voices and RSS publishing.
iSpeech
Veteran cloud TTS and speech recognition API with broad SDK and language coverage.