Audio & Music AI Tools

Coqui

Coqui is an open-source deep learning toolkit for text-to-speech synthesis, offering pretrained models, voice cloning, and tools for training custom TTS models.

5.0

free

Soundful

AI music generation platform that produces unique, royalty-free tracks built on ethically trained models guided by real producers.

5.0

freemium

PodcastAdBlock

Podcast app that automatically skips ads and intros, generates AI summaries, and builds a smart listening queue based on your habits.

4.9

freemium

Audio Strip

Online vocal isolation tool that strips the vocals or the backing track from any song directly in your browser for free.

4.9

free

VocalRemover

Web-based AI tool that separates vocals and instruments from audio tracks, with extra utilities for cutting, pitch shifting, and BPM detection.

4.8

freemium

OptimizerAI

AI sound generation platform that creates stereo, studio-quality audio effects from text descriptions or uploaded audio samples.

4.6

contact

Descript

Video and podcast editor that lets you cut and rewrite media by editing a text transcript rather than a timeline.

4.6

freemium

ElevenLabs (Featured)

Voice AI platform for generating realistic text-to-speech, cloning voices, and dubbing audio and video in multiple languages.

4.6

freemium

Suno (Featured)

AI music generator that turns a text description into a complete original song with vocals, lyrics, and full production.

4.6

freemium

Audio Converter

Online audio and video transcription service that converts recordings into accurate, timestamped text with speaker recognition in 200-plus languages.

4.6

freemium

Murf AI

AI voice generation platform offering 200-plus studio-quality voices for voiceovers, dubbing, and real-time voice agents in 35-plus languages.

4.5

freemium

Adobe Podcast

Browser-based AI audio suite from Adobe that cleans up speech recordings and adds studio-quality sound to voice content.

4.5

freemium

Speechify

AI-powered text-to-speech app that converts documents, PDFs, and web pages into natural audio across 60-plus languages.

4.5

freemium

AssemblyAI

Speech AI API platform for developers that transcribes audio, understands spoken content, and enables voice agent applications.

4.5

freemium

Udio

AI music creation tool that generates original songs from text prompts, with plans for casual listeners and professional producers.

4.5

freemium

Krisp

Voice AI platform that removes background noise, transcribes meetings, and converts accents in real time during calls.

4.5

freemium

AIVA

AI music composition tool that generates original scored pieces in 250-plus styles within seconds, guided by a text or audio influence.

4.4

freemium

Whisper-web

Browser-based speech recognition tool that runs OpenAI Whisper entirely in the browser with no server required.

4.4

free

Google Magenta (Featured)

Google's open-source research platform for exploring machine learning in music and art creation.

4.4

free

Mubert (Featured)

AI music generator that creates original royalty-free soundtracks for video, podcasts, games, and apps in real time.

4.3

freemium

Hailuo AI TTS

Text-to-speech tool from MiniMax (Hailuo) that generates expressive, human-sounding voice in over 70 languages with voice cloning support.

4.3

freemium

Soundraw

Royalty-free AI music generator that creates and edits original tracks by blending genres, with stem downloads and commercial licensing included.

4.3

freemium

Coqui TTS

Open-source deep learning toolkit for text-to-speech with over 1,100 pretrained models, voice cloning, and multi-language support.

4.3

free

UniMusic AI

AI music generator that turns text prompts and lyrics into complete songs with vocals and full arrangements in under two minutes.

4.3

freemium

Resemble AI

Voice AI platform for creating synthetic voices, cloning real voices, and detecting audio and video deepfakes.

4.3

freemium

Altered

Professional AI voice transformation platform for real-time accent conversion, voice cloning, and post-production audio editing.

4.3

freemium

Podcastle

AI content creation platform for recording, editing, and publishing audio and video; the product has since been rebranded from the Podcastle name.

4.3

freemium

Poison Pill

Poison Pill embeds imperceptible adversarial noise into music files to prevent AI models from using those tracks as training data without consent.

4.3

contact

Cleanvoice AI

AI podcast editor that automatically removes filler words, silences, and background noise from audio and video recordings.

4.3

freemium

Riffusion

AI music generator that turns text prompts into original, royalty-free songs spanning any genre, with no musical training required.

4.3

free

Loudly (Featured)

AI music creation platform for generating, remixing, and distributing original tracks, with a royalty-free catalog and text-to-music tools.

4.3

freemium

EmotiVoice

Open-source TTS engine from NetEase Youdao with over 2,000 voices and built-in emotional speech synthesis in English and Chinese.

4.2

free

WhisperX

Enhanced Whisper transcription library that adds word-level timestamps, speaker diarization, and fast batched inference.

4.2

free

Buzz

Desktop transcription and translation app powered by OpenAI Whisper that works offline on audio files and live mic input.

4.2

free

Endel

Personalized soundscape app that generates adaptive audio for focus, relaxation, sleep, and movement based on your environment and habits.

4.2

freemium

StarVoiceAi

Celebrity AI voice generator that creates synthetic speech in the style of famous voices for content creators and entertainment projects.

4.2

paid

Ecrett Music

AI music composition tool that generates royalty-free background tracks for videos, games, and podcasts by selecting scene, mood, and genre.

4.2

freemium

AI Mastering

Online audio mastering service that automatically enhances uploaded tracks to reach commercial loudness and clarity standards.

4.2

freemium

Boomy

AI music creation platform that lets anyone generate a full song in seconds and release it directly to Spotify, Apple Music, and 40-plus streaming services.

4.2

freemium

Emergent Drums

AI-powered drum sample generator from Audialab that creates unique, royalty-free drum sounds without using any recorded human audio.

4.2

contact

Modulate (Featured)

Voice AI platform for enterprises that detects fraud, deepfakes, compliance violations, and behavioral signals directly from audio in real time.

4.2

contact

Vibe Transcribe

Vibe is a free, open-source desktop app that transcribes audio and video files completely offline using OpenAI's Whisper model.

4.2

free

Musico (Featured)

AI music generation platform that produces editable MIDI compositions from emotional and visual inputs, with full DAW integration and on-device processing.

4.2

freemium

iSpeech

Text-to-speech and speech recognition API platform used by developers to add voice capabilities to apps and websites.

4.1

freemium

Respeecher

AI voice cloning and text-to-speech platform built for broadcast-quality audio in film, gaming, and media production.

4.1

freemium

YobiYoba

Pay-as-you-go speech-to-text service that transcribes audio and video files into time-coded, editable transcripts.

4.1

contact

Orb Composer

DAW plugin from LANDR that generates chord progressions, melodies, basslines, and arpeggios to help musicians break through creative blocks.

4.1

paid

PlaylistAI

AI app that builds personalized Spotify and Apple Music playlists from text prompts, festival posters, mood cues, and listening history.

4.1

freemium

Mureka

AI music generator that turns text prompts into original, royalty-free tracks across any genre with vocal and stem export options.

4.1

freemium

EKHOS AI

Offline transcription software that converts audio and video to text locally on your device with no data sent to the cloud.

4.1

freemium

Listnr

Listnr is a text-to-speech platform that turns written content into natural-sounding audio using over 1,000 AI voices across 142-plus languages.

4.1

freemium

ElevenLabs Sound Effects

AI sound effect generator from ElevenLabs that turns text descriptions into royalty-free audio samples within seconds.

4.0

freemium

FakeYou

Text-to-speech platform with a library of thousands of community-built voice models covering characters, celebrities, and custom voices.

4.0

freemium

Jamorphosia

Browser-based AI audio stem splitter that separates vocals, guitar, bass, drums, and piano from any uploaded track for remixing or practice.

4.0

freemium

Lovo.ai

AI voiceover and video creation platform with 500-plus voices across 100 languages and an integrated online video editor.

4.0

freemium

CustomPod (Featured)

AI briefing app that turns your chosen news sources, RSS feeds, and newsletters into personalized daily podcast episodes.

4.0

freemium

Musicfy

AI voice and music tool that lets you clone voices, cover songs, and generate original tracks without any musical background.

3.9

freemium

MuseGen

AI music studio that generates full-length songs with vocals, lyrics, and mastering from a simple text description.

3.8

freemium

Descript Overdub

AI voice regeneration feature inside Descript that fixes audio mismatches, removes background noise, and enhances weak takes.

3.8

freemium

Beatoven.ai

AI music generation platform that creates original background tracks and sound effects from a text description, with commercial licensing included.

3.8

freemium

SFX Engine

Credit-based AI sound effect generator with studio-quality output, commercial licensing, and API access for professional audio workflows.

3.7

paid

Remusic

Remusic is an AI music platform for generating original songs, separating vocals from tracks, creating covers with cloned voices, and producing karaoke videos.

3.7

freemium

Voicemod

Real-time AI voice changer and soundboard for gamers and streamers that transforms your voice across Discord, Twitch, and 50-plus platforms.

3.7

freemium

ChatTTS

Open-source conversational text-to-speech model optimized for dialogue, with fine-grained prosody control including laughter and pauses.

3.6

free

WellSaid

WellSaid is an AI voiceover platform that converts scripts into professional-quality audio using studio-grade AI voices built from licensed recordings.

3.6

paid

Podcast Maker

Turns blog posts, YouTube videos, web pages, and documents into narrated podcast episodes with cloned or AI voices in around ten minutes.

3.6

freemium

So-vits-svc

Archived open-source singing voice conversion tool that transforms one singer's voice into another using SoftVC and VITS technology.

3.5

free

WhisperDesktop

Windows desktop app that uses GPU acceleration to transcribe audio files and live mic input with OpenAI Whisper.

3.5

free

ElevenLabs

ElevenLabs is an AI audio platform for generating realistic text-to-speech, voice cloning, music, sound effects, and conversational voice agents.

3.5

freemium

MuzicGenerator

AI music generator that produces full songs with vocals, lyrics, and instrumentation from a text description in minutes.

3.5

freemium

Veritone Voice

Enterprise AI voice cloning platform that creates licensed synthetic voices for broadcasters, rights holders, and media companies.

3.5

contact

Play.ht

AI text-to-speech platform offering over 800 voices in 142 languages, voice cloning, and multi-speaker audio generation for content creators and developers.

0.0

freemium