Categories

Audio & Music

AI tools for audio processing and music generation

72 tools

Coqui screenshot

Coqui

Coqui is an open-source deep learning toolkit for text-to-speech synthesis, offering pretrained models, voice cloning, and tools for training custom TTS models.

Soundful screenshot

Soundful

AI music generation platform that produces unique, royalty-free tracks built on ethically trained models guided by real producers.

PodcastAdBlock screenshot

PodcastAdBlock

Podcast app that automatically skips ads and intros, generates AI summaries, and builds a smart listening queue based on your habits.

Audio Strip screenshot

Audio Strip

Online vocal isolation tool that strips the vocals or the backing track from any song directly in your browser for free.

VocalRemover screenshot

VocalRemover

Web-based AI tool that separates vocals and instruments from audio tracks, with extra utilities for cutting, pitch shifting, and BPM detection.

OptimizerAI screenshot

OptimizerAI

AI sound generation platform that creates stereo, studio-quality audio effects from text descriptions or uploaded audio samples.

Descript screenshot

Descript

Video and podcast editor that lets you cut and rewrite media by editing a text transcript rather than a timeline.

ElevenLabs screenshot

ElevenLabs Featured

Voice AI platform for generating realistic text-to-speech, cloning voices, and dubbing audio and video in multiple languages.

Suno screenshot

Suno Featured

AI music generator that turns a text description into a complete original song with vocals, lyrics, and full production.

Audio Converter screenshot

Audio Converter

Online audio and video transcription service that converts recordings into accurate, timestamped text with speaker recognition in 200-plus languages.

Murf AI screenshot

Murf AI

AI voice generation platform offering 200-plus studio-quality voices for voiceovers, dubbing, and real-time voice agents in 35-plus languages.

Adobe Podcast screenshot

Adobe Podcast

Browser-based AI audio suite from Adobe that cleans up speech recordings and adds studio-quality sound to voice content.

Speechify screenshot

Speechify

AI-powered text-to-speech app that converts documents, PDFs, and web pages into natural audio across 60-plus languages.

AssemblyAI screenshot

AssemblyAI

Speech AI API platform for developers that transcribes audio, understands spoken content, and enables voice agent applications.

Udio screenshot

Udio

AI music creation tool that generates original songs from text prompts, with plans for casual listeners and professional producers.

Krisp screenshot

Krisp

Voice AI platform that removes background noise, transcribes meetings, and converts accents in real time during calls.

AIVA screenshot

AIVA

AI music composition tool that generates original scored pieces across 250-plus styles from a text or audio influence in seconds.

Whisper-web screenshot

Whisper-web

Browser-based speech recognition tool that runs OpenAI Whisper entirely in the browser with no server required.

Google Magenta screenshot

Google Magenta Featured

Google's open-source research platform for exploring machine learning in music and art creation.

Mubert screenshot

Mubert Featured

AI music generator that creates original royalty-free soundtracks for video, podcasts, games, and apps in real time.

Hailuo AI TTS screenshot

Hailuo AI TTS

Text-to-speech tool from MiniMax (Hailuo) that generates expressive, human-sounding voice in over 70 languages with voice cloning support.

Soundraw screenshot

Soundraw

Royalty-free AI music generator that creates and edits original tracks by blending genres, with STEM downloads and commercial licensing included.

Coqui TTS screenshot

Coqui TTS

Open-source deep learning toolkit for text-to-speech with over 1,100 pretrained models, voice cloning, and multi-language support.

UniMusic AI screenshot

UniMusic AI

AI music generator that turns text prompts and lyrics into complete songs with vocals and full arrangements in under two minutes.

Resemble AI screenshot

Resemble AI

Voice AI platform for creating synthetic voices, cloning real voices, and detecting audio and video deepfakes.

Altered screenshot

Altered

Professional AI voice transformation platform for real-time accent conversion, voice cloning, and post-production audio editing.

Podcastle screenshot

Podcastle

AI content creation platform for recording, editing, and publishing audio and video, formerly known as Podcastle.

Poison Pill screenshot

Poison Pill

Poison Pill embeds imperceptible adversarial noise into music files to prevent AI models from using those tracks as training data without consent.

Cleanvoice AI screenshot

Cleanvoice AI

AI podcast editor that automatically removes filler words, silences, and background noise from audio and video recordings.

Riffusion screenshot

Riffusion

AI music generator that turns text prompts into original, royalty-free songs spanning any genre, with no musical training required.

Loudly screenshot

Loudly Featured

AI music creation platform for generating, remixing, and distributing original tracks, with a royalty-free catalog and text-to-music tools.

EmotiVoice screenshot

EmotiVoice

Open-source TTS engine from Netease Youdao with over 2,000 voices and built-in emotional speech synthesis in English and Chinese.

WhisperX screenshot

WhisperX

Enhanced Whisper transcription library that adds word-level timestamps, speaker diarization, and fast batched inference.

Buzz screenshot

Buzz

Desktop transcription and translation app powered by OpenAI Whisper that works offline on audio files and live mic input.

Endel screenshot

Endel

Personalized soundscape app that generates adaptive audio for focus, relaxation, sleep, and movement based on your environment and habits.

StarVoiceAi screenshot

StarVoiceAi

Celebrity AI voice generator that creates synthetic speech in the style of famous voices for content creators and entertainment projects.

Ecrett Music screenshot

Ecrett Music

AI music composition tool that generates royalty-free background tracks for videos, games, and podcasts by selecting scene, mood, and genre.

AI Mastering screenshot

AI Mastering

Online audio mastering service that automatically enhances uploaded tracks to reach commercial loudness and clarity standards.

Boomy screenshot

Boomy

AI music creation platform that lets anyone generate a full song in seconds and release it directly to Spotify, Apple Music, and 40-plus streaming services.

Emergent Drums screenshot

Emergent Drums

AI-powered drum sample generator from Audialab that creates unique, royalty-free drum sounds without using any recorded human audio.

Modulate screenshot

Modulate Featured

Voice AI platform for enterprises that detects fraud, deepfakes, compliance violations, and behavioral signals directly from audio in real time.

Vibe Transcribe screenshot

Vibe Transcribe

Vibe is a free, open-source desktop app that transcribes audio and video files completely offline using OpenAI's Whisper model.

Musico screenshot

Musico Featured

AI music generation platform that produces editable MIDI compositions from emotional and visual inputs, with full DAW integration and on-device processing.

iSpeech screenshot

iSpeech

Text-to-speech and speech recognition API platform used by developers to add voice capabilities to apps and websites.

Respeecher screenshot

Respeecher

AI voice cloning and text-to-speech platform built for broadcast-quality audio in film, gaming, and media production.

YobiYoba screenshot

YobiYoba

Pay-as-you-go speech-to-text service that transcribes audio and video files into time-coded, editable transcripts.

Orb Composer screenshot

Orb Composer

DAW plugin from LANDR that generates chord progressions, melodies, basslines, and arpeggios to help musicians break through creative blocks.

PlaylistAI screenshot

PlaylistAI

AI app that builds personalized Spotify and Apple Music playlists from text prompts, festival posters, mood cues, and listening history.

Mureka screenshot

Mureka

AI music generator that turns text prompts into original, royalty-free tracks across any genre with vocal and stem export options.

EKHOS AI screenshot

EKHOS AI

Offline transcription software that converts audio and video to text locally on your device with no data sent to the cloud.

Listnr screenshot

Listnr

Listnr is a text-to-speech platform that turns written content into natural-sounding audio using over 1,000 AI voices across 142+ languages.

ElevenLabs Sound Effects screenshot

ElevenLabs Sound Effects

AI sound effect generator from ElevenLabs that turns text descriptions into royalty-free audio samples within seconds.

FakeYou screenshot

FakeYou

Text-to-speech platform with a library of thousands of community-built voice models covering characters, celebrities, and custom voices.

Jamorphosia screenshot

Jamorphosia

Browser-based AI audio stem splitter that separates vocals, guitar, bass, drums, and piano from any uploaded track for remixing or practice.

Lovo.ai screenshot

Lovo.ai

AI voiceover and video creation platform with 500-plus voices across 100 languages and an integrated online video editor.

CustomPod screenshot

CustomPod Featured

AI briefing app that turns your chosen news sources, RSS feeds, and newsletters into personalized daily podcast episodes.

Musicfy screenshot

Musicfy

AI voice and music tool that lets you clone voices, cover songs, and generate original tracks without any musical background.

MuseGen screenshot

MuseGen

AI music studio that generates full-length songs with vocals, lyrics, and mastering from a simple text description.

Descript Overdub screenshot

Descript Overdub

AI voice regeneration feature inside Descript that fixes audio mismatches, removes background noise, and enhances weak takes.

Beatoven.ai screenshot

Beatoven.ai

AI music generation platform that creates original background tracks and sound effects from a text description, with commercial licensing included.

SFX Engine screenshot

SFX Engine

Credit-based AI sound effect generator with studio-quality output, commercial licensing, and API access for professional audio workflows.

Remusic screenshot

Remusic

Remusic is an AI music platform for generating original songs, separating vocals from tracks, creating covers with cloned voices, and producing karaoke videos.

Voicemod screenshot

Voicemod

Real-time AI voice changer and soundboard for gamers and streamers that transforms your voice across Discord, Twitch, and 50+ platforms.

ChatTTS screenshot

ChatTTS

Open-source conversational text-to-speech model optimized for dialogue, with fine-grained prosody control including laughter and pauses.

WellSaid screenshot

WellSaid

WellSaid is an AI voiceover platform that converts scripts into professional-quality audio using studio-grade AI voices built from licensed recordings.

Podcast Maker screenshot

Podcast Maker

Turns blog posts, YouTube videos, web pages, and documents into narrated podcast episodes with cloned or AI voices in around ten minutes.

So-vits-svc screenshot

So-vits-svc

Archived open-source singing voice conversion tool that transforms one singer's voice into another using SoftVC and VITS technology.

WhisperDesktop screenshot

WhisperDesktop

Windows desktop app that uses GPU acceleration to transcribe audio files and live mic input with OpenAI Whisper.

Eleven Labs screenshot

Eleven Labs

ElevenLabs is an AI audio platform for generating realistic text-to-speech, voice cloning, music, sound effects, and conversational voice agents.

MuzicGenerator screenshot

MuzicGenerator

AI music generator that produces full songs with vocals, lyrics, and instrumentation from a text description in minutes.

Veritone Voice screenshot

Veritone Voice

Enterprise AI voice cloning platform that creates licensed synthetic voices for broadcasters, rights holders, and media companies.

Play.ht

AI text-to-speech platform offering over 800 voices in 142 languages, voice cloning, and multi-speaker audio generation for content creators and developers.