Coqui
Coqui is an open-source deep learning toolkit for text-to-speech synthesis, offering pretrained models, voice cloning, and tools for training custom TTS models.
Soundful
AI music generation platform that produces unique, royalty-free tracks built on ethically trained models guided by real producers.
PodcastAdBlock
Podcast app that automatically skips ads and intros, generates AI summaries, and builds a smart listening queue based on your habits.
Audio Strip
Online vocal isolation tool that strips the vocals or the backing track from any song directly in your browser for free.
VocalRemover
Web-based AI tool that separates vocals and instruments from audio tracks, with extra utilities for cutting, pitch shifting, and BPM detection.
OptimizerAI
AI sound generation platform that creates stereo, studio-quality audio effects from text descriptions or uploaded audio samples.
Descript
Video and podcast editor that lets you cut and rewrite media by editing a text transcript rather than a timeline.
ElevenLabs Featured
Voice AI platform for generating realistic text-to-speech, cloning voices, and dubbing audio and video in multiple languages.
Suno Featured
AI music generator that turns a text description into a complete original song with vocals, lyrics, and full production.
Audio Converter
Online audio and video transcription service that converts recordings into accurate, timestamped text with speaker recognition in 200-plus languages.
Murf AI
AI voice generation platform offering 200-plus studio-quality voices for voiceovers, dubbing, and real-time voice agents in 35-plus languages.
Adobe Podcast
Browser-based AI audio suite from Adobe that cleans up speech recordings and adds studio-quality sound to voice content.
Speechify
AI-powered text-to-speech app that converts documents, PDFs, and web pages into natural audio across 60-plus languages.
AssemblyAI
Speech AI API platform for developers that transcribes audio, understands spoken content, and enables voice agent applications.
Udio
AI music creation tool that generates original songs from text prompts, with plans for casual listeners and professional producers.
Krisp
Voice AI platform that removes background noise, transcribes meetings, and converts accents in real time during calls.
AIVA
AI music composition tool that generates original scored pieces across 250-plus styles from a text or audio influence in seconds.
Whisper-web
Browser-based speech recognition tool that runs OpenAI Whisper entirely in the browser with no server required.
Google Magenta Featured
Google's open-source research platform for exploring machine learning in music and art creation.
Mubert Featured
AI music generator that creates original royalty-free soundtracks for video, podcasts, games, and apps in real time.
Hailuo AI TTS
Text-to-speech tool from MiniMax (Hailuo) that generates expressive, human-sounding voice in over 70 languages with voice cloning support.
Soundraw
Royalty-free AI music generator that creates and edits original tracks by blending genres, with STEM downloads and commercial licensing included.
Coqui TTS
Open-source deep learning toolkit for text-to-speech with over 1,100 pretrained models, voice cloning, and multi-language support.
UniMusic AI
AI music generator that turns text prompts and lyrics into complete songs with vocals and full arrangements in under two minutes.
Resemble AI
Voice AI platform for creating synthetic voices, cloning real voices, and detecting audio and video deepfakes.
Altered
Professional AI voice transformation platform for real-time accent conversion, voice cloning, and post-production audio editing.
Podcastle
AI content creation platform for recording, editing, and publishing audio and video, formerly known as Podcastle.
Poison Pill
Poison Pill embeds imperceptible adversarial noise into music files to prevent AI models from using those tracks as training data without consent.
Cleanvoice AI
AI podcast editor that automatically removes filler words, silences, and background noise from audio and video recordings.
Riffusion
AI music generator that turns text prompts into original, royalty-free songs spanning any genre, with no musical training required.
Loudly Featured
AI music creation platform for generating, remixing, and distributing original tracks, with a royalty-free catalog and text-to-music tools.
EmotiVoice
Open-source TTS engine from Netease Youdao with over 2,000 voices and built-in emotional speech synthesis in English and Chinese.
WhisperX
Enhanced Whisper transcription library that adds word-level timestamps, speaker diarization, and fast batched inference.
Buzz
Desktop transcription and translation app powered by OpenAI Whisper that works offline on audio files and live mic input.
Endel
Personalized soundscape app that generates adaptive audio for focus, relaxation, sleep, and movement based on your environment and habits.
StarVoiceAi
Celebrity AI voice generator that creates synthetic speech in the style of famous voices for content creators and entertainment projects.
Ecrett Music
AI music composition tool that generates royalty-free background tracks for videos, games, and podcasts by selecting scene, mood, and genre.
AI Mastering
Online audio mastering service that automatically enhances uploaded tracks to reach commercial loudness and clarity standards.
Boomy
AI music creation platform that lets anyone generate a full song in seconds and release it directly to Spotify, Apple Music, and 40-plus streaming services.
Emergent Drums
AI-powered drum sample generator from Audialab that creates unique, royalty-free drum sounds without using any recorded human audio.
Modulate Featured
Voice AI platform for enterprises that detects fraud, deepfakes, compliance violations, and behavioral signals directly from audio in real time.
Vibe Transcribe
Vibe is a free, open-source desktop app that transcribes audio and video files completely offline using OpenAI's Whisper model.
Musico Featured
AI music generation platform that produces editable MIDI compositions from emotional and visual inputs, with full DAW integration and on-device processing.
iSpeech
Text-to-speech and speech recognition API platform used by developers to add voice capabilities to apps and websites.
Respeecher
AI voice cloning and text-to-speech platform built for broadcast-quality audio in film, gaming, and media production.
YobiYoba
Pay-as-you-go speech-to-text service that transcribes audio and video files into time-coded, editable transcripts.
Orb Composer
DAW plugin from LANDR that generates chord progressions, melodies, basslines, and arpeggios to help musicians break through creative blocks.
PlaylistAI
AI app that builds personalized Spotify and Apple Music playlists from text prompts, festival posters, mood cues, and listening history.
Mureka
AI music generator that turns text prompts into original, royalty-free tracks across any genre with vocal and stem export options.
EKHOS AI
Offline transcription software that converts audio and video to text locally on your device with no data sent to the cloud.
Listnr
Listnr is a text-to-speech platform that turns written content into natural-sounding audio using over 1,000 AI voices across 142+ languages.
ElevenLabs Sound Effects
AI sound effect generator from ElevenLabs that turns text descriptions into royalty-free audio samples within seconds.
FakeYou
Text-to-speech platform with a library of thousands of community-built voice models covering characters, celebrities, and custom voices.
Jamorphosia
Browser-based AI audio stem splitter that separates vocals, guitar, bass, drums, and piano from any uploaded track for remixing or practice.
Lovo.ai
AI voiceover and video creation platform with 500-plus voices across 100 languages and an integrated online video editor.
CustomPod Featured
AI briefing app that turns your chosen news sources, RSS feeds, and newsletters into personalized daily podcast episodes.
Musicfy
AI voice and music tool that lets you clone voices, cover songs, and generate original tracks without any musical background.
MuseGen
AI music studio that generates full-length songs with vocals, lyrics, and mastering from a simple text description.
Descript Overdub
AI voice regeneration feature inside Descript that fixes audio mismatches, removes background noise, and enhances weak takes.
Beatoven.ai
AI music generation platform that creates original background tracks and sound effects from a text description, with commercial licensing included.
SFX Engine
Credit-based AI sound effect generator with studio-quality output, commercial licensing, and API access for professional audio workflows.
Remusic
Remusic is an AI music platform for generating original songs, separating vocals from tracks, creating covers with cloned voices, and producing karaoke videos.
Voicemod
Real-time AI voice changer and soundboard for gamers and streamers that transforms your voice across Discord, Twitch, and 50+ platforms.
ChatTTS
Open-source conversational text-to-speech model optimized for dialogue, with fine-grained prosody control including laughter and pauses.
WellSaid
WellSaid is an AI voiceover platform that converts scripts into professional-quality audio using studio-grade AI voices built from licensed recordings.
Podcast Maker
Turns blog posts, YouTube videos, web pages, and documents into narrated podcast episodes with cloned or AI voices in around ten minutes.
So-vits-svc
Archived open-source singing voice conversion tool that transforms one singer's voice into another using SoftVC and VITS technology.
WhisperDesktop
Windows desktop app that uses GPU acceleration to transcribe audio files and live mic input with OpenAI Whisper.
Eleven Labs
ElevenLabs is an AI audio platform for generating realistic text-to-speech, voice cloning, music, sound effects, and conversational voice agents.
MuzicGenerator
AI music generator that produces full songs with vocals, lyrics, and instrumentation from a text description in minutes.
Veritone Voice
Enterprise AI voice cloning platform that creates licensed synthetic voices for broadcasters, rights holders, and media companies.
Play.ht
AI text-to-speech platform offering over 800 voices in 142 languages, voice cloning, and multi-speaker audio generation for content creators and developers.