Speech & Audio
Text-to-speech, speech-to-text, and audio processing APIs
14 tools
AssemblyAI
FreemiumSpeech-to-text and audio intelligence API with transcription, summarization, sentiment analysis, and topic detection.
Speech & AudioAzure AI Speech
FreemiumCloud-based speech AI service providing speech-to-text, text-to-speech, speech translation, and speaker recognition APIs...
Speech & AudioCartesia
FreemiumReal-time text-to-speech API with ultra-low latency voice generation, voice cloning, and streaming audio for voice agent...
Speech & AudioDeepgram
FreemiumReal-time and batch speech-to-text API with state-of-the-art ASR models, speaker diarization, and voice AI features.
Speech & AudioElevenLabs
FreemiumAI-powered speech platform offering text-to-speech, speech-to-text, voice cloning, and conversational AI agents. Differe...
Speech & AudioHume AI
FreemiumEmotionally intelligent voice AI platform offering text-to-speech, speech-to-speech, expression measurement, and human e...
Speech & AudioLMNT
FreemiumUltra-low latency text-to-speech API optimized for real-time voice applications and conversational AI agents.
Speech & AudioOpenAI TTS
PaidText-to-speech API with 6 natural voices. HD mode available. Great for audiobook and accessibility use cases.
Speech & AudioOpenAI Whisper API
PaidOpenAI's managed speech-to-text API powered by the Whisper model. Transcribes and translates audio in 99+ languages with...
Speech & AudioPlayHT
FreemiumText-to-speech and voice cloning API with 900+ AI voices across 142 languages and real-time streaming capability.
Speech & AudioResemble AI
FreemiumVoice cloning and AI speech generation platform with real-time voice synthesis, neural TTS, and voice watermarking.
Speech & AudioRev AI
FreemiumEnterprise-grade speech recognition API offering async and streaming transcription with high accuracy across diverse aud...
Speech & AudioSuno
FreemiumAI music generation from text prompts. Create full songs with vocals, instruments, and lyrics.
Speech & AudioWhisper
Open SourceOpen-source automatic speech recognition model by OpenAI trained on 680k hours of multilingual data, available for self-...
Speech & Audio