xAI Launches Grok Voice API: $0.10/hr STT, $4.20/M TTS
xAI launched the Grok Voice Agent API April 18 with standalone speech-to-text and text-to-speech endpoints priced at $0.10 per hour batch STT and $4.20 per million characters TTS.
xAI launched the Grok Voice Agent API April 18 with standalone speech-to-text and text-to-speech endpoints priced at $0.10 per hour batch STT and $4.20 per million characters TTS.
The AI voice cloning market reaches $4.06 billion in 2026. We compare ElevenLabs, Voxtral TTS, and Fish Audio S2 on quality, pricing, latency, and self-hosting to help creators choose the right tool.
Mistral Voxtral TTS is a 4B parameter open-weights model that matches ElevenLabs quality in human evaluations, with 3-second voice cloning across 9 languages and self-hosting support.