Supertonic 3: On-Device TTS Speaks 31 Languages
Supertonic 3 is an open-weights, CPU-only TTS engine from Supertone with 31 languages, expression tags, and zero-shot voice cloning.
Supertonic 3 is an open-weights, CPU-only TTS engine from Supertone with 31 languages, expression tags, and zero-shot voice cloning.
OpenReader v3.0 converts PDF, EPUB, DOCX, TXT, and Markdown files into synchronized read-along sessions or exported audiobooks, with multiple TTS providers and Docker deployment.
xAI launched the Grok Voice Agent API April 18 with standalone speech-to-text and text-to-speech endpoints priced at $0.10 per hour batch STT and $4.20 per million characters TTS.
The AI voice cloning market reaches $4.06 billion in 2026. We compare ElevenLabs, Voxtral TTS, and Fish Audio S2 on quality, pricing, latency, and self-hosting to help creators choose the right tool.
Mistral Voxtral TTS is a 4B parameter open-weights model that matches ElevenLabs quality in human evaluations, with 3-second voice cloning across 9 languages and self-hosting support.