Xiaomi MiMo-V2.5 Open-Sources 8B Voice Pipeline
Xiaomi released MiMo-V2.5 on April 22, an 8B open-weights speech recognizer plus a three-model TTS series with prompt-built voice generation.
Jamie Pine's Voicebox brings seven TTS engines and voice cloning to your local machine. Free, open source, and entirely offline.
OpenAI has launched ChatGPT Voice Mode on Apple CarPlay, giving drivers hands-free access to AI conversations for advice, brainstorming, and language practice while on the road.
Microsoft launched three in-house AI models on April 2, reducing its dependence on OpenAI. MAI-Image-2, MAI-Voice-1, and MAI-Transcribe-1 cover image generation, text-to-speech, and speech recognition.
Six open-source releases in two weeks just made a full creative pipeline possible without a single subscription
Between March 1 and March 15, 2026, something happened that has never happened before in creative AI: open-source models matched or exceeded commercial tools across every major creative...