Deep Dive
Jun 2, 2026
MAI-Image-2.5 hits No. 3 on Arena and edits past Nano Banana 2. We compare the new MAI Foundry stack against Google, BFL, Ideogram, and ElevenLabs on price, identity preservation, and bundled-stack ROI.
openrouter
Jun 2, 2026
The OpenRouter May 2026 release spotlight covers voice fusion models, new providers, and routing improvements for AI developers.
microsoft
Jun 2, 2026
Microsoft shipped three creator-focused AI models on June 2: MAI-Image-2.5 beats Gemini on Arena benchmarks, MAI-Transcribe-1.5 runs 5x faster with 43-language support, and MAI-Voice-2 clones voices from short audio samples across 15 languages.
News
May 27, 2026
Voice AI lab Sesame opened the iOS preview of its mobile app on May 27, 2026, shipping four distinct voice agents with real-time web search and visual information cards.
AI Tools
May 27, 2026
Anthropic is preparing a significant expansion of Claude's mobile voice mode, with app teardown data revealing 18 new languages coming to the feature.
Deep Dive
May 21, 2026
Wrap an existing chat agent (Claude, GPT, Gemini) with ElevenLabs Speech Engine voice in roughly 30 minutes. Step-by-step tutorial covers Node and Python SDKs, WebRTC token flow, turn detection, interruption, and the upgrade path to ElevenAgents.
Deep Dive
May 17, 2026
Security researchers have found a way to hijack voice AI models using inaudible sounds embedded in ordinary audio clips. AudioHijack achieved 79 to 96 percent success rates across 13 large audio language models.
Deep Dive
May 15, 2026
ComfyUI-DramaBox added LoRA weight injection on May 16, letting creators load custom voice personalities into workflows without model reloads.
OpenAI
May 15, 2026
OpenAI quietly bought voice-cloning startup Weights.gg, dispersed its six-person team across existing groups, and folded the tech into Voice Engine work.
AI Tools
May 14, 2026
Adobe faces a class action lawsuit, filed May 14, 2026, alleging that the company violated Illinois' Biometric Information Privacy Act (BIPA) by using journalists' and podcasters' voices to train AI models without consent.
AI News
May 12, 2026
Meta rolled out Muse Spark voice conversations, @meta.ai Threads mentions, and cross-platform side chats across its consumer apps on May 12.
Deep Dive
May 11, 2026
Mira Murati's Thinking Machines shipped TML-Interaction-Small, a 276B full-duplex voice model that listens and speaks simultaneously, beating GPT-Realtime on latency and interaction quality.
Deep Dive
May 7, 2026
OpenAI released three new voice intelligence models through its Realtime API on May 7, 2026: GPT-Realtime-2, GPT-Realtime-Translate, and GPT-Realtime-Whisper.
Deep Dive
May 5, 2026
ElevenLabs crossed $500 million in ARR on May 5, 2026, announcing BlackRock, Nvidia, Jamie Foxx, and Eva Longoria as new investors. The $11B valuation makes it the most-funded independent voice AI company.
voice-ai
Apr 22, 2026
Xiaomi released MiMo-V2.5 on April 22, an 8B open-weights speech recognizer plus a three-model TTS series with prompt-built voice generation.
Audio
Apr 20, 2026
Jamie Pine's Voicebox brings seven TTS engines and voice cloning to your local machine. Free, open source, and entirely offline.
ChatGPT
Apr 2, 2026
OpenAI has launched ChatGPT Voice Mode on Apple CarPlay, giving drivers hands-free access to AI conversations for advice, brainstorming, and language practice while on the road.
AI
Apr 2, 2026
Microsoft launched three in-house AI models on April 2, reducing its dependence on OpenAI. MAI-Image-2, MAI-Voice-1, and MAI-Transcribe-1 cover image generation, text-to-speech, and speech recognition.
Deep Dive
Mar 14, 2026
Six open-source releases in two weeks just made a full creative pipeline possible without a single subscription. Between March 1 and March 15, 2026, something happened that has never happened before in creative AI: open-source models matched or exceeded commercial tools across every major creative...