voice-ai - Creative AI News

Deep Dive Jun 2, 2026

Microsoft MAI Foundry vs Nano Banana 2, FLUX.2: Compared

MAI-Image-2.5 hits No. 3 on Arena and edits past Nano Banana 2. We compare the new MAI Foundry stack against Google, BFL, Ideogram, and ElevenLabs on price, identity preservation, and bundled-stack ROI.

openrouter Jun 2, 2026

OpenRouter Adds Voice, Model Fusion, and 20 New Models

OpenRouter's May 2026 release spotlight covers voice, model fusion, new providers, and routing improvements for AI developers.

microsoft Jun 2, 2026

Microsoft Releases MAI Image, Voice, and Transcribe Models

Microsoft shipped three creator-focused AI models on June 2: MAI-Image-2.5 beats Gemini on Arena benchmarks, MAI-Transcribe-1.5 runs 5x faster with 43-language support, and MAI-Voice-2 clones voices from short audio samples across 15 languages.

News May 27, 2026

Sesame iOS Preview: 4 Voice Agents, Real-Time Search

Voice AI lab Sesame opened the iOS preview of its mobile app on May 27, 2026, shipping four distinct voice agents with real-time web search and visual information cards.

AI Tools May 27, 2026

Claude Voice Mode Expanding to 18 New Languages

Anthropic is preparing a significant expansion of Claude's mobile voice mode, with app teardown data revealing 18 new languages coming to the feature.

Deep Dive May 21, 2026

ElevenLabs Speech Engine Tutorial: Add Voice in 30 Min

Wrap an existing chat agent (Claude, GPT, Gemini) with ElevenLabs Speech Engine voice in roughly 30 minutes. Step-by-step tutorial covers Node and Python SDKs, WebRTC token flow, turn detection, interruption, and the upgrade path to ElevenAgents.

Deep Dive May 17, 2026

Voice AI Security Flaw: AudioHijack Hits 13 Models

Security researchers have found a way to hijack voice AI models using inaudible sounds embedded in ordinary audio clips. AudioHijack achieved 79 to 96 percent success rates across 13 large audio language models.

Deep Dive May 15, 2026

ComfyUI-DramaBox Adds LoRA Support for Custom Voice Training

ComfyUI-DramaBox added LoRA weight injection on May 16, letting creators load custom voice personalities into workflows without model reloads.

OpenAI May 15, 2026

OpenAI Acquires Weights.gg, Folds Voice-Cloning Team

OpenAI quietly bought voice-cloning startup Weights.gg, dispersed its six-person team across existing groups, and folded the tech into Voice Engine work.

AI Tools May 14, 2026

Adobe Sued Over AI Voice Training Under Illinois Privacy Law

Adobe faces a class action lawsuit, filed May 14, 2026, alleging that the company used journalists' and podcasters' voices to train AI models without consent, in violation of Illinois's Biometric Information Privacy Act (BIPA).

AI News May 12, 2026

Meta AI Voice Mode and @meta.ai Threads Mentions Launch

Meta rolled out Muse Spark voice conversations, @meta.ai Threads mentions, and cross-platform side chats across its consumer apps on May 12.

Deep Dive May 11, 2026

Thinking Machines TML-Interaction: Full-Duplex Voice AI

Mira Murati's Thinking Machines shipped TML-Interaction-Small, a 276B full-duplex voice model that listens and speaks simultaneously, beating GPT-Realtime on latency and interaction quality.

Deep Dive May 7, 2026

OpenAI Realtime API: New Voice Models for Creator Workflows

OpenAI released three new voice intelligence models through its Realtime API on May 7, 2026: GPT-Realtime-2, GPT-Realtime-Translate, and GPT-Realtime-Whisper.

Deep Dive May 5, 2026

ElevenLabs Hits $500M ARR: What It Means for Creators

ElevenLabs crossed $500 million in ARR on May 5, 2026, announcing BlackRock, Nvidia, Jamie Foxx, and Eva Longoria as new investors. The $11B valuation makes it the most-funded independent voice AI company.

voice-ai Apr 22, 2026

Xiaomi MiMo-V2.5 Open-Sources 8B Voice Pipeline

Xiaomi released MiMo-V2.5 on April 22, an 8B open-weights speech recognizer plus a three-model TTS series with prompt-built voice generation.

Audio Apr 20, 2026

Voicebox: Free Open-Source Voice Studio with 7 TTS Engines

Jamie Pine's Voicebox brings seven TTS engines and voice cloning to your local machine. Free, open source, and entirely offline.

ChatGPT Apr 2, 2026

ChatGPT Voice Mode Arrives on Apple CarPlay

OpenAI has launched ChatGPT Voice Mode on Apple CarPlay, giving drivers hands-free access to AI conversations for advice, brainstorming, and language practice while on the road.

AI Apr 2, 2026

Microsoft Launches 3 In-House AI Models, Reducing OpenAI Dependence

Microsoft launched three in-house AI models on April 2, reducing its dependence on OpenAI. MAI-Image-2, MAI-Voice-1, and MAI-Transcribe-1 cover image generation, text-to-speech, and speech recognition.

Deep Dive Mar 14, 2026

Open-Source Creative AI Catches Up in March 2026

Six open-source releases in two weeks just made a full creative pipeline possible without a single subscription. Between March 1 and March 15, 2026, something happened that has never happened before in creative AI: open-source models matched or exceeded commercial tools across every major creative...
