Audio - Creative AI News (Page 2)

OpenAI May 15, 2026

OpenAI Acquires Weights.gg, Folds Voice-Cloning Team

OpenAI quietly bought voice-cloning startup Weights.gg, dispersed its six-person team across existing groups, and folded the tech into Voice Engine work.

tts May 15, 2026

Supertonic 3: On-Device TTS Speaks 31 Languages

Supertonic 3 is an open-weights, CPU-only TTS engine from Supertone with 31 languages, expression tags, and zero-shot voice cloning.

Deep Dive May 14, 2026

Break-the-Beat: AI Turns MIDI Drums Into Audio

Break-the-Beat! is a new AI model that renders drum MIDI patterns as realistic audio using a reference recording for timbre. Researchers released a demo with dozens of examples spanning Speed Metal, Funk Rock, and electronic drum kits.

Open Source May 13, 2026

Scenema Audio: Open-Source Voice Cloning From LTX 2.3

ScenemaAI released Scenema Audio on Hugging Face and GitHub, an open-weights expressive TTS and zero-shot voice cloning model built on the audio half of Lightricks LTX-2. MIT inference code, 13 languages, real-time on a 24 GB GPU.

News May 6, 2026

Google Flow Music Reaches Believe and TuneCore Artists

Google partners with Believe and TuneCore to route Flow Music, its Lyria 3 Pro song studio, to indie artists. Ambassador program meets Google's product team weekly.

Deep Dive May 5, 2026

ElevenLabs Hits $500M ARR: What It Means for Creators

ElevenLabs crossed $500 million in ARR on May 5, 2026, announcing BlackRock, Nvidia, Jamie Foxx, and Eva Longoria as new investors. The $11B valuation makes it the most-funded independent voice AI company.

Deep Dive May 5, 2026

Ableton Live 12.4: Link Audio and 6 Workflow Updates

Ableton Live 12.4 shipped May 5, 2026 as a free update for Live 12. The standout feature is Link Audio, which streams audio wirelessly between two devices on your local network without extra hardware.

Audio Apr 20, 2026

Voicebox: Free Open-Source Voice Studio with 7 TTS Engines

Jamie Pine's Voicebox brings seven TTS engines and voice cloning to your local machine. Free, open source, and entirely offline.

nab-2026 Apr 17, 2026

RODE Launches RODECaster Studio: AI Podcast Post-Production App

RODE announced RODECaster Studio at NAB 2026 on April 17, 2026 -- a new Mac and Windows desktop app that brings AI-powered editing to podcast post-production.

AI Tools Apr 16, 2026

Google Gemini Mac App Adds Image, Video, and Music Generation

Google launched a native Gemini app for Mac on April 16, 2026, bringing image, video, and music generation directly to the desktop for the first time.

AI Tools Apr 15, 2026

Darwin-TTS Adds Emotion to AI Voice With No Training Required

Researchers published Darwin-TTS on April 15, 2026, a text-to-speech model that adds emotional expression to AI voice without any training, fine-tuning, or new data.

AI Tools Apr 15, 2026

Google Gemini 3.1 Flash TTS Beats ElevenLabs in Quality Benchmarks

Google released Gemini 3.1 Flash TTS on April 15, 2026, a text-to-speech model that outperforms ElevenLabs v3 in quality benchmarks while offering a generous free tier.

News Apr 15, 2026

Splice Launches AI Sample Variations with Built-In Creator Payouts

Splice launched Variations and Craft on April 15, 2026, letting producers remix any sample from its 3-million-sound library while automatically compensating the original creator.

AI Apr 14, 2026

ComfyUI Adds Frame-Synced AI Music via Sonilo

ComfyUI now supports Sonilo via Partner Nodes, letting creators generate full-length soundtracks that sync to video footage frame by frame.

Open Source Apr 12, 2026

llama.cpp Adds Audio Support With Qwen3-Omni

llama.cpp release b8769 adds audio multimodal support for Qwen3-Omni and Qwen3-ASR models, bringing local speech recognition and audio understanding to consumer hardware.

Audio Apr 12, 2026

VoxCPM2 Open-Sources 2B-Param TTS With 30 Languages

OpenBMB released VoxCPM2, a 2 billion parameter text-to-speech model that runs on 8GB VRAM, supports 30 languages at 48kHz, and can design voices from natural language descriptions.

Audio Apr 9, 2026

ElevenLabs Brings Voice AI On-Premise and On-Device

ElevenLabs has announced on-premise and on-device deployment options for its voice AI platform, letting organizations run text-to-speech inference entirely within their own infrastructure.

News Apr 9, 2026

Suno Licensing Talks With Universal and Sony Stall Over Download Rights

Suno licensing negotiations with Universal Music Group and Sony Music have stalled over a core disagreement: whether users can download and share AI-generated songs outside the platform.

AI Apr 6, 2026

Google Eloquent Brings Offline AI Dictation to iPhone

Google quietly launches Eloquent, a free Gemma-powered dictation app for iOS with offline transcription, filler word removal, and text transformation tools.

AI Apr 6, 2026

Sync-3 Ships 4K AI Lipsync With Full-Shot Processing

Sync, the Y Combinator-backed lipsync startup, launched Sync-3 on April 6. The new model processes entire shots at once, producing 4K native output with built-in obstruction detection.

AI Tools Apr 2, 2026

Sony Woosh: Open Source Sound Effects AI Model

Sony AI has released Woosh, an open source sound effects foundation model supporting text-to-audio and video-to-audio generation.

AI Tools Apr 1, 2026

OmniVoice: Open Source TTS Covers 600 Languages

OmniVoice is a zero-shot text-to-speech model supporting 600+ languages with voice cloning, an Apache 2.0 license, and inference 40x faster than real-time.

Audio Mar 27, 2026

Deezer Licenses AI Music Detection to Rights Body

Deezer has licensed its AI-generated music detection technology to Hungarys EJI performers rights body. The platform flags over 60,000 AI tracks daily, with 39% of all uploads now involving AI.

Audio Mar 25, 2026

ElevenLabs and IBM Partner on Enterprise Voice AI

ElevenLabs and IBM announced a partnership on March 25 to integrate 10,000+ voices across 70 languages into IBM watsonx Orchestrate.

Stay ahead of Creative AI