Audio - Creative AI News

News Jun 17, 2026

DeepL Acquires Mixhalo for Live Voice Translation

DeepL has acquired Mixhalo, the real-time audio platform built for concerts and conferences, pushing AI voice translation into live events.

Deep Dive Jun 15, 2026

Cartesia Sonic-3.5 and Ink-2: The #1 Voice AI Stack

Cartesia launched Sonic-3.5 and Ink-2 on June 15, 2026, claiming the top spot in streaming voice AI for both speaking and listening. Both models are built for real-time voice agents.

AI Jun 14, 2026

TTS Audio Suite v5 Brings Higgs v3 to ComfyUI

TTS Audio Suite shipped v5.0.0 for ComfyUI, adding Higgs Audio v3 voice cloning, Transformers 5, and Runtime Isolation for legacy engines.

Deep Dive Jun 13, 2026

The Best AI Music Generators in 2026: Suno, Udio, ElevenLabs and More

We compare the best AI music generators in 2026, from Suno and Udio to ElevenLabs Music, Google Lyria, and Stable Audio, on standout features, pricing, and commercial-use rights so you can pick the right tool for songs, scores, or royalty-free background music.

Open Source Jun 12, 2026

Audio-Reactive LoRA Syncs LTX 2.3 Video to Music

A new open-source Audio-Reactive LoRA for LTX 2.3 turns a still image and an audio track into a music-driven clip, with motion synced to the beat.

AI Jun 11, 2026

Suno Advanced Split: Pick From 100 Instrument Stems

Suno's new Advanced Split mode lets Premier subscribers extract any of nearly 100 instruments from a track, from a full drum kit to vocals.

Audio Jun 8, 2026

TTS Benchmark Ranks 46 Voice Models on Blind Tests

A revamped open-source TTS benchmark now compares 46 text-to-speech models using objective scores and blind human voting, so creators can see which voices actually hold up.

Deep Dive May 31, 2026

HAIM: AI Music Detectors Fail on Human-AI Hybrid Tracks

A new research benchmark published June 1, 2026, reveals that state-of-the-art AI music detectors systematically fail on hybrid productions, the kind created by most real-world music producers using tools like Suno or Udio.

Deep Dive May 31, 2026

MOSS-Audio 8B: Open-Source Audio AI Beating 30B Rivals

OpenMOSS published the MOSS-Audio technical report on June 1, 2026, documenting four open-source audio-language models that achieve benchmark scores rivaling systems three to four times their size.

Audio May 30, 2026

15,000 Free AI Audio Samples Built with Stable Audio 3

A developer used Stability AI's Stable Audio 3 Medium to generate 15,834 free audio samples: 10,359 drum one-shots and 5,475 pitched instrument recordings, available for immediate download.

News May 29, 2026

NAVA: Baidu Open-Sources 6.3B Audio-Video AI Model

Baidu open-sourced NAVA, a 6.3B parameter joint audio-video model that generates 720p video with synced dual-channel audio in a single pass.

Deep Dive May 28, 2026

parakeet.cpp: NVIDIA Parakeet Speech-to-Text, No Python

parakeet.cpp ports NVIDIA Parakeet automatic speech recognition models to ggml, eliminating the Python runtime entirely, with byte-identical output to NeMo and up to 1.86x faster throughput.

Deep Dive May 27, 2026

Dasheng AudioGen: Unified Text-to-Audio Scene Generation

A team of 10 researchers published Dasheng AudioGen on May 27, 2026, a unified model that generates complete audio scenes from text descriptions.

AI Music May 26, 2026

ElevenLabs Music v2: 50% Cheaper, Genre Transitions Now

ElevenLabs released Music v2 on May 26, 2026, with dynamic genre transitions and up to 50% lower prices. Live now on ElevenMusic and ElevenCreative.

News May 25, 2026

Orchestria Launches: AI Music You Conduct as Stems

Orchestria launches May 25 with a multi-agent AI music engine that exposes drums, bass, and melody as editable stems. Free, Pro $8, Maestro $25.

Audio May 24, 2026

Pretzel: Chat-Driven AI Music Sequencer from Google I/O

Developer Shukant Pal built Pretzel at the Google I/O hackathon on May 24, 2026: a live web-based music sequencer where an AI agent controls what everyone hears. No signup required.

News May 21, 2026

Sonar Brings Audio Search to AI Agents and Podcast Creators

Sonar entered public beta offering natural-language search across podcasts, news, earnings calls, and radio archives for AI agents. Free tier: 500 queries/month.

Audio May 21, 2026

Spotify Studio Brings AI Personal Podcasts to 20+ Markets

Spotify's new Studio desktop app uses an AI agent to generate private, personalized podcasts from your calendar, email, and web browsing. In research preview across 20+ markets.

Audio May 21, 2026

Spotify Debuts ElevenLabs Audiobook Tool for Creators

Spotify announced a new ElevenLabs-powered audiobook creation tool at its May 21 Investor Day, with new audiobook subscription plans expected later in 2026.

Deep Dive May 21, 2026

ElevenLabs Speech Engine Tutorial: Add Voice in 30 Min

Wrap an existing chat agent (Claude, GPT, Gemini) with ElevenLabs Speech Engine voice in roughly 30 minutes. Step-by-step tutorial covers Node and Python SDKs, WebRTC token flow, turn detection, interruption, and the upgrade path to ElevenAgents.

Audio May 21, 2026

Context-Aware Audio Denoising: AI That Adapts to Your Scene

Researchers at Tampere University published a preprint introducing ACAD, an AI system that adapts what it treats as noise based on the acoustic scene it detects.

Deep Dive May 20, 2026

Stable Audio 3: Open-Weights AI Generates 6-Minute Songs

Stability AI released Stable Audio 3 on May 20, 2026, a family of open-weights latent diffusion models that generate up to 6 minutes and 20 seconds of audio from a text prompt and can run on a MacBook Pro M4.

Deep Dive May 18, 2026

StemDeck: Free Local Audio Stem Separation

StemDeck v0.5.0 splits any track into vocals, drums, bass, guitar, piano, and other elements. Runs locally, free, no account required.

Deep Dive May 15, 2026

ComfyUI-DramaBox Adds LoRA Support for Custom Voice Training

ComfyUI-DramaBox added LoRA weight injection on May 16, letting creators load custom voice personalities into workflows without model reloads.
