DeepL Acquires Mixhalo for Live Voice Translation

DeepL has acquired Mixhalo, the real-time audio platform built for concerts, conferences, and sports venues, in a deal announced on June 17, 2026. The move pushes DeepL beyond text and into live spoken communication, signaling that real-time AI voice translation is moving from demo to production infrastructure that creators can actually build on.

What Happened

DeepL, best known for high-quality machine translation, bought Mixhalo, a company founded in 2016 by Incubus guitarist Mike Einziger, violinist Ann Marie Simpson-Einziger, and CEO Vik Singh. Mixhalo started by streaming crystal-clear concert audio directly to fans' phones and expanded into low-latency audio for sports and live events. Mixhalo was already a DeepL customer, and the acquisition folds that live-event audio pipeline into DeepL's translation stack. It builds directly on the voice-to-voice translation DeepL launched earlier in 2026.

Why It Matters

Live translation has always been the hardest version of the problem: it has to be fast, accurate, and delivered to many devices at once. Pairing DeepL's translation models with Mixhalo's low-latency audio delivery targets exactly that gap. For creators, this is the same shift happening across the audio space, where real-time translation is reaching dozens of languages. DeepL's own Voice product is the consumer-facing side of this; the acquisition is the live-event side.

Key Details

Deal terms were not disclosed. Mixhalo had raised more than $39 million from investors including Fortress Investment, Founders Fund, and Cowboy Ventures. DeepL says it is opening a San Francisco office to grow its U.S. operations on the back of the deal, and plans to demonstrate voice-to-voice and voice-to-text translation in real conference and event settings. DeepL's voice translation already covers 40-plus languages, competing with players like Wordly AI and Palabra.

What This Enables for Creators

If you produce multilingual content, this is the tooling layer maturing under you. Real-time voice translation that runs at event scale means live streams, webinars, virtual summits, and podcasts can ship multilingual audio without a room full of human interpreters. The same pipeline that translates a keynote for an in-room audience can caption or dub a creator's livestream for a global one. The practical takeaway: voice-to-voice translation is now reliable enough to design into a workflow, not just bolt on afterward.

What to Do Next

Map where language is currently capping your reach. If your stream, course, or podcast has international viewers, test a real-time translation pass on one episode before committing. Developers building multilingual live experiences can wire transcription and translation directly through the DeepL Voice API, which opens a WebSocket connection to transcribe and translate audio as it streams. Start with one language pair, measure latency and accuracy on your actual content, then expand from there.

DeepL Acquires Mixhalo for Live Voice Translation

What Happened

Why It Matters

Key Details

What This Enables for Creators

What to Do Next

Keep reading

NVIDIA XR AI: Open-Source AI Agents for AR Glasses

Kling 3 vs Runway vs Sora: Best AI Video 2026

AI Video Generation Cost in 2026: Price Per Second

What Happened

Why It Matters

Key Details

What This Enables for Creators

What to Do Next

Stay ahead of AI

Keep reading

NVIDIA XR AI: Open-Source AI Agents for AR Glasses

Kling 3 vs Runway vs Sora: Best AI Video 2026

AI Video Generation Cost in 2026: Price Per Second

Stay ahead of Creative AI