llama.cpp Adds Audio Support With Qwen3-Omni

llama.cpp Adds Audio Support With Qwen3-Omni

llama.cpp release b8769 adds audio multimodal support for Qwen3-Omni and Qwen3-ASR models, bringing local speech recognition and audio understanding to consumer hardware.

MegaStyle Trains FLUX on 1.4M Styled Images

MegaStyle Trains FLUX on 1.4M Styled Images

Researchers from Tongji University, Tencent, and five other institutions released MegaStyle, a 1.4-million image dataset for style transfer alongside a FLUX-based model.

The Creator's Guide to Running AI Locally in 2026

The Creator's Guide to Running AI Locally in 2026

In April 2026, a creator with a $1,200 desktop PC and a 12GB graphics card can generate publication-quality images in under 10 seconds, produce video clips, run a conversational AI, and compose music without sending a single byte to the cloud.

Tencent OmniWeaving Unifies 7 Video Gen Tasks

Tencent OmniWeaving Unifies 7 Video Gen Tasks

Tencent has released OmniWeaving, a unified video generation model that handles seven distinct tasks from text-to-video to reasoning-augmented generation, with publicly available weights and code.

Open Source Audio AI Closes the Quality Gap

Open Source Audio AI Closes the Quality Gap

The first week of April 2026 delivered three significant open-source audio AI models that directly challenge paid alternatives in voice synthesis, sound effects, and multilingual speech.

Gemma 4 Rewrites the Open Source AI Playbook

Gemma 4 Rewrites the Open Source AI Playbook

Google released Gemma 4, a family of four open multimodal models under Apache 2.0. The 31B dense model scores 89.2% on AIME 2026, quadrupling Gemma 3. For creators and developers building on open models, this changes everything.

Alibaba Wan2.7-Image Unifies AI Generation and Editing

Alibaba Wan2.7-Image Unifies AI Generation and Editing

Alibaba released Wan2.7-Image, a unified AI model that handles both image generation and editing in one system, with fine-grained personalization, color control, and a Pro variant capable of 4K output.

Free Weekly Newsletter

Stay ahead of Creative AI

Join creators getting the latest AI tools, model releases, and workflow tips delivered weekly.

No spam. Unsubscribe anytime.