llama.cpp Adds Audio Support With Qwen3-Omni
llama.cpp release b8769 adds audio multimodal support for Qwen3-Omni and Qwen3-ASR models, bringing local speech recognition and audio understanding to consumer hardware.
llama.cpp release b8769 adds audio multimodal support for Qwen3-Omni and Qwen3-ASR models, bringing local speech recognition and audio understanding to consumer hardware.
In April 2026, a creator with a $1,200 desktop PC and a 12GB graphics card can generate publication-quality images in under 10 seconds, produce video clips, run a conversational AI, and compose music without sending a single byte to the cloud.
Topaz Starlight Precise 2.5 is now available as ComfyUI partner nodes, delivering sharper video upscaling from 720p to 4K with fewer artifacts.
NVIDIA announced DGX Spark at GTC 2026, a desktop workstation powered by the Grace Blackwell Superchip that runs AI models up to 120B parameters locally.
The MacBook Pro M5 Max can run a 70-billion parameter language model at 18 to 25 tokens per second, generate a FLUX image 3.8 times faster than its predecessor, and fit everything in 128GB of unified memory without offloading to the CPU
AMD launched the Ryzen AI 400 Series desktop processors at MWC 2026 on March 2, marking the first time its XDNA 2 NPU and RDNA 3.5 integrated graphics are available in a desktop form factor