VOID, BiRefNet, and Gemma 4 Are Now in ComfyUI

VOID, BiRefNet, and Gemma 4 Are Now in ComfyUI

Three significant open-source models arrived in ComfyUI on May 14, 2026: VOID for video object deletion, BiRefNet for image segmentation, and Gemma 4 for multimodal reasoning.

OpenReader v3: Self-Hosted TTS for PDF, EPUB, DOCX

OpenReader v3: Self-Hosted TTS for PDF, EPUB, DOCX

OpenReader v3.0 converts PDF, EPUB, DOCX, TXT, and Markdown files into synchronized read-along sessions or exported audiobooks, with multiple TTS providers and Docker deployment.

Scenema Audio: Open-Source Voice Cloning From LTX 2.3

Scenema Audio: Open-Source Voice Cloning From LTX 2.3

ScenemaAI released Scenema Audio on Hugging Face and GitHub, an open-weights expressive TTS and zero-shot voice cloning model built on the audio half of Lightricks LTX-2. MIT inference code, 13 languages, real-time on a 24 GB GPU.

IBM Granite 4.1: Dense LLMs Walk Back the MoE Bet

IBM Granite 4.1: Dense LLMs Walk Back the MoE Bet

IBM released Granite 4.1 on April 29, 2026 with dense decoder-only 3B, 8B, and 30B Apache 2.0 LLMs that walk back the Granite 4.0 MoE bet. The 8B dense beats the 32B-A9B MoE on most benchmarks. 128K production context, 512K extension.

Poolside Releases Laguna M.1 and Open-Weight XS.2

Poolside Releases Laguna M.1 and Open-Weight XS.2

Poolside released the first two models in its Laguna family on April 28: a 225B-parameter flagship called M.1 and an open-weight 33B model called XS.2 under Apache 2.0. Both are pitched at agentic, long-horizon coding work.

SenseNova U1 Drops VAEs: Open-Source Unified Image AI

SenseNova U1 Drops VAEs: Open-Source Unified Image AI

SenseTime open-sourced SenseNova U1 on April 28, 2026, releasing three model variants under Apache 2.0. The architecture drops VAEs entirely, unifying image generation and text reasoning in one space.

Open-Source AI Models 2026: The Creator Reference Guide

Open-Source AI Models 2026: The Creator Reference Guide

The complete creator reference to open-source AI in 2026: LLMs (DeepSeek V4, Kimi, Qwen), image (FLUX, Wan 2.7), video (Wan, Skywork, LTX), 3D (HY-World, TRELLIS), audio (Voicebox, OmniVoice, Darwin-TTS), all license-aware.

Free Weekly Newsletter

Stay ahead of Creative AI

Join creators getting the latest AI tools, model releases, and workflow tips delivered weekly.

No spam. Unsubscribe anytime.