Open Source - Creative AI News (Page 5)

AI Video Apr 23, 2026

Vista4D: Eyeline Labs Reshoots Any Video in 4D

Eyeline Labs released Vista4D, an open-source framework that reshoots existing video from new camera angles using a 4D point cloud plus video diffusion.

AI Tools Apr 22, 2026

LLaDA2.0-Uni: Ant Group's Open Image Gen and Edit Model

Inclusion AI released LLaDA2.0-Uni, a 16B MoE diffusion model that unifies text-to-image generation, instruction-based editing, and image understanding in one Apache-2.0 checkpoint.

AI Models Apr 22, 2026

Qwen3.6-27B: Open Dense Model Beats Its 35B Sibling

Alibaba released Qwen3.6-27B on April 22, 2026. The 27B dense open-weight model beats its larger 35B-A3B sibling on coding, agent, and vision benchmarks.

voice-ai Apr 22, 2026

Xiaomi MiMo-V2.5 Open-Sources 8B Voice Pipeline

Xiaomi released MiMo-V2.5 on April 22, an 8B open-weights speech recognizer plus a three-model TTS series with prompt-built voice generation.

AI Tools Apr 21, 2026

Alibaba Open-Sources Tstars-Tryon Virtual Try-On Benchmark

Alibaba's Taobao and Tmall algorithm team released Tstars-Tryon 1.0 on April 21, 2026, a commercial-scale virtual try-on system already serving millions of Taobao shoppers.

Audio Apr 20, 2026

Voicebox: Free Open-Source Voice Studio with 7 TTS Engines

Jamie Pine's Voicebox brings seven TTS engines and voice cloning to your local machine. Free, open source, and entirely offline.

Deep Dive Apr 19, 2026

Kimi K2.6 Deep Dive: Inside the Open-Weight Model That Tops Every Coding Benchmark

Moonshot AI's Kimi K2.6 scores 58.6 on SWE-Bench Pro, placing first above GPT-5.4 and Claude Opus 4.6. Full architecture breakdown, benchmark analysis, and what the open-weight license actually allows.

AI Models Apr 19, 2026

Qwen3.6-Max-Preview: Alibaba's New AI Tops Coding Benchmarks

Alibaba released Qwen3.6-Max-Preview on April 20, 2026, topping six programming benchmarks including SWE-benchPro. Here is what it means for creators building AI workflows.

AI News Apr 19, 2026

Quiver SVG Generation Launches in ComfyUI Partner Nodes

ComfyUI added Quiver Arrow 1.1 as a Partner Node on April 20, 2026, bringing structured SVG generation to open-source visual workflows for the first time.

3d-ai Apr 19, 2026

TRELLIS.2 Image-to-3D Now Runs on Apple Silicon Without Nvidia

A community port of Microsoft TRELLIS.2 brings image-to-3D mesh generation to Apple Silicon. No Nvidia GPU required, GLB output ready for Blender and Unity.

AI Tools Apr 15, 2026

Darwin-TTS Adds Emotion to AI Voice With No Training Required

Researchers published Darwin-TTS on April 15, 2026, a text-to-speech model that adds emotional expression to AI voice without any training, fine-tuning, or new data.

AI Video Apr 14, 2026

HeyGen Launches Developer Platform for AI Video Agents

HeyGen launched a developer platform with an open-source CLI, an HTML-to-video rendering framework called Hyperframes, and agent skills for Claude Code, Codex, and Cursor.

Image Generation Apr 14, 2026

Nucleus-Image Is the First Open-Source MoE Diffusion Model

NucleusAI released Nucleus-Image on April 14, a 17-billion parameter diffusion model that activates only 2 billion parameters per image -- cutting compute without sacrificing quality.

Deep Dive Apr 13, 2026

Seedance 2.0 Arrives in ComfyUI: What It Means for Video Creators

ByteDance's Seedance 2.0 is now a native node in ComfyUI, bringing multimodal audio-video generation with 12 simultaneous reference inputs directly into the most popular open-source creative AI pipeline.

ComfyUI Apr 13, 2026

ComfyUI v0.19.0 Adds Music, Text Gen, and Video Nodes

ComfyUI v0.19.0 lands with Ace Step 1.5 XL music generation, Qwen 3.5 text generation, SeeDance 2.0 video nodes, and a faster Flux 2 decoder.

Open Source Apr 12, 2026

llama.cpp Adds Audio Support With Qwen3-Omni

llama.cpp release b8769 adds audio multimodal support for Qwen3-Omni and Qwen3-ASR models, bringing local speech recognition and audio understanding to consumer hardware.

Audio Apr 12, 2026

VoxCPM2 Open-Sources 2B-Param TTS With 30 Languages

OpenBMB released VoxCPM2, a 2 billion parameter text-to-speech model that runs on 8GB VRAM, supports 30 languages at 48kHz, and can design voices from natural language descriptions.

ai-music Apr 10, 2026

MiniMax Music 2.6 Adds AI Cover and Style Transfer

MiniMax released Music 2.6 on April 10, adding a Cover feature that extracts a song's melodic skeleton and lets creators rebuild everything around it.

AI Apr 9, 2026

llama.cpp Ships Multi-GPU Tensor Parallelism

llama.cpp b8738 adds backend-agnostic tensor parallelism that enables large AI models to run across multiple GPUs without vendor-specific code.

ai-3d Apr 9, 2026

GaussiAnimate Auto-Rigs 3D Gaussian Splatting

Researchers released GaussiAnimate, a framework that automatically rigs and animates 3D Gaussian Splatting assets with 17.3% quality improvement.

ai-image Apr 9, 2026

MegaStyle Trains FLUX on 1.4M Styled Images

Researchers from Tongji University, Tencent, and five other institutions released MegaStyle, a 1.4-million image dataset for style transfer alongside a FLUX-based model.

AI Video Apr 9, 2026

LPM 1.0 Generates Real-Time Character Video

A team of 23 researchers released LPM 1.0, a 17-billion parameter Diffusion Transformer that generates real-time character video from audio input.

AI Apr 8, 2026

Safetensors Joins PyTorch Foundation as Open Project

Hugging Face contributes Safetensors to the PyTorch Foundation, making the de facto AI model storage format a vendor-neutral, community-governed project.

AI Apr 7, 2026

Google Open-Sources Scion Multi-Agent Orchestrator

Google Cloud Platform has open-sourced Scion, an experimental orchestration testbed that runs multiple AI coding agents as isolated, concurrent container processes.

Stay ahead of Creative AI