Open Source - Creative AI News

Deep Dive Jun 18, 2026

ComfyUI v0.25 Bundles Kling, Depth, and 3D Nodes

ComfyUI shipped v0.25.0 and v0.25.1 in three days, adding Kling V3-Turbo, Depth Anything 3, Tripo3D, and native 3D preview nodes.

Deep Dive Jun 18, 2026

Text-to-CAD AI: Turn Prompts Into Printable Models

Yes, AI can turn a sentence into a real 3D model you can print, but only some tools give you a model you can also edit. The split that decides everything is parametric versus mesh.

Open Source Jun 18, 2026

Beyond LoRA: Hugging Face Tests Better Fine-Tuning Methods

Hugging Face benchmarked LoRA against newer PEFT methods like OFT, which beat it on image generation while using less memory. The default is not always the best fit.

AI Jun 16, 2026

UCP-Local: Offline RAG for Claude Desktop and Cursor

A new open-source tool, UCP-Local, grounds Claude Desktop, Cursor, and LM Studio in your own files for fully offline retrieval. Released June 16, 2026 under Apache-2.0.

ai-music Jun 15, 2026

SlipMate Is an Open-Source AI DJ Instrument for Mac

SlipMate is a free, open-source generative DJ instrument that runs two AI music models locally and lets you mix their output in real time like vinyl.

News Jun 15, 2026

PixlStash Desktop App: Self-Hosted AI Image Manager

PixlStash, the open-source self-hosted image manager for AI creators, now ships as a native desktop app for Windows, macOS, and Linux with one-click install.

AI Jun 13, 2026

GLM 5.2 Ships 1M-Token Context on Z.ai Coding Plan

Zhipu has shipped GLM 5.2, a coding-first model now live across every tier of its Z.ai Coding Plan, led by a 1-million-token context window.

Open Source Jun 12, 2026

VibeClip: Open-Source AI Video Editor You Direct by Chat

VibeClip is a new open-source, self-hosted tool that turns long videos into vertical captioned shorts you direct by chatting.

Open Source Jun 12, 2026

Audio-Reactive LoRA Syncs LTX 2.3 Video to Music

A new open-source Audio-Reactive LoRA for LTX 2.3 turns a still image and an audio track into a music-driven clip, with motion synced to the beat.

AI Models Jun 12, 2026

Kimi K2.7-Code: Open Model Beats Opus 4.8 on Tool Use

Moonshot released Kimi K2.7-Code on June 12, an open-weights coding model that beats Claude Opus 4.8 on MCPMark tool use at far lower cost.

Deep Dive Jun 10, 2026

DiffusionGemma: Google's 4x Faster Open Text Model

Google DeepMind released DiffusionGemma on June 10, 2026, an Apache 2.0 open model that generates text up to 4x faster by denoising blocks of tokens in parallel, and it runs on a single RTX GPU.

ai-coding Jun 10, 2026

Xiaomi Open-Sources MiMo Code, a Claude Code Rival

Xiaomi open-sourced MiMo Code, a free MIT-licensed terminal coding agent with persistent memory that runs on its MiMo-V2.5-Pro model and rivals Claude Code.

Deep Dive Jun 9, 2026

OpenCV 5.0 Turns Vision Into a Local AI Runtime

OpenCV 5.0 now runs LLMs, vision-language models, diffusion, and inpainting natively, turning the most-used vision library into a local runtime for an entire creative pipeline.

NVIDIA Jun 4, 2026

NVIDIA Nemotron 3.5 ASR: 40 Languages at 80ms Latency

NVIDIA released Nemotron 3.5 ASR on June 4: an open 600M streaming speech model covering 40 language-locales with sub-100ms latency for voice agents.

News Jun 3, 2026

Gemma 4 12B: Encoder-Free Multimodal in 12 Billion Params

Google's new Gemma 4 12B drops separate vision and audio encoders, packing native video and speech understanding into a single 12B open-weights model that runs locally.

AI Tools Jun 3, 2026

Google AI Edge Gallery Lands on Mac With Gemma 4 12B

Google AI Edge Gallery arrives on macOS with Gemma 4 12B support, bringing local AI model testing to Mac users for the first time.

Deep Dive Jun 2, 2026

Ideogram 4.0 Open Weights: Run a 9.3B Model in ComfyUI

Ideogram released its first open-weight text-to-image model on June 3, 2026: a 9.3B parameter Diffusion Transformer with JSON-structured prompting, in-image text rendering, and day-zero ComfyUI support.

AI Tools Jun 2, 2026

TinyFish Bigset: Open-Source Web Data Agent for Creators

TinyFish Bigset launched June 2 as an AGPL-3.0 open-source multi-agent system that turns a plain-English sentence into a structured dataset pulled from the live web, then refreshes it on a schedule.

NVIDIA Jun 2, 2026

NVIDIA Cosmos Coalition: Runway, BFL, LTX Join

NVIDIA forms Cosmos Coalition with Runway, Black Forest Labs, and LTX to build open video generation infrastructure.

jetbrains Jun 2, 2026

JetBrains Mellum2 Thinking: Apache 2.0 MoE Coding Model

JetBrains releases Mellum2 Thinking under Apache 2.0, an open-weights coding model with chain-of-thought reasoning.

Deep Dive Jun 2, 2026

Fizgig v1.2.4: Flux Klein 9B LoRA Training on 16GB GPUs

Fizgig v1.2.4 makes full Flux 2 Klein 9B LoRA training possible on 16GB GPUs using fp8 Base DiT at 9.6GB VRAM. The free, open-source studio includes training presets, repair tools, and profiler.

Image Generation Jun 2, 2026

NVIDIA PiD Adds FLUX.2 Color Fix and Qwen-Image Support

NVIDIA pushed new PiD checkpoints June 2 with a FLUX.2 color-fix variant plus Qwen-Image support, all on Apache 2.0 for direct 4K decode in ComfyUI.

News Jun 1, 2026

Holo 3.1: Open-Weights Local Computer-Use Agent

H Company released Holo 3.1, an Apache 2.0 computer-use agent family with quantized weights, mobile control, and 79.3% AndroidWorld score.

Deep Dive Jun 1, 2026

Run Holo 3.1 Locally: Computer-Use Agent in 30 Min

Deploy H Company's Apache 2.0 Holo 3.1 4B locally with vLLM on a 12GB consumer GPU, hook it into a desktop-agent workflow, and benchmark step latency and cost against Claude Computer Use and OpenAI Operator.

Stay ahead of Creative AI