Alibaba Wan2.7-Image Unifies AI Generation and Editing

Alibaba Wan2.7-Image Unifies AI Generation and Editing

Alibaba released Wan2.7-Image, a unified AI model that handles both image generation and editing in one system, with fine-grained personalization, color control, and a Pro variant capable of 4K output.

PrismML Open-Sources Bonsai 8B: A True 1-Bit LLM

PrismML Open-Sources Bonsai 8B: A True 1-Bit LLM

PrismML emerged from stealth and open-sourced Bonsai 8B, a true 1-bit large language model that fits 8.2 billion parameters into just 1.15 GB of memory, roughly 14x more compressed than comparable 16-bit models.

H Company Holo3 Sets New SOTA for GUI Agents

H Company Holo3 Sets New SOTA for GUI Agents

H Company released Holo3, a vision-language model optimized for GUI agents that sets new state-of-the-art on the OSWorld-Verified benchmark, beating proprietary models with only 3B active parameters.

Seedance 2.0 Relaunches on CapCut With IP Guards

Seedance 2.0 Relaunches on CapCut With IP Guards

ByteDance is rolling out Dreamina Seedance 2.0 globally through CapCut after pausing the launch to address copyright concerns from Hollywood studios. The model now blocks real-face video generation.

Gemini 3.1 Flash Live Makes Voice AI Natural

Gemini 3.1 Flash Live Makes Voice AI Natural

Google launched Gemini 3.1 Flash Live, a new audio-first AI model designed for real-time voice conversations. It supports over 90 languages, filters background noise more effectively, and maintains context for twice as long.

Cohere Transcribe Tops Open ASR Leaderboard

Cohere Transcribe Tops Open ASR Leaderboard

Cohere released Transcribe, a 2-billion-parameter open-source speech recognition model that tops the HuggingFace Open ASR Leaderboard with a 5.42% average word error rate, beating OpenAI Whisper Large v3 by 27%.

xAI SuperGrok Lite Brings AI Images for $10 a Month

xAI SuperGrok Lite Brings AI Images for $10 a Month

xAI launched SuperGrok Lite, a $10-per-month subscription tier that includes basic AI image and video generation. The plan sits below the existing $30 SuperGrok tier and targets casual creators.

Mistral Voxtral TTS Ships Open-Weights Speech Model

Mistral Voxtral TTS Ships Open-Weights Speech Model

Mistral AI released Voxtral TTS, a 4-billion-parameter open-weights text-to-speech model that matches or beats ElevenLabs on naturalness benchmarks. It supports nine languages and clones voices from just three seconds of reference audio.

Meta's AI Gamble: What $600B in Spending Means for Creators

Meta's AI Gamble: What $600B in Spending Means for Creators

Meta is spending $600 billion on AI data centers, cutting 16,000 jobs, and still falling behind Google and OpenAI. In a single week in March 2026, the company delayed its flagship AI model, acquired two agent startups, and reportedly began planning the largest layoffs in its history

Apple's M5 Max Changes the Math on Local AI for Creators

Apple's M5 Max Changes the Math on Local AI for Creators

The MacBook Pro M5 Max can run a 70-billion parameter language model at 18 to 25 tokens per second, generate a FLUX image 3.8 times faster than its predecessor, and fit everything in 128GB of unified memory without offloading to the CPU

Meta Signs $50M-Per-Year AI Content Deal with News Corp

Meta Signs $50M-Per-Year AI Content Deal with News Corp

Meta Platforms signed a three-year AI content licensing deal with News Corp worth up to $50 million per year. The agreement gives Meta access to News Corp's US and UK news archives to train AI models and power real-time responses in Meta AI products across Facebook, Instagram, WhatsApp, and Messe...

Free Weekly Newsletter

Stay ahead of Creative AI

Join creators getting the latest AI tools, model releases, and workflow tips delivered weekly.

No spam. Unsubscribe anytime.