HeyGen Launches Developer Platform for AI Video Agents
HeyGen launched a developer platform with an open-source CLI, an HTML-to-video rendering framework called Hyperframes, and agent skills for Claude Code, Codex, and Cursor.
Deep dives, tutorials, and analysis for AI-powered creators.
HeyGen launched a developer platform with an open-source CLI, an HTML-to-video rendering framework called Hyperframes, and agent skills for Claude Code, Codex, and Cursor.
Cursor shipped version 3.1 on April 13, adding tiled multi-agent layouts, upgraded voice dictation, and a string of performance fixes that make the AI coding editor feel noticeably faster.
Springboards launched Flint, a creative AI model engineered for novelty that generates seven functionally distinct responses per ten prompts, compared to 2.88 for leading LLMs.
Meta develops photorealistic AI avatar of Zuckerberg for 72K employees. Superintelligence Labs tech could let creators and influencers build their own digital twins.
ComfyUI v0.19.0 lands with Ace Step 1.5 XL music generation, Qwen 3.5 text generation, SeeDance 2.0 video nodes, and a faster Flux 2 decoder.
llama.cpp release b8769 adds audio multimodal support for Qwen3-Omni and Qwen3-ASR models, bringing local speech recognition and audio understanding to consumer hardware.
OpenBMB released VoxCPM2, a 2 billion parameter text-to-speech model that runs on 8GB VRAM, supports 30 languages at 48kHz, and can design voices from natural language descriptions.
Anthropic launched Claude for Word in public beta, bringing AI-powered document editing with native tracked changes to Microsoft Word on Mac and Windows.
Game studios reorganizing around AI are completing prototypes 4x faster and generating UI assets up to 20x faster, according to new Wharton research.
MiniMax released an official open-source CLI tool that brings AI video, music, image, and speech generation to the terminal with seven core capabilities.
MiniMax released Music 2.6 on April 10, adding a Cover feature that extracts a song's melodic skeleton and lets creators rebuild everything around it.
llama.cpp b8738 adds backend-agnostic tensor parallelism that enables large AI models to run across multiple GPUs without vendor-specific code.
Researchers released GaussiAnimate, a framework that automatically rigs and animates 3D Gaussian Splatting assets with 17.3% quality improvement.
Overworld AI released Waypoint-1.5, a real-time video world model that generates interactive environments at 720p and 60 FPS on consumer GPUs like the RTX 3090.
Researchers from Tongji University, Tencent, and five other institutions released MegaStyle, a 1.4-million image dataset for style transfer alongside a FLUX-based model.
OpenAI launched a new $100/month ChatGPT Pro tier on April 9, delivering 5x more Codex usage than Plus and filling the gap between the $20 and $200 plans.
Adobe has launched two new AI image editing features in Firefly: Precision Flow generates multiple variations from a single prompt, while AI Markup lets creators draw on images to guide AI edits.