Vannarot Roeung
Recent posts by Vannarot Roeung
Meta Builds Zuckerberg AI Clone, Plans Creator Digital Twins
Meta develops photorealistic AI avatar of Zuckerberg for 72K employees. Superintelligence Labs tech could let creators and influencers build their own digital twins.
Stanford AI Index 2026: What 53% Adoption Means for Creators
Stanford HAI's ninth annual AI Index Report reveals that generative AI reached 53% global adoption in three years, while the creative AI sector faces a transparency crisis and environmental reckoning.
ComfyUI v0.19.0 Adds Music, Text Gen, and Video Nodes
ComfyUI v0.19.0 lands with Ace Step 1.5 XL music generation, Qwen 3.5 text generation, SeeDance 2.0 video nodes, and a faster Flux 2 decoder.
llama.cpp Adds Audio Support With Qwen3-Omni
llama.cpp release b8769 adds audio multimodal support for Qwen3-Omni and Qwen3-ASR models, bringing local speech recognition and audio understanding to consumer hardware.
VoxCPM2 Open-Sources 2B-Param TTS With 30 Languages
OpenBMB released VoxCPM2, a 2 billion parameter text-to-speech model that runs on 8GB VRAM, supports 30 languages at 48kHz, and can design voices from natural language descriptions.
Claude for Word Beta Adds AI Editing to Microsoft Word
Anthropic launched Claude for Word in public beta, bringing AI-powered document editing with native tracked changes to Microsoft Word on Mac and Windows.
Game Studios Hit 4-20x Faster With AI
Game studios reorganizing around AI are completing prototypes 4x faster and generating UI assets up to 20x faster, according to new Wharton research.
MiniMax CLI Brings AI Video, Music, and Image Generation to Terminal
MiniMax released an official open-source CLI tool that brings AI video, music, image, and speech generation to the terminal with seven core capabilities.
MiniMax Music 2.6 Adds AI Cover and Style Transfer
MiniMax released Music 2.6 on April 10, adding a Cover feature that extracts a song's melodic skeleton and lets creators rebuild everything around it.
HappyHorse: Alibaba's Stealth Video AI Tops Every Benchmark
HappyHorse-1.0, the anonymous AI video model that topped every benchmark, is from Alibaba. Analysis of Zhang Di's defection from Kling, single-pass architecture, and open-source claims.
llama.cpp Ships Multi-GPU Tensor Parallelism
llama.cpp b8738 adds backend-agnostic tensor parallelism that enables large AI models to run across multiple GPUs without vendor-specific code.
Black Forest Labs: The 70-Person Giant Killer Pivots
Black Forest Labs declined xAI partnership and pivots to physical AI. Analysis of the 70-person startup behind FLUX, its $300M+ enterprise deals, and what robots mean for creators.
GaussiAnimate Auto-Rigs 3D Gaussian Splatting
Researchers released GaussiAnimate, a framework that automatically rigs and animates 3D Gaussian Splatting assets with 17.3% quality improvement.
Luma Dream Brief Tests AI Ads at Cannes Lions
21 AI-generated ads compete at Cannes Lions 2026 with a $1M prize. Analysis of Luma's Dream Brief, director-AI collaboration, and post-DM9 transparency.
Waypoint-1.5 Generates Interactive AI Worlds on Consumer GPUs
Overworld AI released Waypoint-1.5, a real-time video world model that generates interactive environments at 720p and 60 FPS on consumer GPUs like the RTX 3090.
MegaStyle Trains FLUX on 1.4M Styled Images
Researchers from Tongji University, Tencent, and five other institutions released MegaStyle, a 1.4-million image dataset for style transfer alongside a FLUX-based model.
Figma Weave: Node Graphs Replace the Prompt Box
Figma Weave connects 20+ AI models from 12 providers into a node-based canvas. Analysis of multi-model orchestration, App Mode, and what it means for design teams.
ChatGPT Pro Adds $100 Tier With 5x Codex Limits
OpenAI launched a new $100/month ChatGPT Pro tier on April 9, delivering 5x more Codex usage than Plus and filling the gap between the $20 and $200 plans.
Adobe Firefly Adds Precision Flow and AI Markup
Adobe has launched two new AI image editing features in Firefly: Precision Flow generates multiple variations from a single prompt, while AI Markup lets creators draw on images to guide AI edits.
Suno Licensing Talks With Universal and Sony Stall Over Download Rights
Suno licensing negotiations with Universal Music Group and Sony Music have stalled over a core disagreement: whether users can download and share AI-generated songs outside the platform.
ElevenLabs Brings Voice AI On-Premise and On-Device
ElevenLabs has announced on-premise and on-device deployment options for its voice AI platform, letting organizations run text-to-speech inference entirely within their own infrastructure.
PrismAudio Turns Silent AI Video Into Stereo
PrismAudio, developed by Alibaba FunAudioLLM team, generates spatial stereo audio directly from silent AI video files in an average of 0.63 seconds.
LPM 1.0 Generates Real-Time Character Video
A team of 23 researchers released LPM 1.0, a 17-billion parameter Diffusion Transformer that generates real-time character video from audio input.
Meta Muse Spark Marks Proprietary Shift for AI
Meta launched Muse Spark on April 8, the first AI model built by its new Superintelligence Labs division. Unlike every Llama release before it, Muse Spark is proprietary.