Vannarot Roeung
Recent posts by Vannarot Roeung
Sony Acquires AI Startup Cinemersive Labs for Photo-to-3D Tech
Sony Interactive Entertainment has acquired Cinemersive Labs, a UK-based startup specializing in AI-powered technology that converts ordinary photos and videos into immersive 3D volumetric experiences.
ChatGPT Voice Mode Arrives on Apple CarPlay
OpenAI has launched ChatGPT Voice Mode on Apple CarPlay, giving drivers hands-free access to AI conversations for advice, brainstorming, and language practice while on the road.
Gemma 4 Rewrites the Open Source AI Playbook
Google released Gemma 4, a family of four open multimodal models under Apache 2.0. The 31B dense model scores 89.2% on AIME 2026, quadrupling Gemma 3. For creators and developers building on open models, this changes everything.
Figma Make Kits Let Design Systems Power AI Generation
Figma launched Make Kits and Make Attachments, two features that let design system authors teach Figma Make how to use their components and reference real data during AI generation.
OpenAI Buys TBPN Media Network for Hundreds of Millions
OpenAI acquired TBPN, the Technology Business Programming Network, a live tech news show with 300,000 followers, for low hundreds of millions of dollars.
Cursor 3 Rebuilds AI Coding Editor as Agent Workspace
Cursor shipped a complete overhaul on April 2, replacing its code editor with a unified workspace built around AI agents, multi-agent management, and cloud-local handoff.
ElevenLabs Launches ElevenMusic App for AI Song Creation
ElevenLabs launched ElevenMusic, a free iOS app that lets users create and remix songs using text prompts. The app marks the company expansion from voice AI into music generation.
Sony Woosh: Open Source Sound Effects AI Model
Sony AI has released Woosh, an open source sound effects foundation model supporting text-to-audio and video-to-audio generation.
Open Source Audio AI Closes the Quality Gap
The first week of April 2026 delivered three significant open-source audio AI models that directly challenge paid alternatives in voice synthesis, sound effects, and multilingual speech.
Google Vids Adds AI Avatars With Veo 3.1 and Free Video
Google Vids now lets Workspace users create AI-directed avatars using Veo 3.1, generate custom soundtracks with Lyria 3, and produce up to 10 free video clips per month.
Microsoft Launches 3 In-House AI Models, Reducing OpenAI Dependence
Microsoft launched three in-house AI models on April 2, reducing its dependence on OpenAI. MAI-Image-2, MAI-Voice-1, and MAI-Transcribe-1 cover image generation, text-to-speech, and speech recognition.
Alibaba Qwen3.6-Plus Turns Wireframes Into Code
Alibaba releases Qwen3.6-Plus with 1M-token context, autonomous repository-level coding, and the ability to convert wireframes and screenshots directly into working frontend code.
GitHub Copilot /fleet Runs Multiple AI Agents in Parallel
GitHub launches /fleet in Copilot CLI, dispatching multiple AI agents to work on different tasks simultaneously instead of processing requests sequentially.
Liquid AI LFM2.5-350M Runs AI Agents in 81MB on Mobile
Liquid AI releases LFM2.5-350M, a 350M parameter open-weight model that runs AI agents on mobile GPUs in just 81MB with 32k context window.
GLM-5V-Turbo Scores 94.8 on Design-to-Code
Z.ai launches GLM-5V-Turbo, a vision-coding model that scored 94.8 on Design2Code versus 77.3 for Claude Opus 4.6. Converts mockups directly to frontend code.
Arcee Ships Open Reasoning Model at 96% Less Cost
Arcee AI releases Trinity-Large-Thinking under Apache 2.0, ranking #2 on PinchBench behind Claude Opus 4.6 at just $0.90 per million output tokens.
OmniVoice: Open Source TTS Covers 600 Languages
OmniVoice is a zero-shot text-to-speech model supporting 600+ languages with voice cloning, an Apache 2.0 license, and inference 40x faster than real-time.
Alibaba Wan2.7-Image Unifies AI Generation and Editing
Alibaba released Wan2.7-Image, a unified AI model that handles both image generation and editing in one system, with fine-grained personalization, color control, and a Pro variant capable of 4K output.
Falcon Perception: 0.6B Vision Model Beats Meta SAM 3
Technology Innovation Institute releases Falcon Perception, a compact 0.6B parameter vision model that beats Meta SAM 3 across six key benchmarks while running on consumer hardware.
PrismML Open-Sources Bonsai 8B: A True 1-Bit LLM
PrismML emerged from stealth and open-sourced Bonsai 8B, a true 1-bit large language model that fits 8.2 billion parameters into just 1.15 GB of memory, roughly 14x more compressed than comparable 16-bit models.
Google Veo 3.1 Lite Cuts AI Video Costs in Half
Google launches Veo 3.1 Lite, a cost-effective video generation model available now via the Gemini API and Google AI Studio.
H Company Holo3 Sets New SOTA for GUI Agents
H Company released Holo3, a vision-language model optimized for GUI agents that sets new state-of-the-art on the OSWorld-Verified benchmark, beating proprietary models with only 3B active parameters.
The Anti-Slop Movement Is Reshaping Creative AI
A viral browser game called Your AI Slop Bores Me hit 50 million views in its first week. The anti-slop movement is reshaping how creators, brands, and platforms think about AI in creative work.
Claude Code Source Leak: 500K Lines Exposed via npm
Anthropic accidentally exposed the full source code of Claude Code on March 31, leaking roughly 1,900 files and over 500,000 lines of TypeScript through an npm packaging error.