Vannarot Roeung - Creative AI News (Page 10)

AI Apr 2, 2026

Sony Acquires AI Startup Cinemersive Labs for Photo-to-3D Tech

Sony Interactive Entertainment has acquired Cinemersive Labs, a UK-based startup specializing in AI-powered technology that converts ordinary photos and videos into immersive 3D volumetric experiences.

ChatGPT Apr 2, 2026

ChatGPT Voice Mode Arrives on Apple CarPlay

OpenAI has launched ChatGPT Voice Mode on Apple CarPlay, giving drivers hands-free access to AI conversations for advice, brainstorming, and language practice while on the road.

Deep Dive Apr 2, 2026

Gemma 4 Rewrites the Open Source AI Playbook

Google released Gemma 4, a family of four open multimodal models under Apache 2.0. The 31B dense model scores 89.2% on AIME 2026, quadrupling Gemma 3. For creators and developers building on open models, this changes everything.

AI Tools Apr 2, 2026

Figma Make Kits Let Design Systems Power AI Generation

Figma launched Make Kits and Make Attachments, two features that let design system authors teach Figma Make how to use their components and reference real data during AI generation.

News Apr 2, 2026

OpenAI Buys TBPN Media Network for Hundreds of Millions

OpenAI acquired TBPN, the Technology Business Programming Network, a live tech news show with 300,000 followers, for low hundreds of millions of dollars.

News Apr 2, 2026

Cursor 3 Rebuilds AI Coding Editor as Agent Workspace

Cursor shipped a complete overhaul on April 2, replacing its code editor with a unified workspace built around AI agents, multi-agent management, and cloud-local handoff.

AI Apr 2, 2026

ElevenLabs Launches ElevenMusic App for AI Song Creation

ElevenLabs launched ElevenMusic, a free iOS app that lets users create and remix songs using text prompts. The app marks the company expansion from voice AI into music generation.

AI Tools Apr 2, 2026

Sony Woosh: Open Source Sound Effects AI Model

Sony AI has released Woosh, an open source sound effects foundation model supporting text-to-audio and video-to-audio generation.

Deep Dive Apr 2, 2026

Open Source Audio AI Closes the Quality Gap

The first week of April 2026 delivered three significant open-source audio AI models that directly challenge paid alternatives in voice synthesis, sound effects, and multilingual speech.

AI Video Apr 2, 2026

Google Vids Adds AI Avatars With Veo 3.1 and Free Video

Google Vids now lets Workspace users create AI-directed avatars using Veo 3.1, generate custom soundtracks with Lyria 3, and produce up to 10 free video clips per month.

AI Apr 2, 2026

Microsoft Launches 3 In-House AI Models, Reducing OpenAI Dependence

Microsoft launched three in-house AI models on April 2, reducing its dependence on OpenAI. MAI-Image-2, MAI-Voice-1, and MAI-Transcribe-1 cover image generation, text-to-speech, and speech recognition.

AI Apr 2, 2026

Alibaba Qwen3.6-Plus Turns Wireframes Into Code

Alibaba releases Qwen3.6-Plus with 1M-token context, autonomous repository-level coding, and the ability to convert wireframes and screenshots directly into working frontend code.

ai-coding Apr 1, 2026

GitHub Copilot /fleet Runs Multiple AI Agents in Parallel

GitHub launches /fleet in Copilot CLI, dispatching multiple AI agents to work on different tasks simultaneously instead of processing requests sequentially.

Open Source Apr 1, 2026

Liquid AI LFM2.5-350M Runs AI Agents in 81MB on Mobile

Liquid AI releases LFM2.5-350M, a 350M parameter open-weight model that runs AI agents on mobile GPUs in just 81MB with 32k context window.

AI Apr 1, 2026

GLM-5V-Turbo Scores 94.8 on Design-to-Code

Z.ai launches GLM-5V-Turbo, a vision-coding model that scored 94.8 on Design2Code versus 77.3 for Claude Opus 4.6. Converts mockups directly to frontend code.

AI Apr 1, 2026

Arcee Ships Open Reasoning Model at 96% Less Cost

Arcee AI releases Trinity-Large-Thinking under Apache 2.0, ranking #2 on PinchBench behind Claude Opus 4.6 at just $0.90 per million output tokens.

AI Tools Apr 1, 2026

OmniVoice: Open Source TTS Covers 600 Languages

OmniVoice is a zero-shot text-to-speech model supporting 600+ languages with voice cloning, an Apache 2.0 license, and inference 40x faster than real-time.

ai-image Apr 1, 2026

Alibaba Wan2.7-Image Unifies AI Generation and Editing

Alibaba released Wan2.7-Image, a unified AI model that handles both image generation and editing in one system, with fine-grained personalization, color control, and a Pro variant capable of 4K output.

News Apr 1, 2026

Falcon Perception: 0.6B Vision Model Beats Meta SAM 3

Technology Innovation Institute releases Falcon Perception, a compact 0.6B parameter vision model that beats Meta SAM 3 across six key benchmarks while running on consumer hardware.

News Apr 1, 2026

PrismML Open-Sources Bonsai 8B: A True 1-Bit LLM

PrismML emerged from stealth and open-sourced Bonsai 8B, a true 1-bit large language model that fits 8.2 billion parameters into just 1.15 GB of memory, roughly 14x more compressed than comparable 16-bit models.

AI Video Mar 31, 2026

Google Veo 3.1 Lite Cuts AI Video Costs in Half

Google launches Veo 3.1 Lite, a cost-effective video generation model available now via the Gemini API and Google AI Studio.

News Mar 31, 2026

H Company Holo3 Sets New SOTA for GUI Agents

H Company released Holo3, a vision-language model optimized for GUI agents that sets new state-of-the-art on the OSWorld-Verified benchmark, beating proprietary models with only 3B active parameters.

Deep Dive Mar 31, 2026

The Anti-Slop Movement Is Reshaping Creative AI

A viral browser game called Your AI Slop Bores Me hit 50 million views in its first week. The anti-slop movement is reshaping how creators, brands, and platforms think about AI in creative work.

AI Tools Mar 31, 2026

Claude Code Source Leak: 500K Lines Exposed via npm

Anthropic accidentally exposed the full source code of Claude Code on March 31, leaking roughly 1,900 files and over 500,000 lines of TypeScript through an npm packaging error.

Recent posts by Vannarot Roeung

Stay ahead of Creative AI