Vannarot Roeung
Recent posts by Vannarot Roeung
Meta Business Agent Now Live on WhatsApp and Instagram
Meta launched its Business Agent globally on June 3, 2026, bringing automated customer service, product recommendations, and sales capabilities to WhatsApp, Messenger, and Instagram.
NVIDIA Cosmos 3: Free Physical AI Tools Launch at CVPR 2026
NVIDIA unveiled Cosmos 3 at CVPR 2026: an open physical AI foundation model paired with free vision synthesis and 3D reconstruction tools on GitHub.
Google Dreambeans: Daily AI Stories From Your Life
Google Labs launched Dreambeans on June 3, 2026, turning Gmail, Calendar, and Google Photos into AI-illustrated daily stories powered by Nano Banana 2.
Ideogram 4.0 Open Weights: Run a 9.3B Model in ComfyUI
Ideogram released its first open-weight text-to-image model on June 3, 2026: a 9.3B parameter Diffusion Transformer with JSON-structured prompting, in-image text rendering, and day-zero ComfyUI support.
OpenAI Codex Sites: Build Hosted Web Apps From a Prompt
OpenAI launched Sites for Codex, letting ChatGPT Business and Enterprise users describe internal tools in plain English and get hosted, authenticated web apps in return.
TinyFish Bigset: Open-Source Web Data Agent for Creators
TinyFish Bigset launched June 2 as an AGPL-3.0 open-source multi-agent system that turns a plain-English sentence into a structured dataset pulled from the live web, then refreshes it on a schedule.
Microsoft Open-Sources Windows Agent Framework at Build
Microsoft announces Windows Agent Framework and a new Agent Store at Build 2026, enabling AI agents to control desktop applications natively.
Project Polaris Replaces GPT-4 in GitHub Copilot Aug 2026
Microsoft unveiled Project Polaris at Build 2026: an in-house mixture-of-experts coding model that replaces GPT-4 Turbo as the GitHub Copilot default in August 2026.
OpenAI Codex Adds Creative and Design Plugins
OpenAI shipped six Codex plugins on June 2 with Creative Production and Product Design tiers wired into Figma, Canva, Picsart, Shutterstock, and Fal.
Microsoft MAI Foundry vs Nano Banana 2, FLUX.2: Compared
MAI-Image-2.5 hits No. 3 on Arena and edits past Nano Banana 2. We compare the new MAI Foundry stack against Google, BFL, Ideogram, and ElevenLabs on price, identity preservation, and bundled-stack ROI.
Microsoft Launches MAI v2 Media Suite at Build 2026
Microsoft used its Build 2026 keynote on June 2 to ship a refreshed MAI media stack: MAI-Image-2.5 with editing, MAI-Voice-2 for TTS, and MAI-Transcribe-1.5 for ASR.
MAI-Code-1-Flash: Microsoft's New Coding Model in Copilot
Microsoft used its Build 2026 keynote on June 2 to ship MAI-Code-1-Flash, an in-house coding model that is live today inside the GitHub Copilot model picker for Free, Pro, Pro+, and Max tiers.
Anthropic Scales Claude Mythos to Critical Infrastructure
Anthropic is scaling Project Glasswing to 150 new organizations across more than 15 countries, expanding its AI-powered security initiative.
NVIDIA Cosmos Coalition: Runway, BFL, LTX Join
NVIDIA forms Cosmos Coalition with Runway, Black Forest Labs, and LTX to build open video generation infrastructure.
OpenRouter Adds Voice, Model Fusion, and 20 New Models
OpenRouter May 2026 release spotlight covers voice fusion models, new providers, and routing improvements for AI developers.
JetBrains Mellum2 Thinking: Apache 2.0 MoE Coding Model
JetBrains releases Mellum2 Thinking under Apache 2.0, an open-weights coding model with chain-of-thought reasoning.
ComfyUI v0.23.0: Six Partner Nodes and Native 3D Splats
ComfyUI v0.23.0 adds partner nodes marketplace and native Gaussian splat support for 3D creators.
Qwen3.7-Plus Adds Vision and Video to Reasoning Stack
Qwen 3.7 Plus adds native vision and video understanding to Alibaba Bailian platform, challenging GPT-5.5 and Gemini for multimodal creative workflows.
Microsoft MAI-Thinking-1 Tops Claude Sonnet in User Evals
Microsoft's MAI-Thinking-1, a 35B sparse MoE with 256k context, tops Claude Sonnet 4.6 in blind evaluations and matches Claude Opus 4.6 on SWE-Bench Pro.
Junco Turns Your Newsletter Subscriptions Into AI Audio
Junco converts your email newsletter subscriptions into short AI-generated audio episodes. Free on iPhone, iOS 17 and later.
Microsoft Launches Web IQ: Bing Search API for AI Agents
Microsoft launched Web IQ on June 2, 2026, a suite of AI-native search APIs built for AI agents. Powered by Bing, it already runs under Copilot and ChatGPT web responses.
Fizgig v1.2.4: Flux Klein 9B LoRA Training on 16GB GPUs
Fizgig v1.2.4 makes full Flux 2 Klein 9B LoRA training possible on 16GB GPUs using fp8 Base DiT at 9.6GB VRAM. The free, open-source studio includes training presets, repair tools, and profiler.
NVIDIA PiD Adds FLUX.2 Color Fix and Qwen-Image Support
NVIDIA pushed new PiD checkpoints June 2 with a FLUX.2 color-fix variant plus Qwen-Image support, all on Apache 2.0 for direct 4K decode in ComfyUI.
Google Gemini Spark Review: The AI Agent That Knows Too Much
Google gave early access to Gemini Spark, its new always-on AI agent that mines your Gmail, Drive, and Photos to complete tasks without being told context.