OpenAI Codex Sites: Build Hosted Web Apps From a Prompt
OpenAI launched Sites for Codex, letting ChatGPT Business and Enterprise users describe internal tools in plain English and get hosted, authenticated web apps in return.
OpenAI launched Sites for Codex, letting ChatGPT Business and Enterprise users describe internal tools in plain English and get hosted, authenticated web apps in return.
Google gave early access to Gemini Spark, its new always-on AI agent that mines your Gmail, Drive, and Photos to complete tasks without being told context.
Deploy H Company's Apache 2.0 Holo 3.1 4B locally with vLLM on a 12GB consumer GPU, hook it into a desktop-agent workflow, and benchmark step latency and cost against Claude Computer Use and OpenAI Operator.
H Company released Holo 3.1, an Apache 2.0 computer-use agent family with quantized weights, mobile control, and 79.3% AndroidWorld score.
Cursor shipped auto-review mode today in version 3.6, a new run mode that lets the agent work for longer stretches with fewer approval prompts.
Linear launched Diffs on May 28, bringing pull request reviews into the same workspace as issues, projects, and customer signals.
Qwen3.7-Max sets a new bar for non-hallucination rate among frontier agent models. Here is how it stacks up against Claude Opus 4.7, Gemini 3.1 Pro, and GPT-5.5 across the four reliability dimensions that decide which model goes into production.
Google open-sourced Agent Executor (AX), a distributed runtime that gives any AI agent durable execution, sandboxed isolation, and crash recovery. Here is the 30-minute migration path for an existing Claude Code, Cursor, or Gemini CLI workflow, end-to-end.
Open-source Forge framework adds guardrails to 8B local models, lifting agentic task reliability to 86.5% -- no cloud API required.
NVIDIA has launched a formal verification pipeline for its AI agent skills, introducing cryptographic signing, automated security scanning, and machine-readable trust metadata.
Manus released Scheduled Tasks 2.0 on May 18, 2026, letting recurring agent runs stay inside the same task context instead of starting fresh every time. Web apps built with Manus can also schedule their own actions.
Moonshot AI released Kimi WebBridge on May 14, a browser extension that lets coding agents like Claude Code, Cursor, and Codex drive your local Chrome or Edge through the Chrome DevTools Protocol. Logins stay local.
Anthropic launched Claude for Small Business on May 13 with seven SaaS connectors. We compare it against Microsoft Copilot, Gemini for Workspace, ChatGPT for Business, and Adobe Firefly on connector coverage, native creative output, pricing transparency, and data defaults.
OpenSquilla v0.1.0 launched May 12, 2026: an Apache 2.0 microkernel runtime that smart-routes agent traffic across model tiers for 60 to 80 percent token savings.
How to use Google DeepMind Magic Pointer in Chrome and AI Studio. Five step-by-step workflows: enable in Chrome, edit images by pointing, navigate maps, rewrite Google Docs paragraphs, compare products, and where it fits versus Anthropic Computer Use, TML-Interaction, and Codex Chrome.
Microsoft Research tested Claude 4.6 Opus, GPT 5.4, and Gemini 3.1 Pro across 52 professional workflows. Frontier models lost 25 percent of document content on average.
Alibaba shipped the first end-to-end agentic shopping stack at 300M-MAU. How it compares against Amazon Rufus, Shopify Sidekick, and Etsy AI on search, virtual try-on, in-chat checkout, and scale.
OpenAI launched a Codex Chrome extension on May 7, 2026, putting the AI coding agent inside the browser on macOS and Windows. It works on any site you can open in Chrome.
Spotify shipped a CLI on May 7 2026 that lets Claude Code, OpenAI Codex, and OpenClaw save AI-generated Personal Podcasts directly to your Spotify library, private to your account.
Manus launched Cloud Computer on April 30, an always-on Ubuntu VM that lets the agent host 24/7 bots, persistent databases, and scheduled jobs.
xAI quietly shipped Grok Imagine Agent on April 30, 2026, giving Heavy and Super Grok subscribers an autonomous AI that decomposes project briefs into planned multi-asset generation runs.
Latitude, makers of AI Dungeon, launched Voyage on April 21, 2026 -- an AI-native RPG platform where creators build game worlds with natural language and every NPC thinks independently.
Anthropic added Live Artifacts to Claude Cowork on April 20, 2026: AI dashboards and trackers that connect to your apps and files and auto-refresh with current data every time you open them.
In Q1 2026, MCP integrations surpassed 1,000, Cursor shipped agent workspaces, and Windsurf released SWE-1.5 free. The single-tool era of creative AI is ending. The agent era has arrived.