NVIDIA Cosmos 3 Unifies World Simulation for Robots
NVIDIA unveiled Cosmos 3 at GTC, the first world foundation model that unifies synthetic world generation, physical AI reasoning, and action simulation.
Deep dives, tutorials, and analysis for AI-powered creators.
NVIDIA unveiled Cosmos 3 at GTC, the first world foundation model that unifies synthetic world generation, physical AI reasoning, and action simulation.
SD3.5-Flash generates images in just 4 steps instead of the usual 30 to 50, making on-device AI image generation practical on smartphones and laptops with under 8GB of RAM.
OpenArt Worlds generates navigable 3D environments from text prompts using World Labs spatial AI, letting creators walk through scenes and capture shots from any angle.
Amazon Ads launches Creative Agent, a free AI tool that handles scriptwriting, image generation, video, voiceovers, and music for ads on Amazon.
Anthropic has launched Dispatch, a new research preview for Claude Cowork that lets users send AI tasks from their phone and have Claude execute them on their Mac desktop.
Google has released an open-source MCP server for Google Colab, letting AI agents like Claude Code and Gemini CLI create notebooks, execute Python code, and access cloud GPUs directly.
Midjourney released V8 Alpha on March 17, bringing 5x faster generation, native 2K rendering, and significantly improved text rendering to its AI image platform.
OpenAI released GPT-5.4 mini and GPT-5.4 nano on March 17, bringing the capabilities of its flagship GPT-5.4 model to significantly smaller, faster, and cheaper packages.
H Company released Holotron-12B on March 17, an open-source computer use agent model that delivers more than 2x the throughput of its predecessor on a single H100 GPU.
Gamma, the AI-powered presentation and design platform with over 100 million users, launched Gamma Imagine on March 17 with AI image generation for brand-specific visual assets.
ComfyUI has integrated Reve, an aesthetic-driven AI image model that generates images at up to 4K resolution with three distinct operating modes.
Mistral launched Forge at NVIDIA GTC, a platform that lets enterprises train frontier-grade AI models from scratch using their own proprietary data.
Maxon has released Redshift 2026.4, introducing real-time architectural visualization that lets architects and 3D artists walk through projects at interactive speed.
Unsloth AI released Studio, a free local application that provides a visual interface for fine-tuning large language models while using 70% less VRAM than standard training methods.
NVIDIA CloudXR 6.0 now streams RTX-powered 4K graphics directly to Apple Vision Pro, enabling professional 3D design reviews and immersive workflows from any RTX workstation.
Researchers from CMU, Princeton, Together AI, and Cartesia AI published Mamba-3, a state-space model architecture that matches transformer performance while running inference 7x faster.
Adobe and NVIDIA announce a strategic partnership to build next-gen Firefly models on CUDA-X and Cosmos, plus a 3D digital twin solution now in public beta.
Picsart launched an AI agent marketplace that lets creators hire specialized AI assistants for tasks like Shopify product optimization, content resizing, and batch editing.
NVIDIA unveiled DLSS 5 at GTC 2026, introducing generative AI-powered neural rendering that transforms game visuals with photoreal lighting and materials.
xAI expanded Grok Imagine templates from mobile to web on March 16, letting users transform photos into AI-generated content through one-click template workflows.
NVIDIA announced DGX Spark at GTC 2026, a desktop workstation powered by the Grace Blackwell Superchip that runs AI models up to 120B parameters locally.
Mistral released Leanstral, the first open-source AI agent built for Lean 4 formal verification, proving AI-generated code meets its specifications at 93% lower cost.