Hume TADA: Open-Source TTS With Zero Hallucinations
Hume AI has released TADA (Text-Acoustic Dual-Alignment), an open-source text-to-speech model that eliminates a persistent problem in AI speech synthesis: hallucinated words
Deep dives, tutorials, and analysis for AI-powered creators.
Hume AI has released TADA (Text-Acoustic Dual-Alignment), an open-source text-to-speech model that eliminates a persistent problem in AI speech synthesis: hallucinated words
NVIDIA announced DLSS 4.5 at GDC 2026, introducing Dynamic Multi Frame Generation that automatically adjusts generated frames in real time to hit a target frame rate
Meta has acquired Moltbook, a Reddit-style social network built exclusively for AI agents. The platform launched in late January 2026 as an experimental "third space" where AI bots can autonomously post, comment, and upvote or downvote content
NVIDIA is launching NemoClaw, an open-source enterprise AI agent platform, ahead of GTC 2026 (March 16-19). The platform integrates with the NeMo framework and Nemotron models, backed by partnerships with Salesforce, Google, Adobe, Cisco, and CrowdStrike
Adobe launched a public beta of its AI assistant for Photoshop on March 10, 2026, while simultaneously expanding Firefly with generative editing tools and integrations with 25+ third-party AI models including Runway, OpenAI, and Black Forest Labs. Adobe shipped two major updates on the same day
Black Forest Labs, the team behind the FLUX image generation models, published Self-Flow on March 10, 2026: an open-source framework that trains image, video, and audio generation models 2.8x faster without requiring external encoders
Freepik has shipped version 5.6 of its mobile app, adding two significant creative tools: Cinematic Shot for generating studio-grade photorealistic images and an AI Music Generator that produces royalty-free tracks on demand
Fish Audio released S2, an open-source text-to-speech model trained on over 10 million hours of audio data spanning 50 languages. The model beats GPT-4o-mini-tts on the EmergentTTS-Eval benchmark with an 81.88% win rate, and ships with full weights, fine-tuning code, and a streaming inference engine
Picsart just launched AI Playground, a unified creative hub that brings together more than 90 AI models from 24 providers into a single interface
Runway launched Runway Characters on March 9, 2026, a real-time video agent API that turns any image into a fully expressive, conversational digital persona
Framer launches Server API in open beta, enabling AI agents and automated workflows to sync CMS, publish changes, and update designs programmatically.
Independent musicians filed a class action lawsuit against Google, alleging Lyria 3 was trained on 44 million YouTube clips without permission.
Anthropic launched Claude Code Review on March 9, 2026, a multi-agent system that dispatches teams of AI reviewers on every pull request. The tool uses color-coded severity ratings and boosted thorough code reviews from 16% to 54% in internal testing
Yann LeCun's AMI Labs raised $1.03 billion at a $3.5 billion pre-money valuation, making it Europe's largest seed round ever. LeCun left Meta at the end of 2025 to build "world models" that learn from physical reality rather than text alone
Tripo debuted P1.0, a native 3D diffusion model that generates production-ready 3D assets in roughly two seconds, at GDC 2026 in San Francisco
Caitlin Kalinowski, OpenAI's head of robotics and consumer hardware, resigned on March 7, 2026, over concerns about the company's recently announced Pentagon deal
Pika Labs launched AI Selves on March 7, 2026, a feature that creates persistent AI video avatars from user-uploaded reference material. Each AI Self has memory, personality traits, and can appear in any scene the creator generates through Pika's existing text-to-video pipeline
Anthropic launched the Claude Marketplace on March 6, 2026, a procurement platform where enterprises can spend their existing Anthropic commitments on third-party applications built on Claude
ElevenLabs launched its Iconic Voice Marketplace in early March 2026, a licensing platform where celebrity voices are made available to businesses and creators for commercial use
IBM released Granite 4.0 1B Speech on March 6, a compact multilingual speech model that now ranks first on the OpenASR Leaderboard for automatic speech recognition
Hugging Face launched Modular Diffusers on March 5, replacing the monolithic DiffusionPipeline with a composable block system
Google merged three separate AI creative tools into one platform in March 2026. Flow now combines Whisk (mood boards and collages), ImageFX (text-to-image generation), and Flow's original video capabilities into a single workspace powered by Veo 3.1 and Nano Banana