Gemini API File Search Goes Multimodal with Image Embeddings

Gemini File Search now indexes images natively via gemini-embedding-2. This step-by-step multimodal RAG tutorial covers when to use it, how to set it up, the pricing math, and how it compares to Pinecone and Weaviate for visual retrieval.

Gemini Generates PDFs, Excel, Slides Direct From Chat

Google's Gemini app can now create PDFs, Microsoft Word and Excel files, Google Docs, Sheets, Slides, LaTeX, and more directly inside the chat. The feature is available globally to all Gemini users starting April 29, 2026.

Google Launches Ask YouTube AI Conversational Search

Google launched Ask YouTube on April 28, 2026, a conversational search beta that replaces keyword queries with natural-language questions and returns a mix of videos, Shorts, and AI summaries.

Google Adds Veo Video Upscaling on Vertex AI

Google added a standalone Veo upscaling capability to Vertex AI on April 23, alongside the enterprise rollout of Veo 3.1 Lite. The new tool sharpens any video to 1080p or 4K, regardless of whether the source was generated by Veo, another AI model, or a traditional.

Google Flow Music Launches: AI Song Studio Built on Lyria 3

Google launched Flow Music on April 18, 2026, rebranding ProducerAI into a standalone AI music studio powered by Lyria 3. Create full songs, make AI music videos with Veo, and remix tracks with the new Replace and Extend features, all free at flowmusic.app.

Avid Adds Gemini AI Agents to Media Composer for NAB 2026

Avid and Google Cloud announced a multi-year partnership on April 16, 2026, embedding Gemini and Vertex AI into Media Composer for agentic search, B-roll, and metadata workflows. Public demos run at NAB Show April 19 to 22.

Gemma 4 Rewrites the Open Source AI Playbook

Google released Gemma 4, a family of four open multimodal models under Apache 2.0. The 31B dense model scores 89.2% on AIME 2026, quadrupling Gemma 3's score. For creators and developers building on open models, this changes everything.

Google Maps Adds Gemini AI and 3D Navigation

Google launched Ask Maps and Immersive Navigation on March 12, 2026, bringing Gemini AI to Google Maps for the first time. Ask Maps lets users ask complex conversational questions, and Immersive Navigation replaces flat 2D directions with photorealistic 3D views of the road ahead.

Google Flow Merges Whisk, ImageFX, and Video into One AI Studio

Google merged three separate AI creative tools into one platform in March 2026. Flow now combines Whisk (mood boards and collages), ImageFX (text-to-image generation), and Flow's original video capabilities into a single workspace powered by Veo 3.1 and Nano Banana.

Google Gemini 3.1 Pro Doubles Reasoning Performance

Google released Gemini 3.1 Pro on February 19, 2026, reporting more than double the reasoning performance of its predecessor on key benchmarks. The model scored 77.1% on ARC-AGI-2, a test designed to measure general intelligence capabilities.
