AI Breakfast
Posts
Full Recap of Google's Massive AI Releases

Full Recap of Google's Massive AI Releases

Google crushes expectations in generative AI

AI Breakfast
May 21, 2025

Good morning. It’s Wednesday, May 21st.

On this day in tech history: 1952: IBM announced its first electronic computer, the Model 701

In today’s email:

Google I/O 2025
Microsoft Build 2025
5 New AI Tools
Latest AI Research Papers

You read. We listen. Let us know what you think by replying to this email.

^{In partnership with}^{Head AI}

Meet Head: The World's FIRST AI Marketer is Here!

Forget tools, meet a new species! Head, the world's first true AI Marketer, isn't just software—it's a living AI aligned with your brand's DNA. It autonomously evolves, strategizes, and executes, from influencer programs to multi-channel campaigns and creative work. Proven to deliver real business results, generating over $100M revenue for clients. This is the dawn of a new marketing era.

See examples of how AI Is Taking Over Marketing!

^{Thank you for supporting our sponsors!}

Today’s trending AI news stories

Google I/O 2025 Recap: Multimodal AI Takes Center Stage with Smarter Search and Generative Tools

Google’s latest AI advancements introduced at I/O 2025 include a range of multimodal models and tools grouped by functionality.

The Gemini family leads with Gemini 2.5 Pro “Deep Think,” which offers parallel hypothesis evaluation and configurable thinking budgets for complex math and coding, surpassing competitor benchmarks. Gemini 2.5 Flash enhances efficiency with thought summaries and reduced token use, available now on AI Studio, Vertex AI, and Gemini app. Gemini 3n targets smartphones and edge devices with lightweight multimodal processing and advanced multilingual support. Gemini Diffusion accelerates text generation through iterative noise refinement, while MedGemma and SignGemma focus on medical image analysis and sign language translation, respectively.

In AI storytelling and filmmaking, Veo 3 delivers lifelike video generation with integrated audio, lip syncing, and physics, accessible via Gemini, Flow, and Vertex AI for Ultra subscribers. Imagen 4 produces high-quality 2K images with improved texture detail and typography, supporting multiple platforms including Gemini and Whisk. Flow combines Veo 3, Imagen, and Gemini to enable cinematic scene creation with story management. Lyria RealTime offers interactive music generation via Gemini API, supporting detailed control over genre, instruments, and audio quality.

Say goodbye to the silent era of video generation: Introducing Veo 3 — with native audio generation. 🗣️
Quality is up from Veo 2, and now you can add dialogue between characters, sound effects and background noise.
Veo 3 is available now in the @GeminiApp for Google AI Ultra
— Google (@Google)
6:23 PM • May 20, 2025

Veo 3: "a big broadway musical about garlic bread, with elaborate costumes and a sondheim-like vibe"
— Ethan Mollick (@emollick)
5:46 AM • May 21, 2025

For coding and developer tools, Agentic Colab autonomously analyses and refines code across notebook cells with error fixes and data workflows. Gemini Code Assist 2.5 provides AI co-pilot and multi-file code review capabilities, free for individuals and GitHub users. Firebase Studio converts Figma designs into full-stack apps with backend provisioning, supported by Firebase AI Logic integrating Gemini and Vertex AI. Jules, powered by Gemini 2.5 Pro, is an asynchronous coding agent running in a secure Google Cloud VM for bug fixes, testing, and prototyping, including audio changelogs. Stitch transforms text and images into UI designs with frontend code, supporting Figma export and theming. Google AI Studio now integrates Gemini 2.5 Pro, Imagen 4, and Veo 3 within its code editor, linked to the GenAI SDK for instant app generation.

The Gemini API introduces native audio output, live and async calls, and smart video/audio features with text-to-speech voice customization. New AI Mode on Gemini 2.5 improves question breakdown for research, live camera queries, and data charting, now available in the U.S. Gemini in Chrome integrates AI directly in the browser for summarization and task automation, initially to U.S. users.

Additional innovations include Google Beam, a 3D video communication platform converting 2D streams into volumetric experiences with light field displays; Project Astra, an experimental AI assistant offering proactive task management via contextual device data; and Google Meet’s real-time speech translation using a DeepMind audio model preserving speaker tone, initially supporting English and Spanish. Google AI Ultra, a new subscription tier available in the U.S. for $249.99/month, bundles access to these advanced models and tools, including Gemini, Flow, Whisk, NotebookLM with expanded usage, plus 30 TB storage and YouTube Premium.

Sergey Brin also made an unannounced appearance, explaining that the promise of AGI pulled him out of retirement. He remarked, “Anyone who is a computer scientist should not be retired right now,” underscoring the rapid pace of AI advances. Brin also speculated that our world might exist within “a stack of simulations.”

Construction Underway for First Stargate AI Data Center in Texas

Microsoft Build 2025: Agents That Code, Coordinate, and Accelerate Discovery

At Build 2025, Microsoft laid out its vision for an “open agentic web,” unveiling over 50 AI updates across software development, system architecture, and scientific research. GitHub Copilot now handles asynchronous tasks like drafting pull requests, refactoring code, and fixing bugs under human review. Agent coordination is handled via Azure AI Foundry using Agent2Agent (A2A) and Anthropic’s Model Context Protocol (MCP), which enables shared memory and multi-agent fluency. MCP is now embedded directly into Windows, giving agents standardized access to directories, subsystems, and built-in permissions to safeguard misuse, demonstrated with Perplexity navigating a local file system via natural language.

GitHub Copilot now has a coding agent embedded right where you already collaborate with developers: on GitHub. And yes, you can access it from VS Code too. 🤖
— Thomas Dohmke (@ashtom)
4:04 PM • May 19, 2025

Edge compute gets a boost through Windows AI Foundry, allowing on-device agent execution across Intel, AMD, Nvidia, and Qualcomm chips, no custom runtime required. Developers can fine-tune foundation models with proprietary data via Copilot Tuning, deploy Grok 3, or choose from over 1,900 models on Azure. NLWeb, Microsoft’s new protocol, introduces personalized AI agents to websites without relying on the cloud.

You don’t need to be a data scientist or prompt engineer to get fine-tuned AI responses. Copilot Tuning lets anyone create task-specific models in just a few clicks. See how. youtu.be/6udJzJAyT5I#CopilotTuning#copilot#microsoft#microsoft365#ai#artificialintelligence
— Microsoft Mechanics (@MSFTMechanics)
12:20 AM • May 20, 2025

A standout demo featured Microsoft Discovery, an AI-driven research engine that acts like a virtual team of scientific “postdocs.” It screened 367,000 chemical candidates in just 200 hours to identify a safer coolant for data centre immersion—a process that traditionally takes years.

Built on a graph-based knowledge engine with Copilot integration, Discovery allows researchers to simulate, test, and explore scientific hypotheses through natural language. Microsoft plans to extend its use across pharma, semiconductors, and cosmetics, with integration into Nvidia hardware, Synopsys tools, and eventually quantum computing.

5 new AI-powered tools from around the web

Stocks, Bonds, Crypto, & Options Investing With Artificial Intelligence - Public.com

Invest in Stocks, Bonds, Options, Crypto, ETFs, Treasuries, and more with AI-powered fundamental data and custom analysis.

public.com

Entelligence AI

Entelligence AI automates code reviews and PR analysis with intelligent agents, cut review time, surface bugs early, and boost engineering productivity. Smarter, faster code reviews powered by AI.

www.entelligence.ai

Line0

The first AI pair programmer for backend developers. Build Express.js backend services with the help of AI. Made by Vratix.

www.line0.dev

Posium - AI Agents for End-to-End Testing

AI Agent for end-to-end testing. Generate end-to-end tests with 10x speed using Gen AI.

posium.ai

BuildShip Tools | Prompt, Build, Deploy MCP-ready tools for AI Agents

Create and deploy powerful tools for any AI agent - from Claude and ElevenLabs Voice to Cursor, just by vibe-coding. Connect any service (Stripe, Supabase, Github etc), test, tweak, and ship hosted tool workflows or or export the code to self-host.

buildship.tools