Good morning. It’s Monday, June 8th.

Apple has spent the last decade holding the most valuable real estate in tech: the phone, the watch, the earbuds, the laptop, the messages, the calendar, the photos, the whole human nervous system. And somehow Siri still feels like it stopped improving 10 years ago.

But today is the day we might get a Siri integrated with an actual, usable AI! Apple’s WWDC is this afternoon at 1:00 p.m. Pacific. We’ll see if the rumors are true.

-Jeff
AI Breakfast

You read. We listen. Let us know what you think by replying to this email.

In partnership with AhaCreator

Influencer marketing works — until you actually try to run it at scale.

Finding the right creators, chasing replies, reviewing drafts, managing contracts, tracking deadlines, handling payments… it quickly turns into a full-time operations mess.

AhaCreator is an AI-native influencer marketing platform built to automate that entire workflow.

Its AI helps brands discover creators that actually fit their audience, detect fake engagement, manage outreach and follow-ups, review creator drafts against the brief, and keep campaigns moving from first contact to final delivery.

AhaCreator also handles the trust layer: standardized contracts, escrow protection, global payouts, and refund protection if a creator doesn’t deliver.

For startups, agencies, and brands trying to scale creator campaigns without adding more manual work, AhaCreator is basically an AI operations team for influencer marketing.

Thank you for supporting our sponsors!

OpenAI plans massive ChatGPT agent superapp

OpenAI is driving past 600 million monthly active users by orchestrating its most radical structural shift yet: killing the chat interface to build an autonomous agent superapp, all while navigating high-stakes negotiations over a potential government equity stake.

The core philosophy inside the company has shifted completely, with a senior employee declaring that "chat is dead." OpenAI is overhauling ChatGPT into an agent-driven ecosystem designed to anticipate user intent and execute complex multi-step tasks.

Instead of bouncing between tabs, users can now draft and natively send emails directly from interactive writing blocks. The upcoming interface will actively guide users toward heavy-duty coding environments and third-party integrations like Canva and Booking.com, leaning heavily into enterprise workflows to shore up revenue ahead of an IPO.

But giving autonomous agents deep tool access opens up a large attack surface. To combat prompt injection attacks, where malicious instructions hidden inside web pages or docs hijack model reasoning, OpenAI introduced a nuclear option called Lockdown Mode. Built specifically for high-risk corporate environments, this defensive containment layer blocks data exfiltration routes by cutting off internet browsing, Deep Research, agent execution, automatic file downloads, and external network connectors. The security upgrade also introduces an active session manager to track and revoke device access.

This push into foundational infrastructure has triggered unprecedented political entanglement. OpenAI is in active talks with the Trump administration over the US government taking a direct equity stake in the startup, which is approaching an $850 billion valuation.

Google locks down $920M monthly compute deal with SpaceX

Google has locked down a massive multi-year deal with SpaceX, guaranteeing access to 110,000 Nvidia GPUs and server infrastructure starting October 2026. Worth up to $920 million a month through mid-2029, the compute influx will scale gradually to feed Google’s hungriest next-generation models.

That compute is translating into sharper, highly optimized tools. Google DeepMind just dropped new Gemma 4 models built on Quantization-Aware Training (QAT). By simulating quantization during the training phase rather than compressing after the fact, Google managed to shrink memory footprints down to 1GB for its smallest variant without tanking accuracy. It pulls this off via static activations, channel-wise quantization, and selective 2-bit compression, allowing heavy-duty AI to run locally on smartphones and laptops.
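For readers who want a feel for what "simulating quantization during training" means, here is a minimal sketch in plain Python. It illustrates generic channel-wise fake quantization, not Google's implementation; the function names, the symmetric rounding scheme, and the bit widths are assumptions made for the example. In a real QAT setup, the forward pass would use these rounded weights while gradients keep updating the full-precision copies.

```python
def fake_quantize_channel(weights, bits=4):
    """Quantize one channel to signed integers, then map back to floats.

    Hypothetical helper for illustration: symmetric quantization with one
    scale per channel. Returns (dequantized_weights, scale).
    """
    qmax = 2 ** (bits - 1) - 1  # e.g. 7 for 4-bit, 1 for 2-bit
    max_abs = max(abs(w) for w in weights)
    if max_abs == 0:
        return list(weights), 1.0
    scale = max_abs / qmax
    dequantized = []
    for w in weights:
        q = round(w / scale)            # snap to the integer grid
        q = max(-qmax, min(qmax, q))    # clamp to the representable range
        dequantized.append(q * scale)   # back to float for the forward pass
    return dequantized, scale


def fake_quantize(matrix, bits=4):
    """Channel-wise: each row (channel) gets its own scale."""
    return [fake_quantize_channel(row, bits)[0] for row in matrix]


if __name__ == "__main__":
    layer = [[0.5, -1.0, 0.25], [10.0, -3.0, 7.0]]
    print(fake_quantize(layer, bits=4))
```

Giving each channel its own scale keeps a row of small weights from being flattened by a large value elsewhere in the layer, which is the usual argument for channel-wise over per-tensor quantization.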

For enterprise data, Google Research and Google Cloud launched an agentic RAG system that abandons the lazy "one-and-done" search method of traditional models. Instead, an agent loop continuously rewrites queries and evaluates context sufficiency, refusing to stop searching until it hits a reliable answer, a technical shift that bumps factual accuracy by up to 34%.
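The loop described above can be sketched roughly as follows. This is a toy under stated assumptions, not Google's system: `agentic_retrieve`, the three callbacks, the tiny corpus, and the `max_rounds` cap are all hypothetical, and in a real deployment the sufficiency check and the query rewriter would be model calls rather than keyword rules.

```python
def agentic_retrieve(question, retrieve, is_sufficient, rewrite_query, max_rounds=5):
    """Retrieve, check sufficiency, rewrite the query, and repeat.

    Returns (context, sufficient, rounds_used). The round cap is an
    assumption added so the sketch always terminates.
    """
    query = question
    context = []
    for round_no in range(1, max_rounds + 1):
        for doc in retrieve(query):
            if doc not in context:      # avoid duplicate passages
                context.append(doc)
        if is_sufficient(question, context):
            return context, True, round_no
        query = rewrite_query(question, query, context)
    return context, False, max_rounds


# Toy stand-ins for the retriever, the sufficiency judge, and the rewriter.
CORPUS = [
    "Gemma models are released by Google DeepMind.",
    "QAT simulates quantization during training.",
    "NotebookLM is a note-taking tool.",
]
REQUIRED = ("gemma", "quantization")


def toy_retrieve(query):
    terms = {t.lower() for t in query.split()}
    return [d for d in CORPUS
            if terms & {w.strip(".").lower() for w in d.split()}]


def toy_is_sufficient(question, context):
    return all(any(k in d.lower() for d in context) for k in REQUIRED)


def toy_rewrite(question, query, context):
    return query + " quantization"      # a real rewriter would be an LLM call


if __name__ == "__main__":
    print(agentic_retrieve("How does Gemma shrink memory",
                           toy_retrieve, toy_is_sufficient, toy_rewrite))
```

The contrast with one-shot retrieval is the second pass: the first query only surfaces the Gemma passage, so the loop rewrites the query and searches again before answering.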

Meanwhile, a code leak points to an imminent utility upgrade for NotebookLM, revealing upcoming export support for over 40 file formats.

These back-end upgrades are already bleeding into user-facing products and long-term timelines. Google is rolling out Search Profiles in the U.S., letting creators anchor their multi-platform media feeds directly into Google knowledge panels to capture Discover traffic.

Anthropic targets custom silicon with key OpenAI hardware hire

To break free from its reliance on Google and Amazon, Anthropic just poached Clive Chan, OpenAI’s second chip engineer, indicating a serious push into custom, energy-efficient silicon ahead of a potential IPO.

That hardware focus is translating into deeper capability. Anthropic doubled its Cowork platform limits to handle heavier workloads, matching a growing trend highlighted by Boris Cherny on how to run Claude Opus autonomously for days. Cherny outlines a framework using cloud-hosted Claude Code execution, auto-approval loops to cut manual prompts, dynamic sub-agent workflows, and specialized /goal or /loop commands to sustain long-running engineering tasks.

The raw intelligence is there, too. In specialized tests, Claude Opus 4.7 read complex molecular blueprints via NMR spectroscopy data, matching specialized chemistry software by pinning down hydrogen shifts to within roughly ±0.079 ppm. Online hype is also spiking over a leaked report of an unreleased model whose early low-effort, zero-shot outputs allegedly surpass late 2025 frontier models.

But scaling this fast creates friction. In production, developers are wrestling with an "infinite blast radius" after a migration from Claude Sonnet 4.0 to 4.5 unpredictably broke a major enterprise API pipeline. The newer model began tossing structured parameters into free-text fields and asking unexpected questions, showing that prompt engineering alone can't guarantee stability.
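One common guard against this kind of drift is to validate model output in code before it reaches the pipeline. The sketch below is an assumption about how such a check might look, not the affected team's fix: the schema, the field names, and `validate_tool_call` are hypothetical, and the heuristics only cover the two failure modes mentioned above.

```python
import re

# Hypothetical schema for one tool call: field name -> expected Python type.
SCHEMA = {"customer_id": int, "priority": str, "summary": str}
FREE_TEXT_FIELDS = {"summary"}


def validate_tool_call(payload, schema=SCHEMA, free_text=FREE_TEXT_FIELDS):
    """Return a list of problems; an empty list means the payload passes."""
    problems = []
    for name, expected in schema.items():
        if name not in payload:
            problems.append(f"missing field: {name}")
        elif not isinstance(payload[name], expected):
            problems.append(
                f"wrong type for {name}: {type(payload[name]).__name__}")
    for name in payload:
        if name not in schema:
            problems.append(f"unexpected field: {name}")
    structured = [n for n in schema if n not in free_text]
    for name in free_text:
        text = payload.get(name)
        if not isinstance(text, str):
            continue
        # Structured parameters leaking into prose, e.g. "customer_id: 42".
        for other in structured:
            if re.search(rf"\b{re.escape(other)}\s*[:=]", text):
                problems.append(
                    f"{other} appears inside free-text field {name}")
        # The model asked something instead of reporting a result.
        if text.rstrip().endswith("?"):
            problems.append(f"{name} ends with a question instead of a result")
    return problems


if __name__ == "__main__":
    drifted = {"priority": "high",
               "summary": "customer_id: 42. Should I also escalate?"}
    for problem in validate_tool_call(drifted):
        print(problem)
```

Running the same checks against both model versions before a migration turns a silent behavior change into a failing test.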

Vokal is a collaboration workspace for founders and product teams to run, review, and reuse AI agent work.

Leni provides verified AI workflow automation that helps investors handle financial modeling, market research, and portfolio reporting.

Ejentum serves as a reasoning harness for AI agents to prevent cognitive drift and improve task reliability.

Job Postings API offers a free hosted interface to view, monitor, and analyze millions of US job listings.

Papera is an interactive AI notebook that instantly turns text prompts into custom-designed, multi-block page layouts.

Thank you for reading today’s edition.

Your feedback is valuable. Respond to this email and tell us how you think we could add more value to this newsletter.

Interested in reaching smart readers like you? To become an AI Breakfast sponsor, reply to this email or DM us on X!

Thinking of starting your own newsletter? AI Breakfast readers who sign up with Beehiiv receive a 14-day free trial and 20% off for 3 months.
