OpenAI Releases o3-Pro
Plus, this AI is controlling the next generation of robots
Good morning. It’s Wednesday, June 11th.
On this day in tech history: In 1978, Intel launched the 8086, a 16-bit processor that became the foundation of the x86 architecture. Its close variant, the 8088, powered the first IBM PC, setting a standard that still drives most personal computers today. The chip enabled faster processing and more complex software at the time, but its deeper impact was locking in the x86 model. Nearly five decades later, its legacy continues to define how PCs are built and how software is written.
In today’s email:
- OpenAI taps Google, slows o3-pro for precision 
- Google speeds up Veo 3, adds Mariner to Chrome 
- Redwood equips NEO for mobile, full-body tasks 
- 5 New AI Tools 
- Latest AI Research Papers 
You read. We listen. Let us know what you think by replying to this email.
Clone Any Voice. Speak Any Language. Sound Human.
Fish.Audio is redefining AI voice generation with the most realistic speech synthesis on the market. Whether you're creating voiceovers, audiobooks, ads, or full-blown AI characters, this platform delivers stunning quality and control.
- 200,000+ voices in the public library — or upload your own 
- Voice cloning from just 15 seconds of audio 
- Text-to-speech, speech-to-text, and full voice agent support 
- Supports 13 languages with native-level expressiveness 
- Used by over 150 creators and studios across YouTube, TikTok, and beyond 
Side-by-side tests show Fish.Audio outperforming ElevenLabs in emotional nuance and clarity. From Taylor Swift and Elon to original avatars, it’s become the go-to platform for creators who want voices that feel real.
🎙 Try it now: fish.audio/discovery

Today’s trending AI news stories
OpenAI Trades Speed for Precision with o3-pro; Taps Google for Compute
OpenAI is leaning hard into precision with o3-pro, a tool-native upgrade to its o3 model built for enterprise-grade reasoning. Tuned for accuracy over speed, it integrates Python, file analysis, web browsing, and vision, making it well suited to complex, multi-step tasks where accuracy trumps latency. Available via API and now the default for Pro and Team ChatGPT users, o3-pro costs $20 per million input tokens and $80 per million output tokens, trading speed and cost for deeper tool use and more deliberate thinking. It can take minutes per response, but early testers say it handles uncertainty well, reasons in context, and chooses tools intelligently. It's slow, but by design.

At the same time, OpenAI dropped o3’s pricing by 80%, pushing high-reasoning LLMs into mass-market territory. Input is now $2 per million tokens, output $8, with cacheable inputs as low as $0.50. “Flex mode” offers speed–cost tradeoffs, and benchmarks show o3 still holds its own against Claude Opus and Gemini Pro at a fraction of the price. This isn’t just a price cut, it’s a compression of the AI value curve, aimed at devs, startups, and researchers who need power without the premium.
We’re cutting the price of o3 by 80% and introducing o3-pro in the API, which uses even more compute.
o3:
Input: $2 / 1M tokens
Output: $8 / 1M tokens
Now in effect.
We optimized our inference stack that serves o3. Same exact model—just cheaper.
platform.openai.com/docs/models/o3
— OpenAI Developers (@OpenAIDevs)
8:18 PM • Jun 10, 2025
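To put the per-token prices in perspective, here is a rough sketch of what a single request might cost on each model. It uses only the list prices quoted above; the `estimate_cost` helper and the example workload are hypothetical, and the sketch ignores cached-input discounts and Flex mode.

```python
# Per-million-token prices in USD, taken from the figures quoted above.
PRICES_PER_MILLION = {
    "o3": {"input": 2.00, "output": 8.00},
    "o3-pro": {"input": 20.00, "output": 80.00},
}


def estimate_cost(model: str, input_tokens: int, output_tokens: int) -> float:
    """Return the estimated USD cost of one request (hypothetical helper)."""
    price = PRICES_PER_MILLION[model]
    total = input_tokens * price["input"] + output_tokens * price["output"]
    return total / 1_000_000


if __name__ == "__main__":
    # Illustrative workload: 50,000 input tokens and 10,000 output tokens.
    for model in PRICES_PER_MILLION:
        print(f"{model}: ${estimate_cost(model, 50_000, 10_000):.2f}")
```

For that illustrative workload, the estimate comes to about $0.18 on o3 and $1.80 on o3-pro, a tenfold gap at these list prices.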
The company’s open-weight model, once slated for June, has been delayed following a breakthrough it now plans to fold in. The model aims to bring o-series reasoning to the open ecosystem, potentially augmented by OpenAI’s cloud backend. Though the timing has slipped, the goal is strategic: redefine what “open” can do, even as rivals like Mistral and Qwen gain ground.
Behind it all, OpenAI has quietly signed a compute deal with Google Cloud, shifting from a Microsoft-first infrastructure stance to a more pragmatic, multi-cloud play. In the LLM arms race, compute is the real constraint, and even competitors now share silicon.
Looking ahead, OpenAI sees a horizon beyond answers. In The Gentle Singularity, Altman suggests that 2026 could bring models capable of producing genuinely novel insights. The company’s roadmap points toward generative cognition rather than imitation. That vision is already fueling tangible results: OpenAI now reports $10 billion in annual recurring revenue, doubling from $5.5 billion a year ago, powered by ChatGPT subscriptions, enterprise deployments, and developer API usage. With 500 million weekly active users and 3 million business customers, the company’s growth has been extraordinary, but so have its costs. Read more.
Google Upgrades Veo 3 Speed, Embeds Project Mariner in Chrome
Google is advancing its AI-native tooling with two closely linked developments. Veo 3 Fast significantly boosts the rendering speed of 720p video, more than doubling that of its predecessor, while maintaining integration across both the Gemini app and Flow. Gemini Pro users can now generate three videos daily, and Flow Pro users are charged 20 credits per output, while Ultra-tier users retain access to higher quality and generation limits. Google is also experimenting with multimodal prompts like voice-to-video and plans to scale access to Workspace accounts and international users.
🔥Veo 3 keeps growing like crazy. To keep up, we’re introducing Veo 3 Fast in @GeminiApp and Flow. It’s >2x faster, has the same 720p resolution, and a bunch of serving optimizations. The big headline: we can serve more of it, even for the Yetis!
How to get started:
1) Get a
— Josh Woodward (@joshwoodward)
6:19 PM • Jun 9, 2025
Running parallel to this is the gradual release of Project Mariner, an experimental browser-based agent embedded directly in Chrome. Available to Gemini Ultra subscribers, Mariner operates across open tabs and can handle navigation, form-filling, and transactional tasks through a prompt-driven chat interface. Designed with strict permission gating, the agent asks for user approval before taking actions, even for basic lookups, reflecting Google’s emphasis on privacy-aware automation. Read more.
BREAKING 🚨: Project Mariner is being rolled out to Gemini Ultra subscribers!
It works more like an Operator but has access to your open tabs in Chrome.
— TestingCatalog News 🗞 (@testingcatalog)
7:02 PM • Jun 9, 2025
Redwood AI equips humanoid robot ‘NEO’ for mobile tasks, whole-body manipulation
1X Technologies has launched Redwood, a lightweight AI model that turns its NEO humanoid into a home-capable autonomous agent. Trained on real-world robot data, Redwood lets NEO move, perceive, and act in domestic spaces, handling tasks like laundry, door answering, and indoor navigation. The model generalizes well, adapting to novel objects and retrying failed grasps. It enables whole-body, multi-contact manipulation, synchronizing locomotion and arm movement for mobile, dynamic control, including bracing and leaning.
Crucially, Redwood runs entirely on NEO’s onboard GPU, with voice-driven intent prediction handled by a connected language model. Unlike simulation-tuned AI, Redwood is grounded in practical deployment, as demoed at NVIDIA GTC 2025. For developers, it offers a path to scalable, compute-efficient humanoid autonomy. For robotics, it’s a signpost toward real-world utility. Read more.

- Multimodal Language Models Develop Intuitive Object Representations 
- AlphaOne Lets Developers Tune LLM Reasoning Speed at Inference 
- Qualcomm Unveils First GenAI Smart Glasses That Work Without Cloud or Phone 
- Apple Opens the Gates to Smarter, Privacy-First App Development 
- China's AI chip tool QiMeng beats engineers, designs processors in just days 
- Qualcomm shares its vision for the future of smart glasses with on-glass Gen AI 
- IBM to Build Fault-Tolerant Quantum Computer 20,000x More Powerful by 2029 in New York 
- China's AlphaBot2 humanoid robot with first full-embodied AI works at auto factory 
- Top 15 Vibe Coding Tools Transforming AI-Driven Software Development in 2025 
- Google DeepMind and UK Government Unveil "Extract" Initiative 
- Krea AI Launches Free Beta of Krea 1, Its First In-House Image Model with Advanced Aesthetic Control 
- Yutori AI debuts Scouts, web-tracking agents for real-time alerts on anything from deals to rentals 
- Create Endless Cinematic Videos for Free with SkyReels-V2. Now Open-Sourced on GitHub 
- Scammers are using AI to enroll fake students in online classes, then steal college financial aid 

5 new AI-powered tools from around the web

arXiv is a free online library where researchers share pre-publication papers.


Thank you for reading today’s edition.

Your feedback is valuable. Respond to this email and tell us how you think we could add more value to this newsletter.
Interested in reaching smart readers like you? To become an AI Breakfast sponsor, reply to this email or DM us on 𝕏!





