• AI Breakfast
  • Posts
  • Altman declares ‘code red,’ fast-tracks new model ‘Garlic’ against Gemini surge

Altman declares ‘code red,’ fast-tracks new model ‘Garlic’ against Gemini surge

In partnership with

Good morning. It’s Wednesday, December 3rd.

On this day in tech history: In 2014, Facebook AI Research released the first open-source implementation of “fastText,” blending bag-of-words speed with subword embeddings. It looked humble next to deep networks, but its efficiency reshaped text classification benchmarks and proved that clever tokenization could beat brute-force scale. That idea, structural efficiency over parameter count, foreshadowed today’s small-model renaissance powering on-device AI and edge agents.

In today’s email:

  • Altman declares ‘code red,’ fast-tracks new model ‘Garlic’ against Gemini surge

  • Claude Code scales up with Bun as Anthropic demos AI DeFi hacks

  • Runway’s Gen-4.5 beats Google and OpenAI in text-to-video benchmark 

  • 5 New AI Tools

  • Latest AI Research Papers

You read. We listen. Let us know what you think by replying to this email.

Run ads IRL with AdQuick

With AdQuick, you can now easily plan, deploy and measure campaigns just as easily as digital ads, making them a no-brainer to add to your team’s toolbox.

You can learn more at www.AdQuick.com

Today’s trending AI news stories

Altman declares ‘code red,’ fast-tracks new model ‘Garlic’ against Gemini surge

OpenAI is scrambling to regain momentum as Google’s Gemini 3 ramps up, reclaiming benchmark leadership and prompting a 6–7% drop in ChatGPT’s active users. CEO Sam Altman has declared a company-wide ‘code red’, pausing commercial initiatives, including ads, shopping and health agents, and the Pulse personal-briefing tool, to focus engineering on core model improvements. Daily progress calls, temporary cross-team transfers, and an emphasis on lower latency, higher reliability, and richer reasoning now define OpenAI’s operating mode.

According to internal briefings reported by The Information, ‘Garlic’ is already posting strong results in coding, complex reasoning, and multi-step problem solving, areas where Google and Anthropic have recently surged with Gemini 3 and Opus 4.5. Garlic is being positioned as a next-generation architecture rather than a minor update, with early targets pointing to a release as GPT-5.2 or GPT-5.5 in early 2026.

OpenAI is also piloting Memory Search, a deeper system-level retrieval tool letting ChatGPT query long-term memory directly, mirroring Atlas Browser’s memory engine. If paired with GPT-5.2, it would shift ChatGPT toward persistent, context-rich workflows rather than episodic conversations.

Parallel to its technical sprint, OpenAI has entered a circular deal with Thrive Holdings, an offshoot of investor Thrive Capital. No cash changed hands; OpenAI supplies models, staff, and tech, while gaining access to operational data for training and potential returns. IT services and accounting, high-volume, rules-driven workflows, are the first target, positioning OpenAI as a modernization engine for private-equity portfolios. Read more.

Claude Code scales up with Bun as Anthropic demos AI DeFi hacks

The company recently completed its first acquisition, snapping up developer tools firm Bun to accelerate Claude Code, its AI coding assistant. Bun, built in Zig and leveraging Apple’s JavaScriptCore, combines a package manager, bundler, test runner, and script runner into a single executable. Its fast startup times, low memory footprint, and broad Node.js compatibility will enhance Claude Code’s execution speed, stability, and scalability.

At the same time, Anthropic’s research is pushing AI into high-stakes territory. Their team used Claude 4.5 Opus, Sonnet 4.5, and GPT-5 to test 405 DeFi smart contracts that had already been hacked, successfully creating full exploit scripts and simulating $4.6 million in gains. They also ran tests on 2,849 previously untouched contracts on the BNB Chain, uncovering new vulnerabilities for just $1.22 per test. The findings show that AI can now carry out automated attacks that are both technically possible and cost-effective, and that these techniques could eventually target regular software and critical systems too.

Underlying these advances is Anthropic’s unique alignment approach, captured in the leaked and verified “Soul Document.” The 14,000-token blueprint embeds safety, ethics, and operational guidelines directly into Claude’s weights, creating functional self-awareness, resilience, and context-aware reasoning. Claude prioritizes human oversight, anticipates prompt injection attacks, and enforces strict “bright lines” to prevent harmful outputs. Read more.

Runway’s Gen-4.5 beats Google and OpenAI in text-to-video benchmark 

Runway has released Gen-4.5, its latest text-to-video AI, delivering more realistic, responsive, and cinematic outputs. Objects move with believable weight and momentum, liquids flow naturally, and scenes follow complex prompts with higher visual fidelity. It handles both photorealistic and stylized visuals efficiently, running on Nvidia Hopper and Blackwell GPUs.

Gen-4.5 scored 1247 on the Artificial Analysis benchmark, just ahead of Google’s Veo 3 and Kling’s Version 2.5, showing improved modeling of physical interactions and scene coherence. Some limitations remain. The model can misjudge causality, lose object permanence, while a success bias can exaggerate task outcomes. Read more.

5 new AI-powered tools from around the web

arXiv is a free online library where researchers share pre-publication papers.

Thank you for reading today’s edition.

Your feedback is valuable. Respond to this email and tell us how you think we could add more value to this newsletter.

Interested in reaching smart readers like you? To become an AI Breakfast sponsor, reply to this email or DM us on 𝕏!