Good morning. It’s Wednesday, December 17th.

On this day in tech history: In 1976, AI researcher Jen Golbeck was born on December 16. A pioneer in explainable AI and social network analysis, her work on trust metrics in recommender systems (like predicting user behavior via graph theory) is a deep cut for semantic web nerds. Her "Computing with Social Trust" framework influenced hybrid AI models blending human data with ML, still relevant in bias detection today.

In today’s email:

OpenAI's Triple Strike
Gemini's total system dominance
Anthropic tests agentic tasks mode for Claude
5 New AI Tools
Latest AI Research Papers

You read. We listen. Let us know what you think by replying to this email.

^{In partnership with Doctly}

Your documents aren’t data. Until Doctly.

Most critical business information lives inside PDFs, scans, and unstructured files — unreadable by AI and slow for humans. Doctly converts messy documents into structured, machine-readable outputs so teams can power RAG systems, agentic workflows, and automated operations with confidence.

Why teams use Doctly

Intelligent Extraction: Understands document structure and meaning, not just text
Structured Outputs: JSON, CSV, or custom schemas ready for databases and workflows
Verifiable RAG on your knowledge base: Get precise answers grounded in your documents, with clear citations back to the source.
Agentic Automation: Auto-drafts forms, proposals, and reports from real data
Production-ready: Accurate, consistent, fully automated pipelines

Automate document workflows in minutes, not months.

Try for free

Today’s trending AI news stories

OpenAI's Triple Strike: GPT Image 1.5 launch, GPT-5 79x bio gains, and updated voice models

OpenAI’s new flagship image generator, GPT Image 1.5, is live, integrated into ChatGPT and available via API. This update cuts through it with massive gains in speed, razor-sharp instruction-following, and pixel-level precision. Now you iterate directly on images, shift styles, and refine details without ever regenerating the whole thing. The new Images workspace drops preset filters, trending prompts, and curated starters. Early tests put its quality on par with Google’s Nano Banana Pro.

— # (#)

Meanwhile, GPT-5 is now an active scientist. Partnering with Red Queen Bio, the model entered the wet-lab, optimizing protocols and adapting based on live experimental feedback. The payoff was massive: a 79x efficiency gain in molecular cloning and proving AI can combine biological concepts innovatively.

— # (#)

OpenAI also strengthened its voice layer with three new Realtime API models, reducing transcription errors, cutting text-to-speech word error rates by 35%, and boosting instruction-following by 22%, alongside expanded multilingual support.

— # (#)

OpenAI has also reversed the automated model router, removing the unpredictable system that frustrated users with inconsistent results. This rollback signals a return to a predictable default experience, prioritizing user control and stability at scale following a critical failure of optimization.

Former UK Chancellor George Osborne has been appointed to lead OpenAI’s global Stargate data center expansion, focusing on international infrastructure and government adoption. At the same time, Chief Communications Officer Hannah Wong will step down in January, leaving a team she guided through high-profile challenges. Read more.

Gemini's total system dominance: agent clarity and workflow acquisition

Google is no longer just iterating; it is executing total system upgrades across its AI infrastructure.

Eliminating friction in your daily output, Google’s new CC assistant is designed to kill the time-wasting morning scroll. Integrated deeply with your Gmail, Calendar, and Drive, CC delivers a personalized "Your Day Ahead" summary and executes tasks immediately (drafting, scheduling). This is contextual intelligence leveraged for maximum productivity.

— # (#)

Concurrently, the upgraded Gemini 2.5 Flash Native Audio model tightens the voice control loop, increasing developer instruction compliance to 90% and demonstrating a superior 71.5% accuracy on complex function calls, making voice the most precise way to control your workflows.

Google is collapsing the distance between raw research and executed action. NotebookLM, the research powerhouse, is now directly integrated into Gemini. Users can select multiple notebooks as a Retrieval-Augmented Generation (RAG) context, instantly turning deep documents into flexible knowledge bases.

— # (#)

This integration fuels the overhauled Gems Manager, which now features Super Gems (Opal apps) and an auto-generating Workflow Builder. The system is building a unified environment where research insights, personalized context, and precise action can be deployed immediately and at will. Control and context are now centralized. Read more.

Anthropic tests agentic tasks mode for Claude

Anthropic is testing a new Agentic Tasks Mode for Claude that reframes the assistant as a system for getting work done, not just answering prompts. The update introduces a switch between traditional chat and agent mode, steering users toward structured task delegation instead of open-ended conversation.

— # (#)

In agent mode, Claude presents five core work paths: Research, Analyze, Write, Build, and Do More, each tuned for a different stage of knowledge work. Research emphasizes source control and effort tuning, while Analyze focuses on validation, comparisons, and forecasting. Write generates structured outputs across documents, slides, and spreadsheets, and Build supports visual layouts, interactive assets, or code. A progress tracker and context manager show task steps and active inputs in real time. Read more.

5 new AI-powered tools from around the web

Stakpak

Stakpak 3.0 is an open-source DevOps agent in Rust, letting developers securely deploy and manage production infrastructure from the terminal or GitHub Actions with built-in safety.

stakpak.dev

Okara

Chat privately with AI without losing memory or context. Use Llama, Qwen, DeepSeek, and 20+ models in an encrypted AI chat built for professionals who value privacy.

okara.ai

QualGent

Ship tested code at the speed of thought, with AI QA agents that scale like infrastructure.

www.qualgent.ai

Quadratic

Quadratic makes spreadsheets smarter—ask questions in plain language, run Python or SQL, and connect directly to databases or files for faster, clearer analysis.

www.quadratichq.com

Dor Labs

With Dor Labs, you can turn a single prompt or reference into storyboards, thumbnails, and alternative shots. It even upscales them to publishable quality, speeding up pre-production—all while keeping your content private.

dorlabs.ai