In partnership with

Good morning. It’s Wednesday, February 26th.

On this day in tech history: In 1991, Tim Berners-Lee, a British computer scientist, introduced the world to "WorldWideWeb," the first web browser and a WYSIWYG (What You See Is What You Get) HTML editor.

In today’s email:

Claude Sonnet 3.7
Perplexity Web Browser
AI Audio Generation for Video
OpenAI drops Deep Research access to Plus users
DeepSeek Rushes For New AI Model
4 New AI Tools
Latest AI Research Papers

You read. We listen. Let us know what you think by replying to this email.

In partnership with ALTA

Atla launches Selene 1, a state-of-the-art LLM Judge

Atla releases Selene 1, an LLM trained to evaluate AI responses for quality, correctness, and more. Instead of generating answers, Selene scores and critiques them, and does so more effectively than any other model on the market.

Selene beats frontier models from leading labs–including OpenAI's o-series, Anthropic's Claude 3.5 Sonnet, DeepSeek's R1–across 11 commonly used benchmarks for evaluators.

Selene can be used for a range of evaluation tasks, from detecting hallucinations to checking correctness in specific domains. Its instruction-following capabilities also enable users to create and run evals with custom evaluation criteria.

Selene is available via API/SDK. Atla also released an Alignment Platform, which helps developers generate, test, and refine custom evaluation metrics for specific use cases.

Start for free

Today’s trending AI news stories

Claude 3.7 Sonnet Introduces ‘Thinking Mode’ to Balance Speed and Depth in AI Responses

Anthropic is raising the stakes in enterprise AI with Claude 3.7 Sonnet, a model that lets users fine-tune its depth of reasoning on demand. The ‘thinking mode’ toggle shifts between instant responses and deep analysis, challenging OpenAI’s modular approach and DeepSeek’s cost-cutting strategy. Benchmarks put Claude 3.7 at 78.2% accuracy on graduate-level reasoning tasks, outpacing DeepSeek-R1 in instruction-following. Rather than chasing raw speed, Anthropic is betting on AI that adapts to complexity without switching models.

— # (#)

Alongside Claude Code—a command-line AI developer assistant with built-in human oversight—the company is positioning itself as the alternative to fragmented AI stacks. If enterprises buy in, this could mark a shift away from model juggling toward AI that actually thinks before it speaks. Read more.

Perplexity teases web browser ‘Comet’

Comet is touted to be an agentic web browser. (Image: Perplexity)

Perplexity AI is taking its search ambitions up a notch with Comet, an AI-powered web browser. While details remain scarce, Perplexity claims Comet will “reinvent” browsing, much like it did with search. This move follows the company’s streak of rapid expansions—an AI assistant for Android, a deep research tool, and an API—each reinforcing its bid to own the AI-first internet experience.

Despite its growth, Perplexity faces mounting legal scrutiny. As Perplexity scales its monetization efforts including advertising and enterprise offerings, Comet’s success may hinge on whether the company can navigate both technical innovation and legal headwinds without losing momentum. Read more.

Luma AI Adds Free Audio Generation to DreamMachine Video Tool

Luma AI has added audio generation to its DreamMachine video tool, allowing users to create soundtracks for their AI-generated videos. The new "Audio" feature enables automatic sound generation with a single click or customized audio using text prompts.

— # (#)

Currently in beta, the feature is available for free to all users. This update expands DreamMachine’s capabilities, integrating AI-driven visuals and audio within a single workflow. Read more.

OpenAI drops Deep Research access to Plus users, heating up AI agent wars with DeepSeek and Claude

OpenAI has expanded access to Deep Research, its AI-powered analysis engine, bringing ChatGPT Plus, Team, Education, and Enterprise users into the fold. Running on a specialized o3 variant, it autonomously pulls, structures, and synthesizes insights across text, images, and PDFs—delivering intelligence that mimics human analysis.

— # (#)

It’s a calculated move. DeepSeek’s open-source R1 is shaking up the AI market, undercutting OpenAI’s paywalled grip. Perplexity is running R1 for a fraction of the cost, while Anthropic’s Claude 3.7 Sonnet leans into transparency with explicit reasoning trails.

Subscription tiers, however, remain firmly in place. Plus users get 10 queries a month, while Pro subscribers—paying ten times more—unlock 120. Read more.

DeepSeek rushes to launch new AI model as China goes all in

Fresh off the market-shaking debut of R1, DeepSeek is now accelerating the launch of its R2 model, originally set for May. R2 is expected to enhance code generation and expand multilingual reasoning, further strengthening DeepSeek’s competitive position.

Once viewed with skepticism by Beijing over its massive chip acquisitions, DeepSeek now enjoys quiet state backing, with government-linked enterprises integrating its models at scale. Nvidia recently teamed up with DeepSeek to optimize R1 for Blackwell, driving a 25x revenue increase and 20x lower inference costs. Developers can now access an FP4-optimized DeepSeek checkpoint on Hugging Face.

The company has also resumed API access after a three-week pause due to capacity constraints. Developers can now top up credits, though server strain persists during peak hours. Its domestic rivals are accelerating development—Alibaba launched a preview of its own reasoning model, QwQ-Max, on the same day DeepSeek reopened API access. Read more.

5 new AI-powered tools from around the web

Analyst Agent by Fabi.ai: Build, deploy & share specialized AI data agents

Fabi.ai's dedicated AI data agents deliver reliable insights while giving data teams complete control over data quality and access. Enable true self-service analytics with specialized, domain-specific agents that work exclusively with your curated data.

www.fabi.ai/product/analyst-agent

Thunder Compute: Self-host AI/ML with the world's cheapest GPU cloud

Get your hands dirty with the same GPUs powering OpenAI, virtualized to save you 80%. Start with $20/month of free credit.

www.thundercompute.com

Build Full-Stack Web Applications in Minutes with Codev | AI App Builder

Transform your ideas into production-ready web applications with Codev. Our AI-powered platform converts text descriptions into full-stack Next.js apps instantly.

www.co.dev

didocs.ai - Your digital documents powered by AI

Quickly extract knowledge from multiple documents at once with AI.

didocs.ai/en

Proxed.AI - Secure iOS API with DeviceCheck for AI Models

Build secure AI wrappers with Proxed.AI - an iOS SDK with Apple DeviceCheck integration that safely manages API keys and unifies ChatGPT, Claude, LLaMA, and Mistral models. Start building for free during Beta.

proxed.ai