AI's New (Legal?) Approach to Web Scraping

Good morning. It’s Monday, November 18th.

Did you know: On this day in 1970, Douglas Engelbart was granted a patent for his "X-Y Position Indicator for a Display System", more commonly known as the Computer Mouse.

In today’s email:

  • LLM’s New Web Scraping Technique

  • Elon & Sam’s Old Emails

  • NVIDIA Chips Overheating AI Supercomputers

  • 6 New AI Tools

  • Latest AI Research Papers

You read. We listen. Let us know what you think by replying to this email.

In partnership with WRITER

The fastest way to build AI apps

We’re excited to introduce Writer AI Studio, the fastest way to build AI apps, products, and features. Writer’s unique full-stack design makes it easy to prototype, deploy, and test AI apps – allowing developers to build with APIs, a drag-and-drop open-source Python framework, or a no-code builder, so you have flexibility to build the way you want.

Writer comes with a suite of top-ranking LLMs and has built-in RAG for easy integration with your data. Check it out if you’re looking to streamline how you build and integrate AI apps.

Today’s trending AI news stories

New "llms.txt" web standard could fundamentally change how LLMs read and process online content

The proposed "llms.txt" web standard gives language models a structured way to access and interpret website content. Introduced by AI expert Jeremy Howard, it addresses AI systems’ limits in processing large volumes of text by offering concise, AI-centric content guides. Positioned as a complement to existing web tools like robots.txt and sitemap.xml, llms.txt lets a website condense its essential information into Markdown-ready summaries and an AI-targeted index.

Yet, it doesn’t sidestep tough questions: Who owns the rights when AI reinterprets site content? How do creators monetize when bots do the browsing? Adoption hinges on developers seeing value in this format, but if embraced, it could reshape the AI-web interface. GitHub hosts the specs, leaving the door open for input on this potentially pivotal standard. Read more.
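For readers who want a feel for the format, here is a rough Python sketch of how a tool might read an llms.txt file. It assumes the layout described in the public proposal as we understand it: a title heading, a one-line blockquote summary, then sections of Markdown links. The sample file, the example.com URLs, and the `parse_llms_txt` helper are all made up for illustration and are not part of the spec.

```python
import re

# Hypothetical llms.txt content; the layout is an assumption based on the proposal.
SAMPLE = """\
# Example Project

> A one-sentence summary of what the site offers.

## Docs

- [Quick start](https://example.com/quickstart.md): Setup in five minutes
- [API reference](https://example.com/api.md)

## Optional

- [Changelog](https://example.com/changelog.md): Release history
"""

# Matches "- [title](url)" with an optional ": note" suffix.
LINK = re.compile(
    r"^- \[(?P<title>[^\]]+)\]\((?P<url>[^)]+)\)(?::\s*(?P<note>.*))?$"
)


def parse_llms_txt(text):
    """Return the title, summary, and per-section link lists of an llms.txt file."""
    doc = {"title": None, "summary": None, "sections": {}}
    current = None  # name of the section currently being read
    for raw in text.splitlines():
        line = raw.strip()
        if line.startswith("# ") and doc["title"] is None:
            doc["title"] = line[2:].strip()
        elif line.startswith("> ") and doc["summary"] is None:
            doc["summary"] = line[2:].strip()
        elif line.startswith("## "):
            current = line[3:].strip()
            doc["sections"][current] = []
        elif current is not None:
            match = LINK.match(line)
            if match:
                doc["sections"][current].append(
                    (match["title"], match["url"], match["note"] or "")
                )
    return doc


if __name__ == "__main__":
    parsed = parse_llms_txt(SAMPLE)
    print(parsed["title"], "-", parsed["summary"])
    for section, links in parsed["sections"].items():
        print(section, [title for title, _url, _note in links])
```

The point of the sketch is that a model-facing index can be read with a few lines of text handling, with no HTML parsing or page rendering involved.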

Elon Musk didn't want OpenAI to seem like Microsoft's "marketing bitch"

Newly disclosed emails from ongoing court proceedings reveal the escalating tensions between Elon Musk and OpenAI that culminated in his 2018 exit from the board. Musk, who co-founded OpenAI to challenge Google's DeepMind, grew disillusioned with the organization’s direction. By 2017, OpenAI leaders had raised concerns over Musk’s push for control of a potential for-profit entity, fueling accusations of power consolidation.

The relationship fractured further in 2018 when Musk criticized OpenAI’s leadership, rejected a proposed cryptocurrency funding model, and suggested incorporating Tesla’s AI. His demand for an organizational overhaul or a clean break led to his resignation.

While OpenAI cited conflict-of-interest issues, the leaked communications suggest a deeper discord. Musk, now pursuing AGI through xAI, continues to challenge OpenAI’s evolution, including its shift to a for-profit structure and Microsoft’s perceived influence. Read more.

New Nvidia AI chips overheating in servers, The Information reports

Nvidia’s new Blackwell AI chips are overheating when installed in server racks designed to hold up to 72 chips, according to The Information. The chips have already been delayed, and customers are now worried about whether they can get new data centers up and running on time.

Nvidia has asked its suppliers to redesign the server racks several times in an attempt to fix the overheating problem. The company maintains that such engineering iterations are standard.

The Blackwell chips, announced in March, promise to be 30 times faster than previous models at tasks like serving chatbot responses. However, shipping delays may affect major customers such as Meta, Google, and Microsoft. Read more.

6 new AI-powered tools from around the web

arXiv is a free online library where researchers share pre-publication papers.

Thank you for reading today’s edition.

Your feedback is valuable. Respond to this email and tell us how you think we could add more value to this newsletter.

Interested in reaching smart readers like you? To become an AI Breakfast sponsor, reply to this email or DM us on X!