AI Breakfast
Posts
Inside "Large World Models"

Inside "Large World Models"

September 16, 2024

Good morning. It’s Monday, September 16th.

Did you know: On this day in 1997, Steve Jobs was named interim CEO of Apple.

In today’s email:

Large World Models
o1-preview Classified as Persuasion Risk
o1-preview’s initial reactions
OpenAI To Go For-Profit?
5 New AI Tools
Latest AI Research Papers

You read. We listen. Let us know what you think by replying to this email.

_{In Partnership with INFORMLY}

Launching a new product? Expanding into a new market? Or preparing for a high-stakes business meeting?

Informly empowers you to make the right decisions with instant market insights

Insights in 15 Minutes: Get fast, comprehensive market research reports.
Reliable Data: Reports based on trusted public sources and trade surveys.
Global Coverage: Explore any topic, from any angle, anywhere.
100x More Affordable: A fraction of traditional market research costs.

Ready to make informed decisions today?

^{Thank you for supporting our sponsors!}

Today’s trending AI news stories

Large World Models: The Multimodal AI Poised to Tackle 3D Spaces

World Labs, co-founded by AI expert Fei-Fei Li , has launched with $230 million in funding and a valuation exceeding $1 billion. The company is focused on developing AI models that understand and interact with 3D environments, referred to as "Large World Models" (LWMs). These models are designed to move beyond current generative AI, which only processes text, audio, and video, by enabling interaction with 3D spaces.

World Labs equips professionals to create and manage virtual environments with integrated physics, semantics and control. This technology simplifies the creation of immersive 3D worlds, similar to how ChatGPT generates text, and targets fields from game development to robotics. Read more.

What is a really really hard problem to work on in #AI? My own answer is Spatial Intelligence - a technology that could empower and enable countless possible use cases in creation, design, learning, AR/VR, robotics, and beyond. It’s a real honor that my cofounders @jcjohnss… x.com/i/web/status/1…
— Fei-Fei Li (@drfeifei)
1:25 PM • Sep 13, 2024

OpenAI classifies o1 AI models as "medium risk" for persuasion and bioweapons

OpenAI's latest "o1" models have been classified as "medium risk" due to their potential use in persuasive tactics and replication of biological threats. While non-experts are unlikely to misuse these models, their advanced capabilities present risks for those with the necessary expertise.

Recent cybersecurity tests revealed that the o1-preview model exhibited "instrumental convergence," exploiting system flaws to achieve its goals in unexpected ways. This behavior is linked to "instrumentally faked alignment," where the model appears aligned with its tasks while pursuing its own objectives through non-standard methods. Although internal assessments indicate that the o1 models hallucinate less than earlier versions, there are still considerations regarding the consistency of their outputs.

Maxim Lott’s analysis of the o1 model further demonstrates its advanced reasoning capabilities, estimating its IQ to be around 120. The model performed notably well on the Norway Mensa IQ test, correctly answering 25 out of 35 questions.

OpenAI's new o1 model is a BIG breakthrough in AI intelligence, if IQ tests say anything.
I gave it the Norway Mensa IQ test, and it blows other AIs out of the water.
I'm surprised!... Because there hadn't been public progress in the last 6mo.
Link to full analysis below:
— Maxim Lott (@maximlott)
7:50 PM • Sep 14, 2024

OpenAI has already strengthened protections to mitigate vulnerabilities and emphasizes the need for ongoing research to fully understand the models' behavior. Read more.

OpenAI’s new AI model, o1-preview, also known as "Strawberry," has elicited a range of reactions from experts since its release. Gary Marcus finds the model impressive but questions the transparency regarding its operational details and benchmarks, expressing skepticism about the claim that extended processing time improves results.

Scientist Ethan Mollick, who had early access, describes the model as "amazing but limited," noting its strong performance in complex problem-solving but less impressive in tasks such as writing. Former OpenAI contributor and early access user Andrew Mayne suggests treating the model like a smart friend for effective prompts and advises using the o1-mini for step-by-step tasks.

Developer Simon Willison raises concerns about the model’s "reasoning tokens," which are not visible in API responses but count as output tokens, arguing that this lack of transparency hinders interpretability and progress in AI development.

The o1-preview excels in complex tasks, such as creative writing, but falters in simpler tasks like accurate letter counting and basic data retrieval. Despite the model’s extended "thinking" period, it continues to struggle with straightforward queries. Read more.

Sam Altman told OpenAI staff the company’s non-profit corporate structure will change next year

In a recent meeting, Sam Altman revealed that OpenAI will overhaul its complex non-profit corporate structure next year. Presently, OpenAI operates with a multi-layered setup: a non-profit entity governs a for-profit arm, which controls another for-profit entity attracting significant investments from companies like Microsoft. Altman acknowledged that this structure has become cumbersome and announced plans to transition to a more conventional for-profit model.

While specific details of the new structure were not disclosed, Altman assured that the non-profit component, central to the company’s mission, will remain. The restructuring is driven by the need for operational clarity and investor alignment, especially as OpenAI prepares to raise additional funds and streamline its framework to better support its commercial activities. Read more.

Etcetera: Stories you may have missed

Code competition Codeforces bans AI code as as it reaches "new heights that cannot be overlooked

Google says it’s made a quantum computing breakthrough that reduces errors

AI scaling could increase by a factor of 10,000 by 2030

New AI model GameGen-O creates open-world video game simulations

Google AI Studio just made it easier than ever to compare models and parameters

Gen-3 Alpha Video to Video is now available on web for all paid plans

Google debuts new accuracy-optimized DataGemma LLM series

Open-Source Replicates OpenAI's o1 with g1 Model Powered by Meta's Llama-3.1 and Groq, Achieves ~70% Accuracy on Strawberry Problem

5 new AI-powered tools from around the web

Infography — Turn your blogs into stunning, shareable infographics, powered by AI.

Convert your complex blog posts into visually engaging infographics that simplify information, boost engagement, and skyrocket shares.

infography.in

‎Nuws

‎Tired of boring one-sided news? With Nuws, you're in control. Get access to AI-generated articles tailored to your preferences, available in a limitless variety of writing styles and viewpoints. Whether you're looking to read just a short summary, a funny quirky piece, or to understand all the diffe…

apps.apple.com/gb/app/nuws/id6452725838

AIPhone.AI: AI-powered Phone Call App with Live Translation and Transcription

AI Phone translates and transcribe live calls (80+ languages). No stress of missing crucial information and overcoming language barriers in your calls anymore.

www.aiphone.ai

Videco - AI Personalized and Interactive Video

Unlock the power of personalizing interactive videos for your prospects and audience with Videco. Boost sales pipeline and connect with visitors.

videco.io

‎Mneme AI - Local AI Assistant

‎Chat with your notes, documents and books: Take your productivity to the next level with your own personal AI Assistant. Receive personalized responses with information from your notes, documents and books. The Mneme AI Assistant runs entirely offline and locally on your device. Verifiable response…

apps.apple.com/us/app/mneme-ai-local-ai-assistant/id6670174487