- AI Breakfast
- Posts
- Anthropic's AI Agent
Anthropic's AI Agent
Good morning. It’s Wednesday, October 23rd.
Did you know: On this day in 2001, Apple introduced the 1st Generation iPod?
In today’s email:
Anthropic’s AI Agent
Unreal Engine 5 metaverse
Ideogram’s “Infinite Canvas”
Building no-code AI Agents
Runway’s “One Act”
5 New AI Tools
Latest AI Research Papers
You read. We listen. Let us know what you think by replying to this email.
Today’s trending AI news stories
Anthropic's Latest AI Model Can Interact With Your Computer
Anthropic's Claude 3.5 Sonnet introduces AI that can interact with a computer much like a human. Now in beta, this model allows developers to program the AI to view screens, move cursors, click buttons, and input text using an API. While still experimental and prone to errors, Claude 3.5 Sonnet is already being used by companies like Asana, Canva, and Replit to automate tasks and test software.
Claude "learns" by processing screenshots and counting pixels to figure out where to point the cursor, but more advanced moves like dragging windows still elude it. Despite some missteps, it’s already outperforming other models in tests, slowly closing the gap between AI and human-level interaction.
Available on platforms like Amazon Bedrock and Google Cloud's Vertex AI, Claude is a work-in-progress, with Anthropic fine-tuning its abilities through developer feedback, aiming for smoother, more reliable performance in computer tasks down the line. Read more.
NetVRk launches its AI-powered metaverse with Unreal Engine 5 graphics
NetVRk has launched the alpha version of its AI-driven metaverse, built on Unreal Engine 5. This platform incorporates a Language Model (LLM) to enhance interactions, allowing players to engage with emotionally intelligent non-player characters (NPCs) that exhibit real-time memory and responsiveness.
Users can customize and trade these NPCs as non-fungible tokens (NFTs) while earning rewards. With a team of about 14 members and $10 million raised, NetVRk's vision combines AI, user-generated content, and blockchain functionalities to redefine virtual engagement and interaction. Read more.
AI startup Ideogram launches infinite Canvas for manipulating, combining generated images
Ideogram, a Canadian AI startup, has launched "Canvas," an infinite workspace allowing users to manipulate and combine generated images. This feature enables resizing, reordering, and uploading personal visuals.
Today, we’re introducing Ideogram Canvas, an infinite creative board for organizing, generating, editing, and combining images.
Bring your face or brand visuals to Ideogram Canvas and use industry-leading Magic Fill and Extend to blend them with creative, AI-generated content.
— Ideogram (@ideogram_ai)
4:05 PM • Oct 22, 2024
Known for its precise text embedding in images, Ideogram also introduced two new tools: Magic Fill, which edits specific areas using text prompts, and Extend, which expands images while maintaining style consistency. Available through various subscription plans. Read more.
Asana launches a no-code tool for designing AI agents - aka your new 'teammates'
Asana has launched AI Studio, a no-code tool that empowers teams to create AI agents designed to optimize workflows. Introduced at the Work Innovation Summit, this feature extends Asana's existing rules engine, allowing users to customize AI agents for various project phases, from intake to reporting. Teams can upload context-specific data, tailoring AI-generated content and task updates. Positioned as virtual teammates, these agents help with tasks like drafting materials and tracking efficiency, aiming to alleviate the 53% of employee time spent on low-value tasks. Early access is available for Enterprise customers, with advanced tiers coming soon. Read more.
Runway's Act-One uses smartphone cameras to replicate facial expression motion capture
Runway AI Inc. has introduced Act-One, a tool that simplifies facial animation in AI-generated video. Users can record their facial expressions using a smartphone camera, mapping them onto digital characters to enhance animation quality. While available to all Runway users, advanced features of the Gen-3 Alpha model require credits.
Act-One allows for improved facial accuracy, enabling creators to replicate diverse expressions on avatars, making it accessible for animators, game developers, and indie filmmakers without the need for costly equipment. Built-in safeguards promote ethical use, allowing users to explore storytelling in a responsible manner. Read more.
Stability claims its newest Stable Diffusion models generate more 'diverse' images
Elon Musk's xAI API launches, letting developers build atop Grok
Gartner: 2025 will see the rise of AI agents (and other top trends)
Microsoft releases autonomous Copilot agents from closed beta
AI video startup Genmo launches Mochi 1, an open source rival to Runway, Kling, and others
ByteDance intern fired for planting malicious code in AI models
News Corp sues Perplexity for ripping off WSJ and New York Post
Daze, a creative, AI-powered messaging app for Gen Z, is blowing up prelaunch
Google DeepMind and MIT's new "Fluid" model outperforms diffusion models in image generation
More than 11,000 creatives condemn unauthorized use of content for AI development
Microsoft and OpenAI are giving news outlets $10 million to use AI tools
5 new AI-powered tools from around the web
arXiv is a free online library where researchers share pre-publication papers.
Thank you for reading today’s edition.
Your feedback is valuable. Respond to this email and tell us how you think we could add more value to this newsletter.
Interested in reaching smart readers like you? To become an AI Breakfast sponsor, reply to this email or DM us on X!