AI Breakfast
Posts
AI Video Goes Open-Source

AI Video Goes Open-Source

AI Breakfast
October 16, 2024

Good morning. It’s Wednesday, October 16th.

Did you know: Concepts of the self-operating machine date back to the 16th century?

In today’s email:

Open-Source AI Video Generation
Adobe Firefly Video
Full-body Motion Capture App
OpenAI Strengthens Its Ranks
Google Goes Nuclear
5 New AI Tools
Latest AI Research Papers

You read. We listen. Let us know what you think by replying to this email.

Daily News for Curious Minds

Be the smartest person in the room by reading 1440! Dive into 1440, where 4 million Americans find their daily, fact-based news fix. We navigate through 100+ sources to deliver a comprehensive roundup from every corner of the internet – politics, global events, business, and culture, all in a quick, 5-minute newsletter. It's completely free and devoid of bias or political influence, ensuring you get the facts straight. Subscribe to 1440 today.

Today’s trending AI news stories

New AI model for hi-res video generation, Pyramid Flow, is available as open-source software

A team of researchers has introduced Pyramid Flow, an open-source AI model for generating high-resolution (768p) video imagery. This model provides a cost-effective alternative for creating virtual video content, bypassing the need for traditional filming. Pyramid Flow utilizes a multi-stage generation process, producing videos in lower resolutions before reaching the final output.

New Kling, Runway, Luma competitor?
Open text and image video generation model
Pyramidal Flow Matching for Efficient Video Generative Modeling
— AK (@_akhaliq)
4:52 AM • Oct 10, 2024

The inference shell can create a five-second video in 56 seconds at 384p resolution while using significantly less computing power and minimizing token requirements for generation. The developers have released the model's code on GitHub under an MIT License, along with sample videos demonstrating its performance and the open-source datasets utilized for training, totaling 10 million short videos. Read more.

Adobe Firefly Video: Generative AI Video Model

Adobe has begun rolling out its Firefly Video Model, an AI tool that generates video from text prompts, entering a competitive space alongside OpenAI’s Sora and Meta’s recent video AI efforts. While major players like ByteDance and Meta have released similar tools, Adobe is distinguishing itself by training models on legally cleared data, ensuring that commercial use is hassle-free.

Although a general release date for the tool is still pending, Adobe has opened access to users on its waitlist. PepsiCo's Gatorade and Mattel’s Barbie packaging designs are already utilizing Adobe's image AI, signaling practical applications for its technology.

Our @Adobe Firefly Video model, the first designed for commercial use, will help our creative community tell a lot of stories. Couldn't be happier for the team that make it happen. Also, I've been to #AdobeMAX since 2003 and I love our customer reaction to magical tech ;). Drives… x.com/i/web/status/1…
— Alexandru Costin (@acostin)
1:47 PM • Oct 14, 2024

Adobe's focus with Firefly is on fine-grain control, offering video creators tools that integrate seamlessly with traditional footage. This positions Adobe to cater to everyday users, especially video editors, by enabling precise control over elements like camera motion and angles.

Alongside the Firefly Video Model, Adobe has also enhanced its Photoshop tools and introduced new features in Illustrator and InDesign, further solidifying its commitment to advancing AI capabilities within its Creative Cloud suite. Read more.

New app performs real-time, full-body motion capture with a smartphone

Northwestern University engineers have introduced MobilePoser, a new app that performs real-time, full-body motion capture using sensors already embedded in consumer devices like smartphones, smartwatches, and wireless earbuds.

By utilizing inertial measurement units (IMUs) and advanced AI algorithms, MobilePoser accurately estimates joint positions, rotations, and walking speed without requiring specialized equipment. A physics-based optimizer further enhances motion prediction accuracy.

With a tracking error of 8 to 10 centimeters, the system adapts to various device setups and has potential applications in gaming, fitness, and healthcare, allowing for improved tracking of mobility and posture analysis. Read more.

OpenAI Strengthens Its Ranks

OpenAI has landed two strategic hires after a mass exit earlier this month. First up is Sebastian Bubeck, previously Microsoft’s VP of generative AI research, credited with shaping the Phi models that power efficient on-device AI through smaller language and vision frameworks. As the appetite for nimble, privacy-first models surges, Bubeck is set to tackle efficiency and model development—areas where OpenAI has lagged behind.

Joining him is Dane Stuckey, ex-CISO of Palantir, who brings a wealth of experience in digital forensics and incident response. Stuckey’s new role coincides with OpenAI’s push to strengthen its security infrastructure, particularly in its partnerships with the U.S. Department of Defense. OpenAI recently lifted its ban on selling AI technology to the military and has been working on various projects with the Pentagon, including cybersecurity initiatives. Read more.

Google signs deal with nuclear company as data center power demand surges

Google has partnered with Kairos Power to address the rising power demands of its data centers, which are increasingly reliant on clean, reliable energy. Under the agreement, Google will source power from Kairos' small modular reactors (SMRs), the first of which is projected to be online by 2030, with 500 megawatts added to the grid by 2035.

Kairos' reactors, using molten fluoride salt instead of traditional water coolant, offer a more scalable and potentially cost-effective nuclear solution. This move aligns with a broader trend of tech companies, such as Microsoft and Amazon.

Nuclear energy, offering round-the-clock, emissions-free power, is seen as vital for meeting the industry's growing needs without compromising on sustainability targets. Read more.

5 new AI-powered tools from around the web

The Data Automation Platform for Every Business

Parseflow is the data automation platform that helps businesses of all sizes automate data processing. Stop manual data entry and save money with Parseflow

www.parseflow.io

Code2.AI - Turn Your Ideas into Code in Minutes

code2.ai

Strella - AI-Powered Customer Research

As companies move faster, so should their research. Strella's customer research platform uses AI-moderated interviews and real-time synthesis to deliver human insights 10x faster.

www.strella.io