- AI Breakfast
- Posts
- AI Video Goes Open-Source
AI Video Goes Open-Source
Good morning. It’s Wednesday, October 16th.
Did you know: Concepts of the self-operating machine date back to the 16th century?
In today’s email:
Open-Source AI Video Generation
Adobe Firefly Video
Full-body Motion Capture App
OpenAI Strengthens Its Ranks
Google Goes Nuclear
5 New AI Tools
Latest AI Research Papers
You read. We listen. Let us know what you think by replying to this email.
Daily News for Curious Minds
Be the smartest person in the room by reading 1440! Dive into 1440, where 4 million Americans find their daily, fact-based news fix. We navigate through 100+ sources to deliver a comprehensive roundup from every corner of the internet – politics, global events, business, and culture, all in a quick, 5-minute newsletter. It's completely free and devoid of bias or political influence, ensuring you get the facts straight. Subscribe to 1440 today.
Today’s trending AI news stories
New AI model for hi-res video generation, Pyramid Flow, is available as open-source software
A team of researchers has introduced Pyramid Flow, an open-source AI model for generating high-resolution (768p) video imagery. This model provides a cost-effective alternative for creating virtual video content, bypassing the need for traditional filming. Pyramid Flow utilizes a multi-stage generation process, producing videos in lower resolutions before reaching the final output.
New Kling, Runway, Luma competitor?
Open text and image video generation model
Pyramidal Flow Matching for Efficient Video Generative Modeling
— AK (@_akhaliq)
4:52 AM • Oct 10, 2024
The inference shell can create a five-second video in 56 seconds at 384p resolution while using significantly less computing power and minimizing token requirements for generation. The developers have released the model's code on GitHub under an MIT License, along with sample videos demonstrating its performance and the open-source datasets utilized for training, totaling 10 million short videos. Read more.
Adobe Firefly Video: Generative AI Video Model
Adobe has begun rolling out its Firefly Video Model, an AI tool that generates video from text prompts, entering a competitive space alongside OpenAI’s Sora and Meta’s recent video AI efforts. While major players like ByteDance and Meta have released similar tools, Adobe is distinguishing itself by training models on legally cleared data, ensuring that commercial use is hassle-free.
Although a general release date for the tool is still pending, Adobe has opened access to users on its waitlist. PepsiCo's Gatorade and Mattel’s Barbie packaging designs are already utilizing Adobe's image AI, signaling practical applications for its technology.
Our @Adobe Firefly Video model, the first designed for commercial use, will help our creative community tell a lot of stories. Couldn't be happier for the team that make it happen. Also, I've been to #AdobeMAX since 2003 and I love our customer reaction to magical tech ;). Drives… x.com/i/web/status/1…
— Alexandru Costin (@acostin)
1:47 PM • Oct 14, 2024
Adobe's focus with Firefly is on fine-grain control, offering video creators tools that integrate seamlessly with traditional footage. This positions Adobe to cater to everyday users, especially video editors, by enabling precise control over elements like camera motion and angles.
Alongside the Firefly Video Model, Adobe has also enhanced its Photoshop tools and introduced new features in Illustrator and InDesign, further solidifying its commitment to advancing AI capabilities within its Creative Cloud suite. Read more.
New app performs real-time, full-body motion capture with a smartphone
Northwestern University engineers have introduced MobilePoser, a new app that performs real-time, full-body motion capture using sensors already embedded in consumer devices like smartphones, smartwatches, and wireless earbuds.
By utilizing inertial measurement units (IMUs) and advanced AI algorithms, MobilePoser accurately estimates joint positions, rotations, and walking speed without requiring specialized equipment. A physics-based optimizer further enhances motion prediction accuracy.
With a tracking error of 8 to 10 centimeters, the system adapts to various device setups and has potential applications in gaming, fitness, and healthcare, allowing for improved tracking of mobility and posture analysis. Read more.
OpenAI Strengthens Its Ranks
OpenAI has landed two strategic hires after a mass exit earlier this month. First up is Sebastian Bubeck, previously Microsoft’s VP of generative AI research, credited with shaping the Phi models that power efficient on-device AI through smaller language and vision frameworks. As the appetite for nimble, privacy-first models surges, Bubeck is set to tackle efficiency and model development—areas where OpenAI has lagged behind.
Joining him is Dane Stuckey, ex-CISO of Palantir, who brings a wealth of experience in digital forensics and incident response. Stuckey’s new role coincides with OpenAI’s push to strengthen its security infrastructure, particularly in its partnerships with the U.S. Department of Defense. OpenAI recently lifted its ban on selling AI technology to the military and has been working on various projects with the Pentagon, including cybersecurity initiatives. Read more.
Google signs deal with nuclear company as data center power demand surges
Google has partnered with Kairos Power to address the rising power demands of its data centers, which are increasingly reliant on clean, reliable energy. Under the agreement, Google will source power from Kairos' small modular reactors (SMRs), the first of which is projected to be online by 2030, with 500 megawatts added to the grid by 2035.
Kairos' reactors, using molten fluoride salt instead of traditional water coolant, offer a more scalable and potentially cost-effective nuclear solution. This move aligns with a broader trend of tech companies, such as Microsoft and Amazon.
Nuclear energy, offering round-the-clock, emissions-free power, is seen as vital for meeting the industry's growing needs without compromising on sustainability targets. Read more.
ComfyGen AI automates multi-stage text-to-image workflows from simple prompts
Physicists uncover behavior in quantum superconductors that provides a new level of control
AI chatbots need some healthy disagreement to work best together, tech exec says
AI model simulates Counter-Strike with 10 FPS on a single RTX 3090
OpenR: An Open-Source AI Framework Enhancing Reasoning in Large Language Models
US mulls capping Nvidia AI chips exports to some countries, Bloomberg News reports
Anthropic just made it harder for AI to go rogue with its updated safety policy
Alibaba's international arm says its new AI translation tool beats Google and ChatGPT
Google supercharges Shopping tab with AI and personalized recommendation feed
Cognizant adds multi-agent functionality to AI application platform
OpenAI says ChatGPT has much less gender bias than all of us
Meta researchers develop method to make AI models "think" before answering
How Tesla's plans for 'unsupervised FSD' and robotaxis could run into red tape
YouTube rolling out new miniplayer, fine-tunable playback speed, sleep timer, and more
NYT sends AI startup Perplexity 'cease and desist' notice over content use
5 new AI-powered tools from around the web
arXiv is a free online library where researchers share pre-publication papers.
Thank you for reading today’s edition.
Your feedback is valuable. Respond to this email and tell us how you think we could add more value to this newsletter.
Interested in reaching smart readers like you? To become an AI Breakfast sponsor, reply to this email or DM us on X!