Curated by THEOUTPOST
On Fri, 1 Nov, 4:02 PM UTC
2 Sources
[1]
This Is a Glimpse of the Future of AI Robot
Physical Intelligence, a well-funded startup chasing breakthroughs in robotic intelligence, has developed a robot capable of doing various household chores remarkably well. The idea of a robot that does a wide range of household chores, from unloading the dryer to folding laundry to cleaning up a messy table, has long seemed like pure science fiction -- perhaps most famously embodied by the 1960s fantasy that was Rosey in The Jetsons. Physical Intelligence, a startup in San Francisco, has shown that such a dream might actually not be so far off, demonstrating a single artificial intelligence model that has learned to do a wide range of useful home chores -- including all of the above -- by being trained on an unprecedented amount of data. The feat raises the prospect of bringing something as magical and generally capable as other AI models like ChatGPT into the physical world. The advent of large language models (LLMs) -- general-purpose learning algorithms fed vast swaths of text from books and the internet -- has given chatbots vastly more general capabilities. Physical Intelligence aims to create something similarly capable in the physical world by training a similar kind of algorithm with enormous amounts of robotic data instead. "We have a recipe that is very general, that can take advantage of data from many different embodiments, from many different robot types, and which is similar to how people train language models," says the company's CEO Karol Hausman. The company has spent the past eight months developing its "foundation model," called π0 or pi-zero. π0 was trained using huge amounts of data from several types of robots doing various domestic chores. The company often has humans teleoperate the robots to provide the necessary teaching.
[2]
Incredible generalist robots do your laundry and dishes
Pi-zero enables the robots to open the washing dryer, fill the laundry hamper, close the dry and then fold the clothes in ways specific to each item Emerging startup Physical Intelligence has no interest in building robots. Instead, the team has something better in mind: powering the hardware with the continuously learning generalist 'brains' of AI software, so existing machines will be able to autonomously carry out a growing amount of tasks that require precise movements and dexterity - including housework. Over the past year we've seen robot dogs dancing, even some equipped to shoot flames, as well as increasingly advanced humanoids and machines built for specialist roles on assembly lines. But we're still waiting for our Rosey the Robot from The Jetsons. But we may be there soon. San Francisco's Physical Intelligence (Pi) has revealed its generalist AI model for robotics, which can empower existing machines to perform various tasks - in this case, getting the washing out of the dryer and folding clothes, delicately packing eggs into their container, grinding coffee beans and 'bussing' tables. It's not a stretch to imagine that this system could see these mobile metal helpers rolling through the house, vacuuming, packing and unpacking the dishwasher, making the bed, looking in the refrigerator and pantry to catalog their contents and coming up with a plan for dinner - and, hey, why not, also cooking that dinner. It's with this vision that Pi reveals its "general-purpose robot foundational model" known as π (pi-zero). "We believe this is a first step toward our long-term goal of developing artificial physical intelligence, so that users can simply ask robots to perform any task they want, just like they can ask large language models (LLMs) and chatbot assistants," the company explains. "Like LLMs, our model is trained on broad and diverse data and can follow various text instructions. Unlike LLMs, it spans images, text, and actions and acquires physical intelligence by training on embodied experience from robots, learning to directly output low-level motor commands via a novel architecture. It can control a variety of different robots, and can either be prompted to carry out the desired task, or fine-tuned to specialize it to challenging application scenarios." In their research, pi-zero demonstrates how a variety of jobs requiring different levels of dexterity and movements can be performed by hardware trained by the AI. In total, the foundational model carried out 20 tasks, all requiring different skills and manipulations. "Our goal in selecting these tasks is not to solve any particular application, but to start to provide our model with a general understanding of physical interactions - an initial foundation for physical intelligence," the team notes. Now, I'm the last person at New Atlas to get excited about robotics, largely because most of what we've seen have been specialist machines - and, to be honest, I've had my fill of humanoids moving boxes from point A to B. In biology, specialists are very good at exploiting one niche - for example bees, butterflies and the koala - and do it exceptionally well. That is, until external forces such as habitat loss or disease, reveals their limitations. However, generalists - like a racoon or a grizzly bear - may not be as good at occupying one niche as others, but they're far more adaptable to a wider range of habitats and food sources. Which ultimately makes them more suited to dynamic changes in the environment. Similarly, generalist robots will be able to do more than expertly build a brick wall; and, capable of learning, they will be able to adapt to different challenges in the physical world and have a suite of ever-evolving skills. Pi-zero uses internet-scale vision-language model (VLM) pre-training with flow matching to synchronize its movements with its AI learnings. Its pre-training included 10,000 hours of "dexterous manipulation data" from seven different robot configurations, as well as 68 tasks. This was in addition to existing robot manipulation datasets from OXE, DROID and Bridge. "Dexterous robot manipulation requires pi-zero to output motor commands at a high frequency, up to 50 times per second," the team notes. "To provide this level of dexterity, we developed a novel method to augment pre-trained VLMs with continuous action outputs via flow matching, a variant of diffusion models. Starting from diverse robot data and a VLM pre-trained on Internet-scale data, we train our vision-language-action flow matching model, which we can then post-train on high-quality robot data to solve a range of downstream tasks. "To our knowledge, this represents the largest pre-training mixture ever used for a robot manipulation model," the researchers noted in their study. While the company is still in its early days of research and development, Pi co-founder and CEO Karol Hausman - a scientist who previously worked on robotics at Google - believes its foundational model will overcome existing hurdles in the field of generalisation, including the amount of time and cost involved in training the hardware on physical world data in order to learn new tasks. The Pi team also includes co-founder Sergey Levine, who has pioneered robotics development at Stanford University and Brian Ichter, former research scientist at Google. In 2023, satirist and architect Karl Sharro went viral with his tweet: "Humans doing the hard jobs on minimum wage while the robots write poetry and paint is not the future I wanted." The same year, Hollywood ground to a halt as members of the Writers Guild of America went on strike, seeing the bleak path ahead for creatives in the face of this new age of technology. And while AI may still be coming - and has already come - for many of our jobs (you don't have to remind us journalists of that), Pi's vision feels more in line with those of the mid-20th century futurists, who saw a world in which the machines made our lives easier. Call me naive, perhaps, but if a robot comes for my housework, it can take it. You can see more videos of the drills the team put the pi-zero robots through on the Pi blog post, but here's one that demonstrates its impressive - and delicate - work. The research paper on pi-zero's development and training can be found here.
Share
Share
Copy Link
Physical Intelligence, a San Francisco startup, has developed π0 (pi-zero), a generalist AI model for robotics that enables various robots to perform a wide range of household tasks with remarkable dexterity and adaptability.
San Francisco-based startup Physical Intelligence has introduced π0 (pi-zero), a revolutionary "general-purpose robot foundational model" that promises to bring the capabilities of large language models (LLMs) into the physical world of robotics 12. This development marks a significant step towards creating artificial physical intelligence that can perform a wide range of household tasks with remarkable dexterity and adaptability.
π0 is designed to empower existing robotic hardware with a continuously learning, generalist "brain" capable of autonomously carrying out various tasks requiring precise movements and dexterity 2. The model has demonstrated proficiency in multiple household chores, including:
The development of π0 involved an unprecedented amount of robotic data and innovative training techniques:
Physical Intelligence's approach to creating a generalist AI model for robotics has several significant implications:
While Physical Intelligence is still in its early stages of research and development, the potential impact of π0 on the robotics industry and daily life is substantial. The company's vision aligns with the long-standing dream of having capable household robots, reminiscent of pop culture icons like Rosey from The Jetsons 1.
As the field of AI continues to evolve, the integration of physical intelligence with advanced language models could lead to more versatile and helpful robotic assistants in homes, workplaces, and various industries. However, this progress also raises questions about the future of human labor and creativity in an increasingly automated world 2.
Reference
Figure AI unveils Helix, an advanced Vision-Language-Action model that enables humanoid robots to perform complex tasks, understand natural language, and collaborate effectively, marking a significant leap in robotics technology.
9 Sources
9 Sources
Robotics startup Physical Intelligence raises $400 million in funding, backed by Jeff Bezos and OpenAI, to develop versatile AI software for robots capable of performing diverse tasks across industries.
13 Sources
13 Sources
MIT researchers have created a new method called Heterogeneous Pretrained Transformers (HPT) that uses generative AI to train robots for multiple tasks more efficiently, potentially revolutionizing the field of robotics.
6 Sources
6 Sources
Google DeepMind unveils Gemini Robotics and Gemini Robotics-ER, advanced AI models designed to control robots with improved generalization, adaptability, and dexterity. These models, built on the Gemini 2.0 language model, aim to create more intuitive and capable robots for various tasks.
27 Sources
27 Sources
Norwegian startup 1X unveils Neo Gamma, a humanoid robot designed for household tasks, sparking discussions about the future of home automation and human-robot interactions.
3 Sources
3 Sources
The Outpost is a comprehensive collection of curated artificial intelligence software tools that cater to the needs of small business owners, bloggers, artists, musicians, entrepreneurs, marketers, writers, and researchers.
© 2025 TheOutpost.AI All rights reserved