2 Sources
[1]
All the World's a Robot-Staging Ground for Tech Entrepreneurs Building 'Physical AI'
PROVIDENCE, R.I. (AP) -- Computer scientist Louis Castricato was in his eighth year studying large language models -- the artificial intelligence technology behind chatbots like ChatGPT and Claude -- when he started to feel like he was hitting a dead end. "We basically have passed the point of doing real fundamental LLM research," Castricato said. "Now it's just applications." The researcher quit his studies at Brown University and started a new company, called Overworld. Its ambition is in its name: AI that can understand and navigate a world, not just words. There's still plenty of money to be made from AI chatbots -- investors are counting on it as they commit trillions of dollars to leading developers like Anthropic and OpenAI. But a growing number of AI entrepreneurs are dedicating themselves to what they see as the next frontier: "world models" that teach AI systems, and sometimes robots, how to react in a physical environment. They include some of the field's most prominent scientists, such as "Godmother of AI" Fei-Fei Li, who describes the concept of a world model as "one of the most important and most overloaded terms in AI today." Scientists are applying AI in new dimensions with 'world models' At the heart of world model research is the idea that AI can't be truly intelligent if it can only read a book. It also needs to read the room. "Where language models learn the statistical structure of text, world models learn the statistical structure of space and time: how light falls on a surface, how a garden looks from an angle no camera has captured, how objects respond to force and follow the laws of physics," wrote Li, founder of the San Francisco startup World Labs, in an essay published this month. Another proponent is AI pioneer Yann LeCun, who quit his job as Meta's chief AI scientist last year to start Paris-based Advanced Machine Intelligence Labs. "World model is quickly becoming a buzzword," LeCun said on a recent "Unsupervised Learning" podcast. He said he views it as something that enables an AI agent "to predict the consequences of its own actions." There are multiple ways of defining world models, often based on the technologies someone hopes to build with it -- be it robots or a more interactive video game. Robots can't learn much from AI models trained on books Training on all of humanity's books, news articles and visual media, as AI language models have done, has led to AI assistants that are changing the nature of office-based work and some creative fields. But some proponents see limitations in generative AI models that work by repeatedly predicting the next word or pixel to produce new dialogue, images or lines of code. Chatbots can't pick up a coffee mug, notes Martin Hebert, dean of computer science at Carnegie Mellon University. "There's all the geometry of the world, the dynamic of how I move my hand, the physical interaction of the contact with the cup," Hebert said. "This is much more complex than just predicting the next word in a sentence." For scientists like Hebert, who has spent more than four decades researching robotics, the most useful application for world models is as a faster and cheaper path to "physical AI" -- another tech industry buzzword. "Some people may have different definitions, but physical and embodied AI are kind of the evolution of what we used to call robotics," Hebert said in an interview. Some of the AI advances that have made chatbots so useful can also be applied to building AI with a broad enough awareness of its environment to work like a robot's brain, he said. "In your body and spinal cord you have a very general model of how to balance, how to walk around, and you can adapt to your knee hurting in the morning, so you now walk a little differently," he said. "You don't need to think about that. You have a general model somewhere in your nervous system and brain that allows your body to adapt very quickly." Simulated worlds are drawing interest from investors Smarter robots aren't the only end game for world models. Castricato started Overworld last year and the tiny Rhode Island-based startup is now building video game worlds where a scene, say, of a spooky forest, can adapt as a virtual character moves through it and interacts with the objects in it. "There's no other world model where you can just walk through doors or where you can interact with a detailed environment like this," he said in an interview. "We optimize for interaction above anything else." While the near-term applications aren't as readily apparent as AI coding tools, world model makers are attracting interest from venture capitalists like Steve Jang, co-founder and managing partner at Kindred Ventures. The firm is investing in Overworld and other world model-focused companies, including Causal Labs, which is building AI models for weather prediction, and Extropic, which is building specialized computer chips suited to world models. "I think that the future is many different types of models with many different philosophies and architectures," Jang said. "I don't think that it'll be one large, dense model to rule them all." In her recent essay, Li sought to create a "taxonomy of world models" to help sort out the confusion about the competing visions. "A video model that produces gorgeous but physically impossible flames, a language model improvising a playable game, and a physics engine that faithfully simulates combustion all go by the same name," she wrote. She divided world models into three categories. The most commercially viable today are "renderers" that prioritize the visual fidelity of the virtual worlds they create but can't be trusted to teach robots much. Then, there are "simulators" that create virtual training grounds that faithfully represent the physical structure of a world; and "planners" that try to predict what an AI agent or robot should do in an unstructured world. "A robot that can plan is a robot that can work, and the entire industry is racing to be the one that gets there first," she wrote.
[2]
Tech entrepreneurs seeking the next AI frontier are pivoting from chatbots to 'world models'
AI is evolving beyond text-based chatbots, with researchers now focusing on "world models." These advanced systems aim to teach AI to understand and interact with the physical world, much like humans do. This shift is attracting significant investment, with a new generation of entrepreneurs and prominent scientists exploring how AI can learn spatial and temporal structures, paving the way for smarter robots and more immersive virtual environments. Computer scientist Louis Castricato was in his eighth year studying large language models - the artificial intelligence technology behind chatbots like ChatGPT and Claude - when he started to feel like he was hitting a dead end. "We basically have passed the point of doing real fundamental LLM research," Castricato said. "Now it's just applications." The researcher quit his doctoral studies at Brown University and started a new company, called Overworld. Its ambition is in its name: AI that can understand and navigate a world, not just words. There's still plenty of money to be made from AI chatbots - investors are counting on it as they commit trillions of dollars to leading developers like Anthropic and OpenAI. But a growing number of AI entrepreneurs are dedicating themselves to what they see as the next frontier: "world models" that teach AI systems, and sometimes robots, how to react in a physical environment. They include some of the field's most prominent scientists, such as "Godmother of AI" Fei-Fei Li, who describes the concept of a world model as "one of the most important and most overloaded terms in AI today." Scientists are applying AI in new dimensions with 'world models' At the heart of world model research is the idea that AI can't be truly intelligent if it can only read a book. It also needs to read the room. "Where language models learn the statistical structure of text, world models learn the statistical structure of space and time: how light falls on a surface, how a garden looks from an angle no camera has captured, how objects respond to force and follow the laws of physics," wrote Li, founder of the San Francisco startup World Labs, in an essay published this month. Another proponent is AI pioneer Yann LeCun, who quit his job as Meta's chief AI scientist last year to start Paris-based Advanced Machine Intelligence Labs. "World model is quickly becoming a buzzword," LeCun said on a recent "Unsupervised Learning" podcast. He said he views it as something that enables an AI agent "to predict the consequences of its own actions." There are multiple ways of defining world models, often based on the technologies someone hopes to build with it - be it robots or a more interactive video game. Robots can't learn much from AI models trained on books Training on all of humanity's books, news articles and visual media, as AI language models have done, has led to AI assistants that are changing the nature of office-based work and some creative fields. But some proponents see limitations in generative AI models that work by repeatedly predicting the next word or pixel to produce new dialogue, images or lines of code. Chatbots can't pick up a coffee mug, notes Martial Hebert, dean of computer science at Carnegie Mellon University. "There's all the geometry of the world, the dynamic of how I move my hand, the physical interaction of the contact with the cup," Hebert said. "This is much more complex than just predicting the next word in a sentence." For scientists like Hebert, who has spent more than four decades researching robotics, the most useful application for world models is as a faster and cheaper path to "physical AI" - another tech industry buzzword. "Some people may have different definitions, but physical and embodied AI are kind of the evolution of what we used to call robotics," Hebert said in an interview. Some of the AI advances that have made chatbots so useful can also be applied to building AI with a broad enough awareness of its environment to work like a robot's brain, he said. "In your body and spinal cord you have a very general model of how to balance, how to walk around, and you can adapt to your knee hurting in the morning, so you now walk a little differently," he said. "You don't need to think about that. You have a general model somewhere in your nervous system and brain that allows your body to adapt very quickly." Simulated worlds are drawing interest from investors Smarter robots aren't the only end game for world models. Castricato started Overworld last year and the tiny Rhode Island-based startup is now building video game worlds where a scene, say, of a spooky forest, can adapt as a virtual character moves through it and interacts with the objects in it. "There's no other world model where you can just walk through doors or where you can interact with a detailed environment like this," he said in an interview. "We optimise for interaction above anything else." While the near-term applications aren't as readily apparent as AI coding tools, world model makers are attracting interest from venture capitalists like Steve Jang, cofounder and managing partner at Kindred Ventures. The firm is investing in Overworld and other world model-focused companies, including Causal Labs, which is building AI models for weather prediction, and Extropic, which is building specialized computer chips suited to world models. "I think that the future is many different types of models with many different philosophies and architectures," Jang said. "I don't think that it'll be one large, dense model to rule them all." In her recent essay, Li sought to create a "taxonomy of world models" to help sort out the confusion about the competing visions. "A video model that produces gorgeous but physically impossible flames, a language model improvising a playable game, and a physics engine that faithfully simulates combustion all go by the same name," she wrote. She divided world models into three categories. The most commercially viable today are "renderers" that prioritize the visual fidelity of the virtual worlds they create but can't be trusted to teach robots much. Then, there are "simulators" that create virtual training grounds that faithfully represent the physical structure of a world; and "planners" that try to predict what an AI agent or robot should do in an unstructured world. "A robot that can plan is a robot that can work, and the entire industry is racing to be the one that gets there first," she wrote.
Share
Copy Link
AI entrepreneurs are moving beyond chatbots to develop world models that teach AI systems how to understand and navigate physical environments. Led by pioneers like Fei-Fei Li and Yann LeCun, this shift toward physical AI promises smarter robots and immersive virtual worlds, attracting significant venture capital interest despite less obvious near-term applications.
Computer scientist Louis Castricato spent eight years studying large language models before concluding that fundamental LLM research had reached its limits. "We basically have passed the point of doing real fundamental LLM research," Castricato explained. "Now it's just applications."
1
His response was to quit his doctoral studies at Brown University and launch Overworld, a startup dedicated to building AI that can understand and navigate a world, not just process words. This pivot reflects a broader movement among tech entrepreneurs who see world models as the next AI frontier, even as investors continue pouring trillions into chatbot developers like Anthropic and OpenAI.
Source: ET
The shift involves some of AI's most prominent figures. Fei-Fei Li, known as the "Godmother of AI," founded San Francisco-based startup World Labs and describes world models as "one of the most important and most overloaded terms in AI today."
2
Meanwhile, AI pioneer Yann LeCun left his position as Meta's chief AI scientist last year to establish Paris-based Advanced Machine Intelligence Labs, acknowledging that "world model is quickly becoming a buzzword."1
At its core, world model research addresses a fundamental limitation: AI systems cannot achieve true intelligence by only processing text. "Where language models learn the statistical structure of text, world models learn the statistical structure of space and time: how light falls on a surface, how a garden looks from an angle no camera has captured, how objects respond to force and follow the laws of physics," Li wrote in a recent essay.
2
LeCun frames it as enabling an AI agent "to predict the consequences of its own actions," a capability that extends far beyond predicting the next word in a sentence.1
The definitions of world models vary depending on intended applications, whether for smarter robots or interactive video game worlds. But the underlying principle remains consistent: AI systems with broad environmental awareness must understand physical dynamics that text-based models cannot capture.
Martial Hebert, dean of computer science at Carnegie Mellon University with over four decades in robotics research, illustrates the challenge plainly: chatbots cannot pick up a coffee mug. "There's all the geometry of the world, the dynamic of how I move my hand, the physical interaction of the contact with the cup," Hebert noted. "This is much more complex than just predicting the next word in a sentence."
For Hebert, world models offer a faster and cheaper path to physical AI, which he describes as "the evolution of what we used to call robotics."
1
He draws parallels to human biology: "In your body and spinal cord you have a very general model of how to balance, how to walk around, and you can adapt to your knee hurting in the morning, so you now walk a little differently. You don't need to think about that."2
This adaptability represents the kind of general environmental awareness that AI research now aims to replicate.While smarter robots capture headlines, startups are exploring varied applications. Overworld, launched last year in Rhode Island, focuses on building interactive video game worlds where environments adapt as virtual characters move through them. "There's no other world model where you can just walk through doors or where you can interact with a detailed environment like this," Castricato said. "We optimize for interaction above anything else."
1
Despite less obvious near-term applications compared to AI coding tools, venture capitalists are taking notice. Steve Jang, co-founder and managing partner at Kindred Ventures, is investing in Overworld alongside other world model-focused companies including Causal Labs, which builds AI models for weather prediction, and Extropic, which develops specialized computer chips suited to world models.
1
This investment pattern suggests confidence that AI systems capable of understanding physical environments will unlock applications beyond what large language models can achieve, even as the technology remains in early stages compared to mature chatbot platforms.Summarized by
Navi
1
Policy and Regulation

2
Policy and Regulation

3
Policy and Regulation
