7 Sources
[1]
Fei-Fei Li's World Labs speeds up the world model race with Marble, its first commercial product | TechCrunch
World Labs, the startup founded by AI pioneer Fei-Fei Li, is launching its first commercial world model product. Marble is now available via freemium and paid tiers that let users turn text prompts, photos, videos, 3D layouts or panoramas into editable, downloadable 3D environments. The launch of the generative world model, first released in limited beta preview two months ago, comes a little over a year after World Labs came out of stealth with $230 million in funding, and puts the startup ahead of competitors building world models. World models are AI systems that generate an internal representation of an environment, and can be used to predict future outcomes and plan actions. Startups like Decart and Odyssey have released free demos, and Google's Genie is still in limited research preview. Marble differs from these -- and even World Labs's own real-time model, RTFM -- because it creates persistent, downloadable 3D environments rather than generating worlds on-the-fly as you explore. This, the company says, results in less morphing or inconsistency, and lets users export worlds as Gaussian splats, meshes or videos. Marble is also the first model of its kind to offer AI-native editing tools and a hybrid 3D editor that lets users block out spatial structures before AI fills in the visual details. "This is a brand new category of model that's generating 3D worlds, and this is something that's going to get better over time. It's something we've already improved quite a lot," Justin Johnson, co-founder of World Labs, told TechCrunch. Last December, World Labs showed how its early models could generate interactive 3D scenes based on a single image. While impressive, the somewhat cartoonish scenes weren't fully explorable since movements were limited to a small area, and there were occasional rendering errors. 
In my trial of the beta preview, I found Marble generated impressive worlds from image prompts alone -- from game-like environments to photorealistic versions of my living room. Scenes morphed at the edges, though that's apparently been improved in today's launch. That said, a world I'd generated in the beta using a single prompt looked better and matched my intent more closely than the same prompt does now. I haven't yet tested the editing features, though Johnson says they make Marble practical for near-term gaming, VFX and virtual reality (VR) projects. "One of our main themes for Marble going forward is creative control," Johnson said. "There should always be a quick pathway to generate something, but you should be able to dive even deeper and get a lot of control over the things that you're generating. You don't want the machine to just take the wheel and pull all that creativity away from you." Marble's take on creative control starts with input flexibility. The beta only accepted single images, forcing the model to invent unseen details for a 360-degree view. With the full launch, users can now upload multiple images or short clips to show a space from different angles and have the model generate fairly realistic digital twins. Then we have Chisel, an experimental 3D editor that lets users block out coarse spatial layouts (think walls, boxes, or planes) and then add text prompts to guide the visual style. Marble generates the world, decoupling structure from style -- similar to how HTML provides the structure of a website and CSS adds in color. Unlike text-based editing, Chisel lets you directly manipulate objects. "I can just go in there and grab the 3D block that represents the couch and move it somewhere else," Johnson said. Another new feature that gives you more editing control is the ability to expand a world. "Once you generate a world, you can expand it up to once," Johnson said. 
"When you move to a piece of the world that's starting to break apart, you can basically tell the model to expand there or generate more world in the vicinity of where you currently are, and then it can add more detail in that region." Users who want to create extremely large spaces can combine multiple worlds with "composer mode." Johnson demonstrated this for me with two worlds he had already built - a room made of cheese with grape chairs, and another of a futuristic meeting room in space. Marble is available via four subscription tiers: Free (four generations from text, image, or panorama), Standard ($20/month, 12 generations plus multi-image/video input and advanced editing), Pro ($35/month, 25 generations with scene expansion and commercial rights), and Max ($95/month, all features and 75 generations). Johnson thinks the initial use cases for Marble will be gaming, visual effects for film, and virtual reality. Game developers have mixed feelings about the tech. A recent Game Developers Conference survey found a third of respondents believed generative AI has a negative impact on the games industry - 12% more than the survey indicated a year earlier. Intellectual property theft, energy consumption and a decrease in quality from AI-generated content were among the top concerns aired. And last year, a Wired investigation found game studios like Activision Blizzard are using AI to cut corners and combat attrition. In gaming, Johnson sees developers using Marble to generate background environments and ambient spaces and then importing those assets into game engines like Unity or Unreal Engine to add interactive elements, logic and code. "It's not designed to replace the entire existing pipeline for gaming, but to just give you assets that you can drop into that pipeline," he said. For VFX work, Marble sidesteps the inconsistency and poor camera control that plague AI video generators, per Johnson. 
Its 3D assets let artists stage scenes and control camera movements with frame-perfect precision, he said. While Johnson said World Labs isn't focusing on virtual reality (VR) applications right now, he noted the industry is "starved for content" and excited about the launch. Marble is already compatible with the Vision Pro and Quest 3 VR headsets, and every generated world can be viewed in VR today. Marble may also have potential use cases for robotics. Johnson noted that unlike image and video generation, robotics doesn't have the benefit of a large repository of training data. But with generators like Marble, it becomes easier to simulate training environments. According to a recent manifesto by Fei-Fei Li, CEO and co-founder of World Labs, Marble represents the first step towards creating "a truly spatially intelligent world model." Li believes "the next generation of world models will enable machines to achieve spatial intelligence on an entirely new level." If large language models can teach machines to read and write, Li hopes systems like Marble can teach them to see and build. She says the ability to understand how things exist and interact in three-dimensional spaces can eventually help machines make breakthroughs beyond gaming and robotics, and even into science and medicine. "Our dreams of truly intelligent machines will not be complete without spatial intelligence," Li wrote.
[2]
World Labs is betting on 'world generation' as the next AI frontier
In recent years, we've seen generative AI move quickly through different eras: chatbots, image-generation, voice, video-generation, and more. But Dr. Fei-Fei Li, longtime AI pioneer and co-director of Stanford's Institute for Human-Centered Artificial Intelligence (HAI), is staking out what she thinks is the next frontier: spatial intelligence, a nascent field that she believes is the "defining challenge of the next decade," as she wrote in a Substack post this week. That's why Li co-founded World Labs in 2024 -- and raised $230 million last fall -- to build world models, or generative AI models that can "perceive, generate, reason, and interact with the 3D world," per the company. And this week, World Labs released its first commercial product, Marble, which allows users to generate their own downloadable 3D worlds from text, image, or video prompts. Li wrote that she foresees spatial intelligence potentially transforming sectors from storytelling and filmmaking to architecture, robotics, and scientific discovery. "We see that [the] world model is just as big and exciting, if not more [than the previous eras]," Li, who is also CEO of the company, told The Verge in an interview. "Bringing 3D to life, and understanding the richness of spatial and 3D stuff, is just a whole next level beyond the baseline of most of these other single modes," Ben Mildenhall, co-founder of World Labs, told The Verge. He added that, for solely human teams, "It's such a massive problem to build these worlds. It requires such a large team and so many pieces of software and so much time and effort ... Think about the radical change there, that can come if you empower people to build stuff much more rapidly -- ideate, iterate, and edit things in a much tighter loop." 
Marble offers four subscription tiers: Free, which allows for up to four world generations; Standard ($20 per month), which allows for up to 12 generations and more editing options; Pro ($35 per month), which allows for 25 generations and commercial rights; and Max ($95 per month), which allows for up to 75 generations, as well as everything the Pro tier offers. The Verge was able to generate an open-air castle with waterfalls and, in other users' generations, explore ruined structures reclaimed by nature and Hobbit-like spherical homes. It was possible to take a few steps into such environments before essentially running into a wall in the 3D generation, and in the non-free tiers, the downloaded files are compatible with tools like Unreal Engine and Unity. Mildenhall said he had seen that some people who are willing to put in hours of work are able to stage out "fairly large environments" using Marble. Mildenhall said he could imagine authors using it to build out their imagined world, or people working on VFX or location scouts in the filmmaking industry. At the enterprise level, he said, he could imagine companies using Marble or one of World Labs's future products to analyze and visualize their wide swaths of data. "Even given the limitations of this model, we are seeing the light beyond where we are in some emerging behaviors," Li said, adding that people can put together spaces in a way that's "beyond human imagination."
[3]
Fei-Fei Li's World Labs Launches Marble, a Multimodal 3D World Model for Public Use | AIM
Marble can create 3D worlds from text, images, video, or coarse 3D layouts. World Labs, the startup founded by AI pioneer Fei-Fei Li, has made its generative world model, Marble, publicly available after a two-month beta with early users. "Marble can create 3D worlds from text, images, video, or coarse 3D layouts," World Labs said. "Users can interactively edit or expand worlds." The company raised $230 million in September last year. Alongside the launch, World Labs introduced Marble Labs, a workspace for creators to explore workflows, case studies, and documentation. "It is where artists, engineers, and designers push the boundaries of world models," the company said. Other startups, such as Decart and Odyssey, are building world models and putting out free demos, and Google's Genie is still only in a research preview. Marble supports text-to-world and image-to-world generation, as well as multi-image and video inputs for greater control over scene structure. Worlds can be exported as Gaussian splats, triangle meshes, collider meshes, or videos with pixel-level camera control. Marble now includes AI-native world editing tools, enabling object removal, style changes, and structural modifications. World Labs also unveiled Chisel, an experimental 3D sculpting mode that lets users design coarse layouts and apply styles through prompts. "Chisel decouples structure from style," according to the company. Users can expand worlds through one-step enlargement or compose multiple worlds to build large spaces. Enhanced video export can add detail, motion, and cleanup while preserving spatial structure. World Labs said Marble is an early step toward broader spatial intelligence. "Future world models will let humans and agents alike interact with generated worlds in new ways," the company said.
[4]
Fei-Fei Li's World Labs unveils its world-generating AI model
Marble can reconstruct, generate, and simulate 3D worlds -- think of it as a type of "world model." In an interview with Fast Company, Li describes world models as a "significant" evolution of the generative AI era. "The large world model is really a significant step towards unlocking AI's capability," a category she calls "spatial." Spatial intelligence refers to a system's ability to perceive, model, reason about, and take actions within physical or geometric space -- similar to how humans or animals choose their actions based on their understanding of their surroundings. World Labs launched in September of 2024, when it began working on the Marble model. Two months ago it released a preview of the model to a group of creatives, who began building worlds and giving feedback. This week, Li posted a sort of manifesto on Substack arguing that spatial intelligence is the next frontier in AI. For humans, she says, spatial intelligence of the physical world around us provides the scaffolding upon which we build our cognition. "Spatial intelligence will transform how we create and interact with real and virtual worlds -- revolutionizing storytelling, creativity, robotics, scientific discovery, and beyond," she writes. World Labs believes that endowing machines (including robots) with such "spatial intelligence" could be transformative for a number of industries in the coming years. Using a web interface, users can feed Marble a scene description, images or videos, or coarse 3D layouts and the model will generate a realistic 3D environment. A user might input a set of images from the bedroom where they grew up, then upload the images to Marble, which will then intelligently sew them together to create an immersive digital 3D version of the room.
[5]
World Labs launches Marble, a commercial world model for generating entire virtual environments - SiliconANGLE
World Labs Technologies, a company founded by AI pioneer Fei-Fei Li and focused on developing breakthrough artificial intelligence models, today announced the launch of its first commercial world model product: Marble. Marble lets users generate entire virtual worlds from text prompts, photos, panoramas or 3D models and download fully editable 3D environments. World Labs initially debuted the world model in limited beta mode two months ago. In the preview, the company showed that the model can generate 3D worlds that users can then explore as long as they want, with no morphing and no inconsistency. According to the company, Marble represents a leap over previous models by producing larger, more stylistically diverse worlds with cleaner 3D geometry. World models are useful because they allow AI models connected to the real world to understand and predict the world's behavior. This is critical for developing more capable AI systems, such as autonomous vehicles and robots, by producing realistic training data. Marble's world models can also be used for entertainment, such as generating entire, complex worlds for cinema and video games. Many video games, for example, use virtual worlds for players to participate in that rely on 3D editing tools to recreate realistic or semi-realistic environments for users to play the game. Marble can generate worlds in a broad variety of styles, including cartoon, science fiction, futuristic, fantasy, anime, realistic and retro-styled low poly-count (where the objects and walls appear "low graphics," as if rendered on an older computer). Li is well known for creating ImageNet in 2009, a landmark AI dataset that revolutionized the field of computer vision. ImageNet contains over 14 million images organized according to a hierarchy of English nouns and their relationships. 
Its creation transformed computer vision from a niche research pursuit into one of the most dynamic fields of AI and laid the groundwork for visual reasoning and, ultimately, for today's generative world models. World models and visual reasoning form the basis for spatial intelligence, Li said, a concept that will transform how users create and interact with real and virtual environments. "Today, leading AI technology such as large language models have begun to transform how we access and work with abstract knowledge," Li explained in a blog post. "Yet they remain wordsmiths in the dark; eloquent but inexperienced, knowledgeable but ungrounded." Building spatially intelligent AI, Li argued, requires creating world models that are generative and capable of understanding, reasoning, and producing the semantic context of not only objects but also their relationships. This mandates creating and reasoning in dynamic, complex worlds, real or virtual, beyond the current capabilities of modern LLMs. The current market contains several contenders working on world models, including Google LLC's Genie, Nvidia Corp.'s Cosmos and AI startup Decart AI Inc. Unlike many world foundation models in the industry currently, including World Labs' own Real-Time Frame Model, Marble allows users to generate persistent worlds and download them as 3D models rather than producing them on the fly. Like AI image editors, Marble also offers tools for users to modify virtual worlds. Chisel, an experimental 3D editor, allows users to define virtual spaces with layouts such as walls, rooms and terrain and then use a text prompt to refine how the rough "sketch" should be used. Another feature allows users to expand the world by extending already available portions or take pieces of the world and then bridge them together seamlessly. The model expands the world by generating more of the 3D space based on existing rules and style. 
Users who want to build extremely large spaces can combine already generated worlds with a "composer mode," allowing them to stitch together different styles. Marble is available in four pricing tiers: Free, with four virtual world generations from text, images or panoramas; Standard at $20 per month with 12 generations, multimedia support and extended editing; Pro at $35 per month with 25 generations and commercial rights; and Max at $95 per month with 75 generations and a full feature set.
[6]
Fei-Fei Li's Spatial A.I. Startup World Labs Unveils Its First Product
Spatial intelligence aims to teach A.I. systems physical concepts humans intuitively grasp, such as parking a car without bumping the curb. Last January, renowned A.I. researcher Fei-Fei Li took a leave of absence from Stanford to trade academia for startup life. Nearly two years later, her venture World Labs has unveiled its first commercial product: a world model called Marble. Marble can create 3D virtual worlds from text, images, video or even rough layouts. It builds on an earlier World Labs prototype that created 3D scenes from 2D images, but with limitations, such as restricted interactive areas. So-called world models like Marble are central to Li's vision of the future of A.I. Because these models can reason about and interact with complex environments, they are essential for building A.I. that understands not just language, but the physical world itself. World Labs aims to imbue its systems with spatial intelligence, teaching them physical concepts humans intuitively grasp, such as parking a car without bumping the curb, catching a tossed object, or pouring a drink without looking. "Today, leading A.I. technology such as large language models (LLMs) have begun to transform how we access and work with abstract knowledge," Li wrote in a Nov. 10 blog post. "Yet they remain wordsmiths in the dark; eloquent but inexperienced, knowledgeable but ungrounded." An emphasis on visual and spatial intelligence has long been Li's "North Star," said the researcher, who in 2006 played a role in the release of ImageNet, a database of 15 million images that spurred the rise of deep learning. 
Li also co-directs Stanford's Institute for Human-Centered A.I. and serves as a United Nations advisor on A.I. policy. These days, however, Li is focused on World Labs, which has raised $230 million to pursue its spatial intelligence vision. Its backers include Radical Ventures, Andreessen Horowitz and Nvidia, as well as prominent tech figures such as Geoffrey Hinton, Eric Schmidt, Marc Benioff and Reid Hoffman. Marble has been in beta for a few months and is now publicly available. It can create a full 3D world from a single image or text prompt. Users can also merge multiple environments by uploading several images within a prompt. According to World Labs, the model can combine photos or short videos of real-world spaces to generate immersive, realistic virtual worlds. The model includes a range of editing tools that let users customize their creations. A feature called Chisel allows users to sketch out a coarse 3D layout, while other tools make it possible to expand worlds or build entirely new scenes within the same environment. Looking ahead, World Labs plans to develop world models with more interactive capabilities for both humans and A.I. agents. While Li may be the most prominent figure developing world models, she isn't the only one in the field. Google DeepMind and Nvidia have explored similar technologies with their Genie and Cosmos models, respectively. Yann LeCun, Meta's chief A.I. scientist, is reportedly in the early stages of fundraising for his own world model startup. Li said the applications of spatial intelligence tools like Marble will "span varying timelines." The model is already being used by filmmakers, game designers and architects to enhance creative workflows. 
In the medium term, Li expects such technology to advance robotics, while future applications in science, healthcare, and education could enable breakthroughs in experiment simulation, drug discovery and immersive learning. "Spatial intelligence will transform how we create and interact with real and virtual worlds -- revolutionizing storytelling, creativity, robotics, scientific discovery, and beyond," said Li. "This is A.I.'s next frontier."
[7]
World Labs Launches Marble as Spatial Intelligence Becomes the New AI Battleground | PYMNTS.com
Marble is the first product released by World Labs, the company started by Fei-Fei Li, which has concentrated its research on spatial intelligence and internal world modeling. The company said the goal is to push AI forward from reading and writing to understanding motion, geometry and cause and effect. World Labs describes Marble as a multimodal system that can generate 3D scenes from text prompts, images, videos or spatial sketches, according to its product announcement. The model can extrapolate a single image into a navigable 3D environment and includes an interface called Chisel that lets users draw a rough layout and refine it with natural language. Marble represents World Labs' transition from research to commercialization. It is available through freemium and paid tiers that support exports in Gaussian splats, traditional meshes and video files, which allows integration with creative pipelines, simulation tools and real-time rendering engines. World Labs positions spatial intelligence as the next major layer of AI development. PYMNTS reported that experts view spatial understanding as essential for AI systems that must operate in real-world environments. Most current models work with flat, static data, which limits their usefulness in tasks that require depth, motion or physical reasoning. Marble is designed to bridge that gap by modeling how objects occupy space and interact over time. World Labs enters a still-emerging field that has expanded quickly over the past year. The South China Morning Post reported that Tencent is expanding its world model efforts, investing in large-scale training runs designed to simulate physical environments, support robotics development and improve generative 3D workflows. 
The report noted that Tencent views spatial intelligence as a strategic priority and has reorganized parts of its AI research teams to accelerate progress in 3D simulation and multimodal spatial learning. Tencent's push reflects a broader trend among major Asian and U.S. companies that are rushing to build systems capable of generating consistent environments, predicting motion and training agentic models with spatial context. These efforts are tied to growth in digital twins, autonomous robotics and immersive applications that depend on accurate physical simulation. TechCrunch noted that early adopters of Marble include creative studios, simulation developers and teams in gaming and VFX seeking faster ways to build environments. The ability to convert text prompts into structured 3D spaces has drawn interest from organizations that currently rely on manual modeling workflows that are costly and time intensive. PYMNTS has previously reported that spatial intelligence is also gaining importance in enterprise automation. As companies use AI to support planning, forecasting and real-time decision-making, the ability to understand physical layout and movement improves responsiveness and reliability. Marble's next test will be its ability to demonstrate value beyond creative production. Expected early interest from entertainment, robotics and simulation sets the stage, but broader enterprise adoption will hinge on accuracy, workflow compatibility and compute efficiency. Spatial intelligence could become a core requirement for future AI agents, but many companies will evaluate performance and integration before committing to 3D-first AI systems.
AI pioneer Fei-Fei Li's World Labs has launched Marble, its first commercial world model that generates downloadable 3D environments from text, images, or video prompts. The product marks a significant milestone in spatial intelligence AI and positions World Labs ahead of competitors in the emerging world model market.
World Labs, the AI startup founded by renowned computer vision pioneer Fei-Fei Li, has officially launched Marble, its first commercial world model product that generates downloadable 3D environments from various input types [1]. The launch represents a significant milestone in the emerging field of spatial intelligence, positioning World Labs ahead of competitors in the race to commercialize world generation technology [2].
The company, which raised $230 million in funding last year and emerged from stealth mode just over a year ago, is launching Marble after a two-month limited beta preview that allowed early users to test and provide feedback on the technology [3].
Marble distinguishes itself from competing world models through its ability to create persistent, downloadable 3D environments rather than generating worlds on-the-fly during exploration [1]. This approach results in less of the morphing and inconsistency that plague real-time generation systems, while enabling users to export worlds as Gaussian splats, triangle meshes, or videos with pixel-level camera control [5].
The platform accepts diverse input formats including text prompts, single or multiple images, short video clips, 3D layouts, and panoramas, allowing for greater creative control over scene structure [3]. Users can generate worlds across various aesthetic styles, from photorealistic environments to cartoon, science fiction, fantasy, anime, and retro-styled low-polygon designs [5].
Marble introduces several innovative editing capabilities that set it apart from existing solutions. The platform includes Chisel, an experimental 3D sculpting tool that allows users to create coarse spatial layouts using basic geometric shapes like walls, boxes, and planes, then apply visual styles through text prompts [1]. This approach decouples structure from style, similar to how HTML provides website structure while CSS handles visual presentation.
The system also supports world expansion capabilities, allowing users to extend existing environments when they encounter boundaries or need additional space [1]. For larger projects, a "composer mode" enables users to combine multiple generated worlds seamlessly, creating expansive virtual environments that would traditionally require extensive manual development [5].
World Labs enters a competitive but nascent market where several companies are developing world model technologies. Current competitors include Google's Genie, which remains in limited research preview, and startups like Decart and Odyssey that have released free demonstrations [1]. However, Marble's commercial availability and comprehensive feature set position it as the first fully realized product in this emerging category.
The company has structured Marble with four subscription tiers to accommodate different user needs and budgets. The Free tier offers four world generations from text, image, or panorama inputs, while the Standard plan ($20/month) provides 12 generations with multimedia support and advanced editing features [5]. Professional users can access the Pro tier ($35/month) with 25 generations and commercial rights, or the Max tier ($95/month) offering 75 generations and full feature access.
World Labs envisions Marble serving multiple industries, with initial focus on gaming, visual effects for film, and virtual reality development [1]. The technology addresses significant pain points in these sectors, where creating detailed 3D environments traditionally requires large teams, multiple software tools, and extensive time investments [2].
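As a rough illustration of the four subscription tiers described above, the short Python sketch below works out the effective price of a single world generation on each plan. The prices and generation counts are those reported in the coverage; treating each count as a monthly allowance, and the names `TIERS` and `cost_per_generation`, are assumptions made for this example only.

```python
# Reported tier data: (price in USD per month, world generations included).
# Assumption: the generation counts are monthly allowances.
TIERS = {
    "Free": (0, 4),
    "Standard": (20, 12),
    "Pro": (35, 25),
    "Max": (95, 75),
}


def cost_per_generation(price_usd, generations):
    """Return the effective price in USD of one world generation."""
    return price_usd / generations


if __name__ == "__main__":
    for name, (price, generations) in TIERS.items():
        per_world = cost_per_generation(price, generations)
        print(f"{name}: ${per_world:.2f} per generation")
```

Under these assumptions, the per-generation price falls from about $1.67 on Standard to $1.40 on Pro and about $1.27 on Max, so the higher tiers are cheaper per world as well as broader in features.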
Fei-Fei Li, who previously created the landmark ImageNet dataset that revolutionized computer vision, positions spatial intelligence as the next major frontier in AI development [4]. In her recent manifesto, Li argues that while current large language models excel at processing abstract knowledge, they remain "wordsmiths in the dark," lacking grounded understanding of physical space and spatial relationships [5].
Summarized by Navi