Apple's LiTo AI model reconstructs 3D objects with realistic lighting from a single image

Reviewed byNidhi Govil

3 Sources

Share

Apple researchers have developed LiTo, an AI model that reconstructs 3D objects from a single image while preserving realistic lighting effects like reflections and highlights across different viewing angles. The Surface Light Field Tokenization approach uses latent space to jointly model object geometry and view-dependent appearance, outperforming existing methods that typically require multiple images.

Apple Researchers Unveil Advanced 3D Reconstruction Technology

Apple researchers have developed a groundbreaking AI model called LiTo that reconstructs hyperreal 3D objects from a single image while maintaining realistic lighting effects across different viewing angles

1

. The study, titled Surface Light Field Tokenization, introduces a novel approach that jointly models object geometry and view-dependent appearance within a unified framework. Unlike most prior works that focus on either reconstructing 3D geometry or predicting view-independent diffuse appearance, LiTo captures complex visual phenomena including specular highlights and Fresnel reflections under varying lighting conditions

1

.

Source: Analytics Insight

Source: Analytics Insight

How LiTo Transforms Single Images into 3D Objects

The machine learning model achieves this feat by leveraging latent space, a mathematical representation that stores information about both an object's physical structure and how light interacts with its surface

3

. The process involves an encoder-decoder architecture where an encoder first compresses the image into a compact representation, then a decoder reconstructs it as a 3D object complete with shadows, reflections, and lighting changes

3

. What distinguishes this approach is its ability to generate 3D objects from a single image, eliminating the need for more common methods that require images from different angles to enable 3D reconstruction

2

.

Training Process Behind the Technology

To train the AI model, Apple researchers selected thousands of objects rendered from 150 different viewing angles and 3 lighting conditions

1

. Rather than feeding all this information directly into the system, they randomly selected small subsets of these samples and compressed them into a latent representation. The decoder was then trained to reconstruct the full object and its appearance under different angles and light conditions from just that subset of data

1

. Through this training process, the system learned to capture both the object's geometry and how its appearance changes depending on viewing direction. Subsequently, another model was trained to take a single image of an object and predict the corresponding latent representation, enabling the decoder to reconstruct the full 3D object with view-dependent effects

1

.

Performance Comparisons and Future Implications

Apple published reconstruction comparisons between LiTo and an existing model called TRELLIS on the project page, demonstrating superior performance in capturing realistic lighting effects

1

. The ability to reconstruct 3D objects from a single image with accurate reflections, highlights, and other effects consistent across different viewing angles represents a significant advancement in computer vision and 3D modeling

2

. This technology could have wide-ranging applications in augmented reality, product visualization, e-commerce, and digital content creation, particularly as Apple continues to develop its Vision Pro spatial computing platform. The research demonstrates how leveraging surface light field samples through RGB-depth images enables more accurate representation of complex lighting interactions, potentially setting a new standard for single-image 3D reconstruction methods.

Source: 9to5Mac

Source: 9to5Mac

Today's Top Stories

TheOutpost.ai

Your Daily Dose of Curated AI News

Don’t drown in AI news. We cut through the noise - filtering, ranking and summarizing the most important AI news, breakthroughs and research daily. Spend less time searching for the latest in AI and get straight to action.

© 2026 Triveous Technologies Private Limited
Instagram logo
LinkedIn logo