arXiv

SAM 3D: 3Dfy Anything in Images

Title: SAM 3D: Transforming Images into 3D Models with Ease

Abstract:

This paper introduces SAM 3D, a generative AI model for visually grounded 3D object reconstruction. The model predicts geometry, texture, and spatial layout directly from a single input image. SAM 3D performs especially well on natural images, where it handles occlusion and scene clutter by drawing on contextual visual recognition cues.

To support this capability, we developed a hybrid annotation pipeline that combines human annotators with models in the loop. This pipeline allowed us to generate 3D reconstruction data (object shape, texture, and pose) at an unprecedented scale. We trained the model with a modern, multi-stage framework that combines synthetic pretraining with real-world alignment, thereby overcoming the traditional "data barrier" in 3D modeling.

Our results show substantial improvements over recent work, with a win rate of at least 5:1 in human preference tests on reconstructions of real-world objects and scenes. In addition to the model, we will release the source code, model weights, an online demo, and a new, challenging benchmark for in-the-wild 3D object reconstruction.


Source: arXiv Generated at: 2026-06-04 00:00:00 UTC
