arXiv

UniCAD: A Unified Benchmark and Universal Model for Multi-Modal Multi-Task CAD

Title: UniCAD: Establishing a Unified Benchmark and Universal Model for Multi-Modal, Multi-Task CAD

Abstract:

Computer-Aided Design (CAD) serves as the foundation for contemporary engineering and manufacturing, facilitating the development of highly precise and editable 3D models. Despite this critical role, research in the field has traditionally focused on individual tasks in isolation. The advancement of multi-modal, multi-task learning for CAD has been significantly impeded by the lack of a standardized, unified benchmark. To bridge this divide, we present UniCAD, an extensive benchmark designed for multi-modal CAD learning. This benchmark encompasses a wide array of functionalities, including point-to-CAD reconstruction, text and image-to-CAD generation, and CAD-related question answering, accommodating various input modalities.

Complementing this benchmark, we introduce UniCAD-MLLM, a versatile multi-modal large language model. This model is capable of processing diverse inputs—such as text, images, sketches, and point clouds—and executing these heterogeneous tasks end-to-end within a single, cohesive framework. Our extensive experimental evaluations, conducted on both the UniCAD and Fusion360 benchmarks, reveal that UniCAD-MLLM delivers state-of-the-art results across all tested tasks. It consistently surpasses existing baselines, whether they are specialized for specific tasks or designed for multi-task scenarios. To foster further advancements in the field, we will make the dataset, source code, and pre-trained models publicly available.


Source: arXiv Generated at: 2026-06-04 00:00:00 UTC

Related Articles

Glazer Family Members Said to Study Manchester United Stake Sale
Bloomberg

Glazer Family Members Said to Study Manchester United Stake Sale

Reports indicate the Glazer family is evaluating a potential sale of their Manchester United stake, with family members ...

Ares' Blair Jacbobson: Disconnect Over Private Credit Headlines
Bloomberg

Ares' Blair Jacbobson: Disconnect Over Private Credit Headlines

Ares’ Blair Jacobson argues that private credit headlines misrepresent reality, highlighting a disconnect between media ...

Nvidia-Backed Robotics Startup Generalist AI Valued at $2 Billion
Bloomberg

Nvidia-Backed Robotics Startup Generalist AI Valued at $2 Billion

Nvidia-backed robotics startup Generalist AI has reached a $2 billion valuation. Founders Pete Florence, Andy Zeng, and ...

TechCrunch

Oura Ring 5 review: Thinner, lighter, better

The Oura Ring 5 is 40% smaller and lighter than its predecessor, offering superior comfort and a discreet, jewelry-like ...

Financial Times

How AI has de-skilled translation

AI fragments specialist translation into routine tasks, effectively de-skilling the profession. This shift reduces compl...

Zurich Insurance Expands Data-Center Offering Beyond the US
Bloomberg

Zurich Insurance Expands Data-Center Offering Beyond the US

Zurich Insurance Group is expanding its data center insurance products internationally, extending coverage beyond the Un...