arXiv

Towards Compact Autonomous Driving Perception with Balanced Learning and Multi-sensor Fusion

This study introduces a streamlined deep multi-task learning architecture designed to execute a wide array of autonomous driving perception tasks within a single forward pass. Rather than relying on a suite of separate models, this system simultaneously generates multiple outputs, including semantic segmentation from various viewpoints, depth estimation, LiDAR segmentation, and bird’s eye view projections. To address the challenges of imbalanced learning inherent in managing numerous tasks, the authors propose an adaptive loss weighting algorithm.
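The summary does not say how the adaptive loss weighting works, so the following is only an illustrative sketch of the general idea, not the authors' algorithm. It uses Dynamic Weight Average, a different, previously published heuristic, as a stand-in: tasks whose loss fell more slowly over the last two epochs receive a larger weight. The function names, the temperature value, and the loss numbers are all assumptions made for the example.

```python
import math

def dwa_weights(prev_losses, prev_prev_losses, temperature=2.0):
    """Dynamic Weight Average (stand-in technique, not the paper's method).

    Returns one weight per task; the weights sum to the number of tasks.
    """
    # A ratio near or above 1 means the task's loss barely improved,
    # so that task should receive relatively more weight.
    ratios = [p / pp for p, pp in zip(prev_losses, prev_prev_losses)]
    exps = [math.exp(r / temperature) for r in ratios]
    total = sum(exps)
    n_tasks = len(ratios)
    return [n_tasks * e / total for e in exps]

def weighted_total(losses, weights):
    """Combine per-task losses into the single scalar that is optimized."""
    return sum(w * l for w, l in zip(weights, losses))

# Hypothetical per-task losses from two earlier epochs
# (order: semantic segmentation, depth estimation, LiDAR segmentation).
epoch_1 = [1.00, 0.80, 1.20]
epoch_2 = [0.50, 0.76, 1.14]

weights = dwa_weights(epoch_2, epoch_1)
total_loss = weighted_total(epoch_2, weights)
```

In this made-up example the segmentation loss halved while the other two fell by only 5%, so segmentation receives the smallest weight and the two slower tasks receive equal, larger weights.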

The model leverages data pre-processing and intermediate sensor fusion techniques to integrate inputs from multiple modalities. These inputs are gathered from RGB cameras, dynamic vision sensors (DVS), and LiDAR units positioned at various locations on the ego vehicle, enabling a more comprehensive understanding of dynamic environments.

Ablation studies demonstrate that the model variant trained using the proposed method yields superior performance. Additionally, a comparative analysis highlights its effectiveness against combinations of recent state-of-the-art models. Notably, the proposed architecture maintains high performance despite having significantly fewer parameters, which allows for faster inference times and reduced GPU memory consumption. The results remain consistent across three distinct CARLA simulation datasets and one real-world nuScenes-lidarseg dataset. To facilitate further research, the code and associated resources have been made publicly available at https://github.com/oskarnatan/compact-perception.


Source: arXiv
Generated at: 2026-06-03 00:00:00 UTC
