arXiv

AgenticDiffusion: Agentic Diffusion-based Path Planning for Vision-Based UAV Navigation

Title: AgenticDiffusion: Leveraging Agentic Diffusion for Vision-Centric UAV Path Planning

Abstract: Navigating Unmanned Aerial Vehicles (UAVs) indoors demands robust capabilities in scene comprehension, efficient exploration, and precise trajectory control, particularly when constrained by a restricted field of view. Current vision-based navigation systems largely depend on single-view inputs, which hinders their capacity to interpret occlusions, assess target visibility, and grasp the broader spatial layout. To address these limitations, this study introduces AgenticDiffusion, a comprehensive multi-view navigation architecture. This framework integrates language-driven reasoning, open-vocabulary target localization, vision-based diffusion planning, and Nonlinear Model Predictive Control (NMPC) into a cohesive aerial navigation pipeline.

By processing synchronized first-person-view (FPV) and top-view imagery alongside natural language commands, the system identifies the most advantageous viewpoints for navigation and formulates a mission strategy before executing any movement. Specifically, an open-vocabulary grounding model pinpoints targets, enabling viewpoint-specific diffusion planners to construct navigation trajectories for the UAV. By leveraging the synergies of complementary camera perspectives, AgenticDiffusion minimizes redundant target searches and enhances overall navigation efficiency within complex, cluttered indoor spaces.

The proposed framework underwent validation across four distinct real-world UAV navigation tests, covering adaptive viewpoint selection, multi-stage mission execution, long-horizon navigation, and the identification of safe landing zones. Experimental data from 40 real-world trials revealed an overall mission success rate of 80%, while the diffusion planners demonstrated a perfect 100% success rate in trajectory generation.


Source: arXiv Generated at: 2026-06-04 00:00:00 UTC

Related Articles

AI Concentration Risk Is the Problem: 3-Minutes MLIV
Bloomberg

AI Concentration Risk Is the Problem: 3-Minutes MLIV

The article argues that AI concentration risk, rather than the technology itself, is the primary concern. It highlights ...

Reuters

Foxconn announces strategic collaboration with Intel on next-gen AI infrastructure

Foxconn and Intel announced a strategic partnership to develop next-generation AI infrastructure. This collaboration aim...

SpaceX Seeks to Raise $75 Billion in Record IPO (Video)
Bloomberg

SpaceX Seeks to Raise $75 Billion in Record IPO (Video)

SpaceX aims for a record $75 billion valuation through an initial public offering. This historic IPO marks a significant...

Broadcom AI Chip Outlook Disappoints Investors
Bloomberg

Broadcom AI Chip Outlook Disappoints Investors

Broadcom’s AI chip projections disappointed investors, dampening market sentiment. The outlook fell short of expectation...

Reuters

Europe's tech 'liberation day'? Computer says not yet

Europe’s expected tech breakthrough remains unrealized, as current systems indicate that a true "liberation day" has not...

Hiranandani Group CEO on Powering India's Digital Future
Bloomberg

Hiranandani Group CEO on Powering India's Digital Future

Hiranandani Group CEO discusses driving India's digital transformation.