Global News Digest

arXiv

Self-supervised Monocular Depth and Pose Estimation for Endoscopy with Latent Priors

Title: Enhancing Self-Supervised Monocular Depth and Pose Estimation in Endoscopy via Latent Priors

Abstract: Accurate 3D mapping in endoscopy is essential for comprehensive, quantitative assessment of lesions in the gastrointestinal (GI) tract, and it depends on reliable depth and pose estimation. Endoscopes are typically monocular, and current approaches often generalize poorly to difficult endoscopic environments because they rely on synthetic data or overly complex architectures. To address these limitations, we introduce a robust self-supervised framework for monocular depth and pose estimation that combines a Generative Latent Bank with a Variational Autoencoder (VAE). The Generative Latent Bank draws on extensive depth data from natural images to condition the depth network, injecting latent feature priors that improve the realism and robustness of depth predictions. For pose estimation, we recast the problem within a VAE and treat pose transitions as latent variables, which regularizes scale, stabilizes prominence along the z-axis, and improves sensitivity in the x-y plane. This dual-refinement pipeline delivers accurate depth and pose estimates despite the challenging textures and lighting conditions of the GI tract. Comprehensive evaluations on the SimCol and EndoSLAM datasets show that our approach outperforms existing self-supervised methods for endoscopic depth and pose estimation.
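
To make the idea of treating pose transitions as latent variables more concrete, here is a minimal sketch of the generic VAE machinery such a formulation would typically rely on: the reparameterization trick and a KL regularizer. This is not the authors' implementation. The abstract does not specify the posterior, the prior, the pose parameterization, or the loss, so the diagonal Gaussian posterior, the standard normal prior, the 6-DoF layout, and the names `sample_pose` and `kl_to_standard_normal` are all assumptions made for illustration.

```python
import math
import random


def sample_pose(mu, log_var, rng):
    """Reparameterization trick: z = mu + sigma * eps, with eps ~ N(0, 1).

    Assumes a diagonal Gaussian posterior over the pose (an assumption,
    not something stated in the abstract).
    """
    return [m + math.exp(0.5 * lv) * rng.gauss(0.0, 1.0)
            for m, lv in zip(mu, log_var)]


def kl_to_standard_normal(mu, log_var):
    """KL(N(mu, sigma^2) || N(0, 1)), summed over all pose dimensions."""
    return sum(0.5 * (math.exp(lv) + m * m - 1.0 - lv)
               for m, lv in zip(mu, log_var))


# Hypothetical encoder output for one frame pair, laid out as a 6-DoF pose
# (tx, ty, tz, rx, ry, rz). The numbers are illustrative only.
mu = [0.01, -0.02, 0.10, 0.0, 0.005, -0.003]
log_var = [-4.0] * 6

rng = random.Random(0)                    # seeded for repeatability
pose = sample_pose(mu, log_var, rng)      # one sampled relative pose
kl = kl_to_standard_normal(mu, log_var)   # regularization term
```

In a setup like this, the KL term would be added to the self-supervised photometric loss with some weight. How the paper actually weights or structures that term is not stated in the abstract.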


Source: arXiv
Generated at: 2026-06-02 00:00:00 UTC

Related Articles

Schroders Renewable Unit Targets AI Assets as Power Demand Soars
Bloomberg

Schroders’ renewable unit targets AI infrastructure, pivoting to meet soaring energy demand from artificial intelligence...

State Street's Paglia on SBI Group Partnership, ETFs
Bloomberg

State Street's Paglia discusses the SBI Group partnership and ETFs.

Nvidia Boss Says Workers Should Be Paid ‘as Much as Possible’
Bloomberg

Nvidia CEO Jensen Huang advocates for paying workers “as much as possible,” emphasizing maximum compensation. This stanc...

TSE Talking With Regulator For Easing ETF Listing Rules
Bloomberg

The Tokyo Stock Exchange is in talks with regulators about easing ETF listing rules. This aims to simplify market access an...

S&P DJI CEO on Japan Markets, Mega IPOs
Bloomberg

S&P DJI CEO discusses Japan's financial markets and major IPOs.