arXiv

Correcting Visual Blur Induced by Attention Distraction to Reduce Hallucinations: Algorithm and Theory

Title: Mitigating Hallucinations by Rectifying Visual Blur Caused by Attention Diversion: A Theoretical and Algorithmic Study

Abstract: Multimodal large language models (MLLMs) are prone to object hallucinations, yet the visual perceptual mechanisms behind them remain largely unclear. This study shows that such hallucinations are closely linked to a phenomenon analogous to human attention distraction. In humans, divided focus diminishes visual acuity and leads to erroneous descriptions; in MLLMs, the analogous effect appears as spatial irregularities in multi-head attention and a temporal decay of the attention assigned to image tokens during decoding. Our theoretical analysis indicates that this dispersion of attention increases model complexity and impairs generalization on classification tasks. Based on these insights, we introduce the Attention-Focused Approach for Improved Image Perception (AFIP). The method counteracts attention diversion by enriching cross-head attention, and it strengthens visual grounding by dynamically enhancing historical attention. Experiments across multiple benchmarks and models show that AFIP reduces hallucinations without requiring additional training.
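The two symptoms named in the abstract, uneven attention across heads and a decaying share of attention on image tokens during decoding, can be made concrete with a small diagnostic. The sketch below is not AFIP and is not taken from the paper; it is an illustrative measurement under stated assumptions. The function names, the toy attention values, and the choice of population standard deviation as the "spread across heads" are all hypothetical, and each attention row is assumed to be one head's weights over all key positions at a single decoding step.

```python
from statistics import mean, pstdev


def image_attention_share(head_row, image_positions):
    """Fraction of one head's attention mass that falls on image tokens.

    head_row: attention weights of a single head over all key positions
              at one decoding step (assumed non-negative).
    image_positions: indices of the key positions that hold image tokens.
    """
    total = sum(head_row)
    if total == 0:
        return 0.0
    return sum(head_row[i] for i in image_positions) / total


def step_summary(step_attention, image_positions):
    """Mean and spread (population std. dev.) of image share across heads."""
    shares = [image_attention_share(row, image_positions) for row in step_attention]
    return mean(shares), pstdev(shares)


# Toy data (hypothetical): 2 heads, 4 key positions; positions 0 and 1 are image tokens.
image_positions = [0, 1]
steps = [
    [[0.4, 0.4, 0.1, 0.1], [0.3, 0.3, 0.2, 0.2]],    # early decoding step
    [[0.2, 0.2, 0.3, 0.3], [0.05, 0.05, 0.4, 0.5]],  # later decoding step
]

summaries = [step_summary(step, image_positions) for step in steps]
for t, (avg, spread) in enumerate(summaries):
    print(f"step {t}: mean image share={avg:.2f}, spread across heads={spread:.2f}")
```

In this toy input the mean image share drops from the first step to the second while the spread across heads grows, which is the kind of pattern the abstract describes. With a real model, the rows would come from the attention weights the model exposes per layer and head.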


Source: arXiv Generated at: 2026-06-04 00:00:00 UTC
