arXiv

Belief-Aware VLM Model for Human-like Reasoning

Abstract: Conventional neural network architectures for intent inference are predominantly dependent on observable states, which often hinders their capacity to generalize across a wide array of tasks and dynamic settings. While recent breakthroughs in Vision Language Models (VLMs) and Vision Language Action (VLA) models have introduced common-sense reasoning capabilities through large-scale multimodal pretraining—facilitating zero-shot performance—these systems still lack explicit mechanisms to represent and update belief states. This deficiency restricts their ability to reason in a manner akin to humans or to track evolving human intent over extended periods. To overcome these limitations, we introduce a belief-aware VLM framework that combines reinforcement learning with retrieval-based memory. Rather than constructing an explicit belief model, our approach approximates belief through a vector-based memory system that retrieves pertinent multimodal context, which is then integrated into the VLM to facilitate reasoning. Furthermore, we enhance decision-making processes by applying a reinforcement learning policy within the VLM’s latent space. Our evaluations on publicly accessible VQA datasets, including HD-EPIC, reveal consistent performance gains over zero-shot baselines, underscoring the critical role of belief-aware reasoning.
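
The retrieval step described in the abstract can be illustrated with a minimal sketch. Everything here is an assumption made for illustration: the names `VectorMemory`, `cosine`, and `build_prompt`, the toy three-dimensional embeddings, and the text prompt format are hypothetical and are not taken from the paper. A real system would obtain embeddings from a multimodal encoder and pass the retrieved context to a VLM; the reinforcement learning policy over the VLM's latent space is not sketched.

```python
import math
from dataclasses import dataclass, field


def cosine(a, b):
    """Cosine similarity of two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


@dataclass
class VectorMemory:
    """Hypothetical vector memory: stores (embedding, context) pairs."""
    entries: list = field(default_factory=list)

    def write(self, embedding, context):
        # Store a copy of the embedding alongside its context string.
        self.entries.append((list(embedding), context))

    def retrieve(self, query, k=2):
        # Rank stored entries by similarity to the query; return the top-k contexts.
        ranked = sorted(self.entries, key=lambda e: cosine(query, e[0]), reverse=True)
        return [context for _, context in ranked[:k]]


def build_prompt(question, retrieved):
    """Prepend retrieved context to the question (assumed prompt layout)."""
    lines = ["Context from memory:"]
    lines += [f"- {c}" for c in retrieved]
    lines.append(f"Question: {question}")
    return "\n".join(lines)


# Toy embeddings chosen by hand; a real encoder would produce these.
memory = VectorMemory()
memory.write([1.0, 0.0, 0.0], "person picked up a knife")
memory.write([0.9, 0.1, 0.0], "person placed an onion on the board")
memory.write([0.0, 0.0, 1.0], "person opened the window")

retrieved = memory.retrieve([1.0, 0.05, 0.0], k=2)
prompt = build_prompt("What is the person about to do?", retrieved)
print(prompt)
```

In this toy setup the two kitchen-related observations rank above the unrelated one, so only they reach the prompt. This is the sense in which retrieved context can stand in for an explicit belief state.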


Source: arXiv
Generated at: 2026-06-04 00:00:00 UTC
