arXiv

Entity Binding Failures in Speech LLM Reasoning: Diagnosis and Chain-of-Thought Intervention

Title: Tackling Entity Binding Failures in Speech LLM Reasoning: Diagnosis and Chain-of-Thought Intervention

Abstract:

Speech Large Language Models (SLLMs) currently lag behind their text-based counterparts when handling complex reasoning tasks. Our investigation reveals that this performance disparity does not stem from a generalized cognitive deficit. Through an evaluation of three distinct SLLMs, we demonstrate that speech-to-text (S2T) systems perform on par with or better than text-to-text (T2T) models in spatial, syntactic, and factual domains. However, in logical tasks that demand entity tracking, S2T accuracy drops to chance levels. We identify this specific decline as an entity binding failure, where continuous speech features lead models to lose precise associations between entities and their properties during implicit reasoning processes.

To address this issue, we introduce Entity-Aware Chain-of-Thought (EA-CoT), a method that compels SLLMs to explicitly list entities and link them to claims prior to reasoning. EA-CoT effectively closes the performance gap, achieving absolute accuracy improvements of up to 24.4%, even in scenarios where spoken names are misrecognized. Ablation studies confirm that these enhancements are driven solely by explicit semantic binding, suggesting that the observed modality gap is a resolvable bottleneck rather than an inherent limitation.


Source: arXiv Generated at: 2026-06-04 00:00:00 UTC

Related Articles

AI Concentration Risk Is the Problem: 3-Minutes MLIV
Bloomberg

AI Concentration Risk Is the Problem: 3-Minutes MLIV

The article argues that AI concentration risk, rather than the technology itself, is the primary concern. It highlights ...

Reuters

Foxconn announces strategic collaboration with Intel on next-gen AI infrastructure

Foxconn and Intel announced a strategic partnership to develop next-generation AI infrastructure. This collaboration aim...

SpaceX Seeks to Raise $75 Billion in Record IPO (Video)
Bloomberg

SpaceX Seeks to Raise $75 Billion in Record IPO (Video)

SpaceX aims for a record $75 billion valuation through an initial public offering. This historic IPO marks a significant...

Broadcom AI Chip Outlook Disappoints Investors
Bloomberg

Broadcom AI Chip Outlook Disappoints Investors

Broadcom’s AI chip projections disappointed investors, dampening market sentiment. The outlook fell short of expectation...

Reuters

Europe's tech 'liberation day'? Computer says not yet

Europe’s expected tech breakthrough remains unrealized, as current systems indicate that a true "liberation day" has not...

Hiranandani Group CEO on Powering India's Digital Future
Bloomberg

Hiranandani Group CEO on Powering India's Digital Future

Hiranandani Group CEO discusses driving India's digital transformation.