arXiv

Concept-wise Attention for Fine-grained Concept Bottleneck Models

Title: Concept-wise Attention for Fine-grained Concept Bottleneck Models

Abstract:

Recent advancements in Concept Bottleneck Models (CBM) have leveraged the image-text alignment capabilities of large pre-trained vision-language models, such as CLIP, to achieve remarkable performance. Despite these gains, concept modeling faces two primary challenges. First, current approaches are frequently hindered by pre-training biases, which result in granularity misalignment or an over-reliance on structural priors. Second, the conventional fine-tuning process using Binary Cross-Entropy (BCE) loss treats concepts in isolation. This independent treatment overlooks the mutual exclusivity inherent among concepts, resulting in suboptimal alignment.

To overcome these obstacles, we introduce CoAt-CBM (Concept-wise Attention for Fine-grained Concept Bottleneck Models), a novel framework designed to deliver both adaptive fine-grained image-concept alignment and enhanced interpretability. CoAt-CBM utilizes learnable concept-wise visual queries to dynamically generate fine-grained concept-specific visual embeddings. These embeddings are subsequently employed to construct a concept score vector. Furthermore, we propose a novel concept contrastive optimization strategy that directs the model to manage the relative significance of these concept scores. This approach ensures that concept predictions accurately mirror the image content, thereby improving alignment. Comprehensive experiments confirm that CoAt-CBM consistently surpasses state-of-the-art methodologies. The source code will be released upon acceptance of this work.


Source: arXiv Generated at: 2026-06-03 00:00:00 UTC

Related Articles

TechCrunch

The world’s largest privately owned laser just turned on

Xcimer Energy activated the Phoenix laser, the world’s largest privately owned laser, aiming to commercialize fusion pow...

Uber Targets Doubling Its Fleet of Electric Motorcycles in Kenya
Bloomberg

Uber Targets Doubling Its Fleet of Electric Motorcycles in Kenya

Uber plans to double its electric motorcycle fleet in Kenya. This expansion aims to enhance sustainable transport option...

AI Saves Time But Most Companies Waste the Gain, Study Shows
Bloomberg

AI Saves Time But Most Companies Waste the Gain, Study Shows

A study reveals that while AI saves employee time, most companies fail to capitalize on these gains, squandering potenti...

JPMorgan Lifts S&P Target on Earnings 'Supercycle'
Bloomberg

JPMorgan Lifts S&P Target on Earnings 'Supercycle'

JPMorgan raised its S&P 500 target, citing an earnings “supercycle” that reflects heightened confidence in corporate pro...

Europe Sleepwalking Into Economic Ruin, Serb Leader Says
Bloomberg

Europe Sleepwalking Into Economic Ruin, Serb Leader Says

Serbian leader warns Europe is sleepwalking into economic ruin.

Delta Electronics Flags Power Crunch
Bloomberg

Delta Electronics Flags Power Crunch

Delta Electronics warns of a looming power deficit due to surging demand and constrained production, predicting serious ...