Global News Digest

arXiv

Query Circuits: Explaining How Language Models Answer User Prompts

Title: Query Circuits: Decoding Language Model Responses to Specific User Prompts

Abstract: Providing local, input-level explanations is essential for understanding why a language model generates a specific output. While current techniques successfully map global capability circuits (such as indirect object identification), they do not explain why a model responds to a particular query the way it does. To bridge this gap, we present "query circuits," a method that directly tracks the information flow within a model as it transforms a given input into an output. Because these circuits are identified within the model itself rather than through surrogate models like sparse autoencoders, they offer explanations that are both more faithful and more computationally efficient.

To ensure query circuits are practical, we tackle two primary obstacles. First, we propose Normalized Deviation Faithfulness (NDF), a robust metric designed to assess how accurately a discovered circuit recovers the model’s decision for a single input; this metric is versatile and applicable to circuit discovery in contexts beyond our specific framework. Second, we create sampling-based techniques to efficiently locate circuits that are sparse yet accurately reflect the model's behavior.
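The abstract names NDF but does not give its formula, so the sketch below does not implement NDF. It illustrates a generic normalized faithfulness score of the kind commonly used in circuit analysis, under our own assumptions: the function name, its arguments, and the example scores are hypothetical. The score is the fraction of the gap between an ablated baseline and the full model that a circuit recovers on one input.

```python
def normalized_faithfulness(full_score: float,
                            circuit_score: float,
                            baseline_score: float) -> float:
    """Fraction of the full-vs-baseline gap that the circuit recovers.

    Illustrative only: this is NOT the paper's NDF definition, which the
    abstract does not state. All three inputs are assumed to be scalar
    scores (for example, a logit difference) measured on a single query.
    """
    gap = full_score - baseline_score
    if gap == 0:
        # With no gap there is nothing to recover, so the ratio is undefined.
        raise ValueError("full_score and baseline_score must differ")
    return (circuit_score - baseline_score) / gap


# Hypothetical scores for one query: full model, sparse circuit, ablated baseline.
example = normalized_faithfulness(full_score=5.0,
                                  circuit_score=3.5,
                                  baseline_score=1.0)
print(example)
```

A value of 1.0 would mean the circuit matches the full model on that query, and 0.0 would mean it does no better than the ablated baseline.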

Our evaluation across several benchmarks, including IOI, arithmetic, MMLU, and ARC, reveals that extremely sparse query circuits exist within models and can restore a significant portion of their performance on individual queries. For instance, a circuit comprising merely 1.3% of the model’s connections is capable of recovering approximately 60% of its performance on MMLU questions. Ultimately, query circuits represent a significant advancement toward scalable and faithful explanations of how language models process individual inputs. The project page is available at https://tony10101105.github.io/query-circuit/.


Source: arXiv
Generated at: 2026-06-02 00:00:00 UTC

Related Articles

Schroders Renewable Unit Targets AI Assets as Power Demand Soars
Bloomberg

Schroders’ renewable unit targets AI infrastructure, pivoting to meet soaring energy demand from artificial intelligence...

State Street's Paglia on SBI Group Partnership, ETFs
Bloomberg

State Street's Paglia discusses the SBI Group partnership and ETFs.

Nvidia Boss Says Workers Should Be Paid ‘as Much as Possible’
Bloomberg

Nvidia CEO Jensen Huang advocates for paying workers “as much as possible,” emphasizing maximum compensation. This stanc...

TSE Talking With Regulator For Easing ETF Listing Rules
Bloomberg

The Tokyo Stock Exchange is in talks with regulators about easing ETF listing rules. This aims to simplify market access an...

S&P DJI CEO on Japan Markets, Mega IPOs
Bloomberg

S&P DJI CEO discusses Japan's financial markets and major IPOs.