arXiv

Translation Heads: Disentangling meaning from language in LLM-based machine translation

Abstract: While Mechanistic Interpretability (MI) aims to elucidate the internal workings of neural networks, the sheer scale of Large Language Models (LLMs) has so far constrained MI research in Machine Translation (MT) to word-level examinations. This study adopts a mechanistic lens to investigate sentence-level MT, focusing on attention heads to decipher how LLMs internally encode and allocate translation responsibilities. We break down MT into two distinct subtasks: identifying the target language (generating text in the correct language) and maintaining sentence equivalence (preserving the original meaning). Through an analysis of three open-source model families across 20 translation directions, we find that separate, sparse groups of attention heads are specialized for each subtask. Leveraging this finding, we develop subtask-specific steering vectors. Our results demonstrate that adjusting merely 1% of the relevant heads allows for instruction-free MT performance that rivals instruction-based prompting. Conversely, selectively ablating these heads impairs specifically their associated translation functions.
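
The abstract names two head-level interventions, steering and ablation, without giving their exact form. As a rough illustration only, the sketch below shows what such interventions could look like on per-head output vectors. Every name here (`mean_difference`, `steer_heads`, `ablate_heads`, `alpha`) is hypothetical, and both the difference-of-means construction of the steering vector and the zero-ablation are assumptions, not the authors' confirmed method.

```python
from typing import Dict, Iterable, List, Sequence

Vector = List[float]


def mean_difference(pos: Sequence[Vector], neg: Sequence[Vector]) -> Vector:
    """Difference of mean activations between two sets of examples.

    Assumption: a difference of means is one common way to build a
    steering vector; the paper may construct its vectors differently.
    """
    dim = len(pos[0])
    mean_pos = [sum(v[i] for v in pos) / len(pos) for i in range(dim)]
    mean_neg = [sum(v[i] for v in neg) / len(neg) for i in range(dim)]
    return [p - n for p, n in zip(mean_pos, mean_neg)]


def steer_heads(
    head_outputs: Sequence[Vector],
    steering: Dict[int, Vector],
    alpha: float = 1.0,
) -> List[Vector]:
    """Add alpha * steering[h] to the output of each selected head h.

    Heads that are not keys of `steering` are returned unchanged.
    """
    steered = [list(v) for v in head_outputs]
    for h, vec in steering.items():
        steered[h] = [x + alpha * s for x, s in zip(steered[h], vec)]
    return steered


def ablate_heads(
    head_outputs: Sequence[Vector], heads: Iterable[int]
) -> List[Vector]:
    """Zero out the selected heads (zero-ablation, an assumption)."""
    drop = set(heads)
    return [
        [0.0] * len(v) if h in drop else list(v)
        for h, v in enumerate(head_outputs)
    ]
```

In a real model these functions would be applied to the attention-head outputs of a chosen layer, for example through a forward hook, and only to the small set of heads identified for the relevant subtask.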


Source: arXiv Generated at: 2026-06-04 00:00:00 UTC
