arXiv

Attention-Based Sampler for Diffusion Language Models

Original: arXiv:2604.08564v2 Announce Type: replace-cross

Abstract: While auto-regressive models (ARMs) currently dominate the landscape of language modeling, their reliance on strictly sequential sampling creates inherent bottlenecks in both inference speed and modeling adaptability. Diffusion-based large language models (dLLMs) have emerged as a promising alternative, enabling parallel sampling and greater flexibility. Nevertheless, existing dLLM sampling techniques predominantly depend on token-level information, neglecting the broader structural context of the sequence and frequently producing lower-quality generations.

This study investigates the selection of sampling order through the lens of log-likelihood maximization. We demonstrate that this optimization task is NP-hard, leading us to develop an approximation method based on optimal sampling ranks to render the problem computationally feasible. Furthermore, we establish that this tractable objective is maximized when tokens are sampled in descending order of their attention-matrix column sums. This discovery offers a rigorous theoretical foundation for attention-guided sampling, presenting a robust alternative to conventional greedy search strategies.
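The ordering rule stated above can be illustrated with a minimal sketch. It assumes a single square attention matrix given as a list of rows (rows as queries, columns as keys) and a set of still-masked positions. The function names, the toy matrix, and the tie-breaking rule are hypothetical and are not taken from the paper, and the abstract does not say how attention is aggregated across heads or layers.

```python
def attention_column_sums(attn):
    """Sum each column of a square attention matrix (rows = queries, columns = keys)."""
    n = len(attn)
    return [sum(attn[i][j] for i in range(n)) for j in range(n)]


def sampling_order(attn, masked_positions):
    """Return masked positions sorted by descending attention column sum.

    Ties are broken by position index; this tie-break is an assumption.
    """
    sums = attention_column_sums(attn)
    return sorted(masked_positions, key=lambda j: (-sums[j], j))


# Toy 4x4 attention matrix with hypothetical values; each row sums to 1.
attn = [
    [0.10, 0.60, 0.25, 0.05],
    [0.10, 0.50, 0.30, 0.10],
    [0.20, 0.40, 0.30, 0.10],
    [0.20, 0.50, 0.20, 0.10],
]

# Position 1 is assumed to be already decoded; rank the remaining masked ones.
order = sampling_order(attn, masked_positions=[0, 2, 3])
print(order)
```

In this toy case the column sums are roughly 0.6, 2.0, 1.05, and 0.35, so among the masked positions the token at index 2 would be sampled first, followed by index 0 and then index 3.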

To apply these insights, we introduce Attn-Sampler, a novel training-free sampling algorithm, and incorporate dynamic attention thresholding to further boost practical efficiency. Comprehensive evaluations across various benchmarks confirm the efficacy of our approach, showing that it delivers higher generation quality alongside improved parallelism during sampling.
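The abstract does not spell out how dynamic attention thresholding works, so the following is only one plausible reading, not the paper's algorithm: at each step, every masked position whose attention column sum reaches a fixed fraction of the current maximum is decoded in parallel. The function name `select_parallel_batch`, the `ratio` parameter, and the example scores are all hypothetical.

```python
def select_parallel_batch(column_sums, masked_positions, ratio=0.8):
    """Pick masked positions to decode together in one step.

    Assumption: the threshold is `ratio` times the largest column sum among
    the currently masked positions, so it adapts as positions get decoded.
    """
    if not masked_positions:
        return []
    top = max(column_sums[j] for j in masked_positions)
    cutoff = ratio * top
    return [j for j in masked_positions if column_sums[j] >= cutoff]


# Hypothetical attention column sums for a 4-token sequence.
column_sums = [0.6, 2.0, 1.05, 0.35]

# A looser ratio admits more positions per step (more parallelism).
loose = select_parallel_batch(column_sums, [0, 2, 3], ratio=0.5)
# A stricter ratio falls back toward one position per step.
strict = select_parallel_batch(column_sums, [0, 2, 3], ratio=0.9)
print(loose, strict)
```

Under this reading, the ratio trades parallelism against adherence to the strict descending order: a value near 1 decodes almost one token at a time, while smaller values decode several high-scoring tokens per step.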


Source: arXiv Generated at: 2026-06-04 00:00:00 UTC
