arXiv

T$^\star$: Progressive Block Scaling for Masked Diffusion Language Models Through Trajectory Aware Reinforcement Learning

June 4, 2026 · Hanchen Xia, Baoyou Chen, Yutang Ge, Guojiang Zhao, Siyu Zhu · Original Source

Title: T$^\star$: Progressive Block Scaling for Masked Diffusion Language Models Through Trajectory Aware Reinforcement Learning

Abstract: This paper introduces T$^\star$, a straightforward TraceRL-based training curriculum designed for the progressive scaling of block sizes in masked diffusion language models (MDMs). Beginning with an MDM initialized using autoregressive (AR) methods and configured for small blocks, T$^\star$ facilitates a seamless transition to larger block sizes. This approach allows for decoding with significantly higher parallelism while maintaining minimal performance loss on mathematical reasoning benchmarks. Additionally, our further analysis indicates that T$^\star$ might converge toward an alternative decoding schedule that delivers comparable results.

Source: arXiv Generated at: 2026-06-04 00:00:00 UTC

Bloomberg

Glazer Family Members Said to Study Manchester United Stake Sale

June 4, 2026

Reports indicate the Glazer family is evaluating a potential sale of their Manchester United stake, with family members ...

Bloomberg

Ares' Blair Jacbobson: Disconnect Over Private Credit Headlines

June 4, 2026

Ares’ Blair Jacobson argues that private credit headlines misrepresent reality, highlighting a disconnect between media ...

Bloomberg

Nvidia-Backed Robotics Startup Generalist AI Valued at $2 Billion

June 4, 2026

Nvidia-backed robotics startup Generalist AI has reached a $2 billion valuation. Founders Pete Florence, Andy Zeng, and ...

TechCrunch

Oura Ring 5 review: Thinner, lighter, better

June 4, 2026

The Oura Ring 5 is 40% smaller and lighter than its predecessor, offering superior comfort and a discreet, jewelry-like ...

Financial Times

How AI has de-skilled translation

June 4, 2026

AI fragments specialist translation into routine tasks, effectively de-skilling the profession. This shift reduces compl...

Bloomberg

Zurich Insurance Expands Data-Center Offering Beyond the US

June 4, 2026

Zurich Insurance Group is expanding its data center insurance products internationally, extending coverage beyond the Un...

Global News Digest

T$^\star$: Progressive Block Scaling for Masked Diffusion Language Models Through Trajectory Aware Reinforcement Learning

Related Articles

Glazer Family Members Said to Study Manchester United Stake Sale

Ares' Blair Jacbobson: Disconnect Over Private Credit Headlines

Nvidia-Backed Robotics Startup Generalist AI Valued at $2 Billion

Oura Ring 5 review: Thinner, lighter, better

How AI has de-skilled translation

Zurich Insurance Expands Data-Center Offering Beyond the US