arXiv

Test-Time Compute Scaling for ASR with Depth-Conditioned Looped Transformers

Title: Scaling Test-Time Compute in ASR via Depth-Conditioned Looped Transformers

Abstract: Standard end-to-end Automatic Speech Recognition (ASR) architectures rely on acoustic encoders of fixed depth during inference. This constraint complicates the effort to enhance recognition accuracy by leveraging additional test-time computation, as it traditionally necessitates training a more extensive model. While recurrently reusing a shared Transformer block appears to be a logical solution, our analysis reveals that simple looping fails to fully capitalize on the extra computational resources available. To address this, we propose LARM, a depth-conditioned looped Transformer that transforms recurrent encoder depth into a tunable axis for test-time computation.

LARM integrates several key mechanisms: sparse CTC checkpoints, supervision-clock embeddings, FiLM-based depth conditioning, and delayed soft-posterior feedback. Together, these elements organize the recurrent process into distinct recognition checkpoints interspersed with latent refinement stages, enabling the shared weights to adapt and specialize across different recurrent steps. Experiments on the LibriSpeech dataset demonstrate that LARM’s Word Error Rate (WER) decreases as the number of inference loops grows, delivering performance that rivals deeper models with unshared parameters. These findings indicate that the strategy of scaling test-time compute can be successfully extended from autoregressive language model reasoning to the domain of continuous, non-autoregressive speech recognition.


Source: arXiv Generated at: 2026-06-04 00:00:00 UTC

Related Articles

Who’s Excited for SpaceX’s I.P.O.? Space Nerds.
New York Times

Who’s Excited for SpaceX’s I.P.O.? Space Nerds.

Space enthusiasts are the most eager for SpaceX’s IPO, driven by their passion for space exploration.

TechCrunch

Apple touts $1.4 trillion in App Store billings and sales, 90% without a commission

Apple reported $1.4 trillion in App Store billings for 2025, noting 90% were commission-free. Digital sales rose to $149...

Dimon and SpaceX Executives to Pitch IPO to Clients
Bloomberg

Dimon and SpaceX Executives to Pitch IPO to Clients

JPMorgan Chase CEO Jamie Dimon and SpaceX executives are pitching IPO details to clients.

Financial Times

Europe is finally flexing its innovation muscles

The EU’s new tech sovereignty package signals a positive shift from defensive regulation to proactive innovation, markin...

Apollo’s Zelter Expects High-Grade Debt Sales to Top US Treasuries
Bloomberg

Apollo’s Zelter Expects High-Grade Debt Sales to Top US Treasuries

Apollo’s Zelter expects high-grade debt sales to surpass US Treasuries. He anticipates investment-grade debt outperformi...

EU Insurance Watchdog Warns on Loan Risks
Bloomberg

EU Insurance Watchdog Warns on Loan Risks

EIOPA warns insurers to closely monitor loan risks, though initial reports lack specific details on the nature or scope ...