arXiv

Read What You Hear: Reference-Free Hypotheses Evaluation with Acoustic Discrepancy

Abstract: Traditional automatic speech recognition (ASR) evaluation typically depends on reference transcriptions, while reference-free methods usually rely on internal confidence scores or supplementary language models. To reduce this reliance, we introduce READ (Reference-free Hypothesis Evaluation with Acoustic Discrepancy), a new metric that assesses ASR hypotheses directly against the audio signal by prioritizing acoustic grounding. READ uses a pretrained auto-regressive text-to-speech (TTS) model to compute the conditional likelihood of the speech tokens given a text hypothesis, thereby quantifying the fine-grained acoustic discrepancy between the spoken audio and the written text. Notably, READ can be applied to hypothesis refinement without any additional training. Our experiments show that READ correlates with distinct recognition errors and improves ASR performance, yielding a relative error rate reduction of up to 20%, with the largest improvements observed in noisy conditions.
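The scoring idea in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the `token_log_probs` callable is a hypothetical interface standing in for a pretrained auto-regressive TTS model, `toy_token_log_probs` is a toy substitute with made-up probabilities, and the length normalization is an assumption that the abstract does not specify.

```python
import math
from typing import Callable, Sequence

# Hypothetical interface (an assumption, not a real library API): given a text
# hypothesis and the discrete speech tokens of an utterance, return
# log p(token_t | tokens_<t, text) for every token position t.
TokenLogProbFn = Callable[[str, Sequence[int]], Sequence[float]]


def acoustic_discrepancy(
    text: str, speech_tokens: Sequence[int], token_log_probs: TokenLogProbFn
) -> float:
    """Mean negative log-likelihood of the speech tokens given the text.

    Lower values mean the text explains the audio better. Dividing by the
    number of tokens is an assumed normalization.
    """
    log_probs = token_log_probs(text, speech_tokens)
    if not log_probs or len(log_probs) != len(speech_tokens):
        raise ValueError("expected one log-probability per speech token")
    return -sum(log_probs) / len(log_probs)


def pick_hypothesis(
    hypotheses: Sequence[str],
    speech_tokens: Sequence[int],
    token_log_probs: TokenLogProbFn,
) -> str:
    """Return the hypothesis with the lowest acoustic discrepancy."""
    return min(
        hypotheses,
        key=lambda h: acoustic_discrepancy(h, speech_tokens, token_log_probs),
    )


def toy_token_log_probs(text: str, speech_tokens: Sequence[int]) -> list:
    """Toy stand-in for a TTS model: one expected token per non-space character.

    A matching token gets probability 0.9, anything else 0.05. These numbers
    are arbitrary and only serve to make the sketch runnable.
    """
    expected = [ord(c) % 32 for c in text if c != " "]
    return [
        math.log(0.9 if t < len(expected) and tok == expected[t] else 0.05)
        for t, tok in enumerate(speech_tokens)
    ]


# Toy "audio": speech tokens that correspond to the words "the cat".
speech_tokens = [ord(c) % 32 for c in "thecat"]
n_best = ["the cap", "the cat"]
best = pick_hypothesis(n_best, speech_tokens, toy_token_log_probs)
```

In this toy setup `best` is the hypothesis whose characters line up with every speech token. With a real TTS model, the same selection step would need no training beyond the pretrained model, which is consistent with the training-free refinement the abstract describes.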


Source: arXiv Generated at: 2026-06-04 00:00:00 UTC
