arXiv

CleanCodec: Efficient and Robust Speech Tokenization via Perceptually Guided Encoding

June 4, 2026 · Eugene Kwek, Feng Liu, Rui Zhang, Wenpeng Yin · Original Source

Title: CleanCodec: Efficient and Robust Speech Tokenization via Perceptually Guided Encoding

Abstract:

Neural audio codecs serve as a critical element in speech processing workflows by converting audio signals into discrete tokens for subsequent modeling. Nevertheless, current codecs often face challenges in balancing reconstruction fidelity with token efficiency. They tend to encode perceptually irrelevant details, such as recording artifacts and background noise, which detracts from the representation of linguistically and acoustically significant content. To address this, we reframe audio tokenization as a selective information bottleneck challenge and introduce CleanCodec, a denoising audio codec designed to retain only perceptually salient features while discarding imperceptible data. Operating at a rate of just 12.5 tokens per second, CleanCodec sets a new standard for tokenization efficiency, significantly surpassing existing solutions in both speaker similarity and speech intelligibility. Furthermore, assessments on downstream applications, including text-to-speech and voice conversion, reveal enhanced performance and inference speeds up to 17 times faster, underscoring substantial efficiency improvements.

Source: arXiv Generated at: 2026-06-04 00:00:00 UTC

Bloomberg

AI Concentration Risk Is the Problem: 3-Minutes MLIV

June 4, 2026

The article argues that AI concentration risk, rather than the technology itself, is the primary concern. It highlights ...

Reuters

Foxconn announces strategic collaboration with Intel on next-gen AI infrastructure

June 4, 2026

Foxconn and Intel announced a strategic partnership to develop next-generation AI infrastructure. This collaboration aim...

Bloomberg

SpaceX Seeks to Raise $75 Billion in Record IPO (Video)

June 4, 2026

SpaceX aims for a record $75 billion valuation through an initial public offering. This historic IPO marks a significant...

Bloomberg

Broadcom AI Chip Outlook Disappoints Investors

June 4, 2026

Broadcom’s AI chip projections disappointed investors, dampening market sentiment. The outlook fell short of expectation...

Reuters

Europe's tech 'liberation day'? Computer says not yet

June 4, 2026

Europe’s expected tech breakthrough remains unrealized, as current systems indicate that a true "liberation day" has not...

Bloomberg

Hiranandani Group CEO on Powering India's Digital Future

June 4, 2026

Hiranandani Group CEO discusses driving India's digital transformation.

Global News Digest

CleanCodec: Efficient and Robust Speech Tokenization via Perceptually Guided Encoding

Related Articles

AI Concentration Risk Is the Problem: 3-Minutes MLIV

Foxconn announces strategic collaboration with Intel on next-gen AI infrastructure

SpaceX Seeks to Raise $75 Billion in Record IPO (Video)

Broadcom AI Chip Outlook Disappoints Investors

Europe's tech 'liberation day'? Computer says not yet

Hiranandani Group CEO on Powering India's Digital Future