arXiv

ChannelTok: Efficient Flexible-Length Vision Tokenization

Abstract:

Current state-of-the-art flexible-length vision tokenizers achieve high quality at a prohibitive cost, typically relying on large, parameter-heavy backbones and slow, multi-stage generative decoders. We depart from this complex spatial-token paradigm and propose ChannelTok, a simple, lightweight, and fast channel-wise flexible-length tokenizer. By treating each latent channel as an individual visual token, our approach enables a parameter-efficient hybrid architecture that combines CNNs and Transformers. During training, we apply a stochastic tail-dropping strategy, which naturally encourages channels to order themselves by semantic importance. This ordering permits flexible compression at inference by simply retaining the first $k$ channels, and it also supports variable-length autoregressive image generation. We validate our method with extensive experiments on ImageNet, showing stable performance across token budgets. Our results set a new standard for quality and efficiency: our model achieves state-of-the-art perceptual quality (rFID 2.92), decodes $8.6\times$ faster, and uses $2.1\times$ fewer parameters (159M) than the next best competitor. These results indicate that channel-wise tokenization is an effective and practical framework for efficient visual representation.
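
The tail-dropping idea and the keep-the-first-$k$ rule can be illustrated with a minimal sketch. This is not the authors' implementation: the function names, the uniform sampling of $k$, and the zero fill for dropped channels are all assumptions made for illustration, and the latent is modeled as a plain list of flattened channels rather than a tensor.

```python
import random


def sample_keep_count(num_channels, rng, min_keep=1):
    """Draw how many leading channels survive a training step.

    Assumption: k is uniform on [min_keep, num_channels]; the abstract
    does not state the actual sampling distribution.
    """
    return rng.randint(min_keep, num_channels)


def tail_drop(channels, k, fill=0.0):
    """Training-time masking: keep the first k channel tokens and
    overwrite every later channel with a constant fill value (assumed 0)."""
    if not 1 <= k <= len(channels):
        raise ValueError("k must be between 1 and the number of channels")
    kept = [list(c) for c in channels[:k]]
    dropped = [[fill] * len(c) for c in channels[k:]]
    return kept + dropped


def truncate(channels, k):
    """Inference-time compression: retain only the first k channel tokens."""
    if not 1 <= k <= len(channels):
        raise ValueError("k must be between 1 and the number of channels")
    return [list(c) for c in channels[:k]]


# Toy latent: 4 channels, each a flattened spatial map of 3 values.
latent = [
    [1.0, 2.0, 3.0],
    [4.0, 5.0, 6.0],
    [7.0, 8.0, 9.0],
    [10.0, 11.0, 12.0],
]

rng = random.Random(0)
k = sample_keep_count(len(latent), rng)
masked = tail_drop(latent, k)   # what a decoder might see during training
short = truncate(latent, 2)     # a 2-token code at inference
```

Because the tail is always the part that gets dropped, earlier channels are present in every training step while later ones are present only some of the time, which is the pressure that would push the most important information toward the front.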

Project page: https://channeltok.github.io


Source: arXiv Generated at: 2026-06-04 00:00:00 UTC
