arXiv

MorphoQuant: Modality-Aware Quantization for Omni-modal Large Language Models

Abstract:

Post-Training Quantization (PTQ) techniques face significant challenges when used to quantize Omni-modal Large Language Models (OLLMs) to 4 bits. The difficulty stems from the stark heterogeneity of data distributions and the differing outlier patterns across modalities. To address this, we present MorphoQuant, a PTQ framework designed to preserve cross-modal morphology and reduce the loss of critical outlier information.

Central to our approach is the Distribution-Aware Bias Compensation (DABC) mechanism, which selectively absorbs long-tailed outliers into channel-wise biases. This preserves the magnitude of outliers while leaving the dense inliers to be discretized at high precision, so quantization stays accurate across the differing distributions of each modality. We further introduce Morphology-Directed Quantization Function Optimization (MDQFO), a technique that co-optimizes the quantization grid together with the bias mask to achieve fine-grained alignment throughout the model.
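The abstract does not spell out how DABC or MDQFO are computed, so the following is only a minimal sketch of the general idea under stated assumptions, not the authors' implementation. It assumes the bias is the per-channel mean, that the bias mask is a simple threshold on that mean, and that "co-optimizing the grid and the mask" can be stood in for by a brute-force search over a clipping ratio and a mask threshold. All names (`fake_quantize`, `channel_bias`, `search`) and the toy data are hypothetical.

```python
import random

def fake_quantize(x, bias, bits=4, clip=1.0):
    """Subtract a per-channel bias, quantize the residual on a symmetric
    uniform grid shared by the whole tensor, then add the bias back."""
    qmax = 2 ** (bits - 1) - 1  # 7 for signed 4-bit
    residual = [[v - b for v, b in zip(row, bias)] for row in x]
    max_abs = max(abs(v) for row in residual for v in row)
    scale = (max_abs * clip / qmax) or 1.0  # fall back to 1.0 if all zeros
    out = []
    for row in residual:
        # Round to the nearest grid point and clamp to the signed range.
        q = [max(-qmax - 1, min(qmax, round(v / scale))) for v in row]
        out.append([qi * scale + b for qi, b in zip(q, bias)])
    return out

def mse(a, b):
    """Mean squared error between two equally shaped nested lists."""
    n = sum(len(row) for row in a)
    return sum((u - v) ** 2 for ra, rb in zip(a, b) for u, v in zip(ra, rb)) / n

def channel_bias(x, threshold):
    """Assumed mask rule: keep the channel mean as a bias only where its
    magnitude exceeds `threshold`; all other channels get a zero bias."""
    n = len(x)
    means = [sum(row[c] for row in x) / n for c in range(len(x[0]))]
    return [m if abs(m) > threshold else 0.0 for m in means]

def search(x, thresholds, clips, bits=4):
    """Brute-force joint search over mask threshold and clipping ratio
    (a stand-in for joint optimization of grid and mask)."""
    best = None
    for t in thresholds:
        bias = channel_bias(x, t)
        for c in clips:
            err = mse(x, fake_quantize(x, bias, bits, c))
            if best is None or err < best[0]:
                best = (err, t, c)
    return best

random.seed(0)
# Toy activations: 64 tokens x 8 channels; channel 3 carries a large offset.
x = [[random.gauss(0.0, 1.0) + (40.0 if c == 3 else 0.0) for c in range(8)]
     for _ in range(64)]

plain_err = mse(x, fake_quantize(x, [0.0] * 8))  # no bias compensation
best_err, best_t, best_c = search(x, [float("inf"), 20.0, 5.0], [1.0, 0.9, 0.8])
print(f"plain 4-bit MSE: {plain_err:.4f}")
print(f"best MSE: {best_err:.4f} (threshold={best_t}, clip={best_c})")
```

In this toy setting, the single offset channel stretches the shared 4-bit grid so far that the inlier channels collapse toward zero. Moving that offset into a full-precision bias lets the grid cover only the dense residual. Real activations, per-modality statistics, and the actual optimization objective would differ from this illustration.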

We evaluate MorphoQuant on the Qwen2.5-Omni model across benchmarks including Video-MME and MMMU, where it outperforms competing methods. Most notably, our W4A4 configuration (4-bit weights and 4-bit activations) achieves 76.63% on ScienceQA. This significantly exceeds current state-of-the-art W4A4 methods and, surprisingly, also outperforms the W4A16 baseline (4-bit weights, 16-bit activations). These results highlight the balance between accuracy and efficiency that our framework offers.


Source: arXiv
Generated at: 2026-06-04 00:00:00 UTC
