Global News Digest

arXiv

Visual-Noise Guided In-Context Distillation for Multimodal Large Language Model Unlearning

Title: Visual-Noise Guided In-Context Distillation for Multimodal Large Language Model Unlearning

Abstract

Multimodal Large Language Models (MLLMs) have demonstrated significant advancements in vision-language applications; however, their tendency to memorize and disclose sensitive or restricted information has sparked serious concerns regarding privacy and overall safety. Machine Unlearning (MU) has emerged as a viable solution, enabling the removal of specific unwanted knowledge from trained models without the need for complete retraining, thereby maintaining general utility. Despite this potential, achieving effective unlearning in MLLMs is notably difficult.

Current training-based approaches frequently face difficulties in striking a balance between unlearning efficacy and model performance. Conversely, training-free strategies, such as in-context unlearning, safeguard model utility by eschewing parameter updates. Yet, these methods fail to eliminate memorized data at the parameter level and remain susceptible to reverse-engineering attacks. Furthermore, in-context unlearning proves inadequate in multimodal environments, where visual inputs exert strong conditioning signals that can trigger unwanted outputs.

To overcome these limitations, we introduce Visual-Noise Guided In-Context Distillation (VGID), a framework for MLLM unlearning based on distillation. VGID dynamically generates an unlearning-focused teacher distribution from the frozen base model via dual-modal intervention. This process integrates textual in-context unlearning with visual perturbation. The distribution induced by these interventions acts as a teacher signal, steering the student model toward parameter-level unlearning. Notably, this approach eliminates the necessity for external teacher models or explicit annotations of undesirable responses.

Experimental evaluations indicate that VGID delivers robust unlearning performance while maintaining competitive model utility. In a representative scenario, the method reduced the ROUGE-L score of the forget set by 0.371, accompanied by a minimal decrease of 0.055 in the ROUGE-L score of the retain set.


Source: arXiv Generated at: 2026-06-02 00:00:00 UTC

Related Articles

Schroders Renewable Unit Targets AI Assets as Power Demand Soars
Bloomberg

Schroders Renewable Unit Targets AI Assets as Power Demand Soars

Schroders’ renewable unit targets AI infrastructure, pivoting to meet soaring energy demand from artificial intelligence...

State Street's Paglia on SBI Group Partnership, ETFs
Bloomberg

State Street's Paglia on SBI Group Partnership, ETFs

State Street's Paglia discusses the SBI Group partnership and ETFs, but the source text is missing. Please provide the a...

Nvidia Boss Says Workers Should Be Paid ‘as Much as Possible’
Bloomberg

Nvidia Boss Says Workers Should Be Paid ‘as Much as Possible’

Nvidia CEO Jensen Huang advocates for paying workers “as much as possible,” emphasizing maximum compensation. This stanc...

TSE Talking With Regulator For Easing ETF Listing Rules
Bloomberg

TSE Talking With Regulator For Easing ETF Listing Rules

The Tokyo Stock Exchange is discussing with regulators to ease ETF listing rules. This aims to simplify market access an...

S&P DJI CEO on Japan Markets, Mega IPOs
Bloomberg

S&P DJI CEO on Japan Markets, Mega IPOs

S&P DJI CEO discusses Japan's financial markets and major IPOs.