arXiv

Fast Unlearning at Scale via Margin Self-Correction

Title: Scalable Fast Unlearning Through Margin Self-Correction

Abstract: Model unlearning aims to modify a trained language model so that it behaves as though it never encountered specific training instances, all while maintaining performance on other tasks and eliminating the need for expensive full retraining. Current methods generally involve fine-tuning a pretrained model within a fixed computational budget, subsequently choosing the best version by testing multiple saved checkpoints against downstream validation data. This process introduces two major inefficiencies that hinder scalability: the continuation of training past the optimal balance between forgetting and retaining information, and the checkpoint selection phase, which demands additional storage and multiple evaluation cycles.

To overcome these hurdles, we propose MArgin Self-Correction (MASC), a streamlined unlearning technique featuring an online stopping mechanism that eliminates the need for downstream evaluation. When presented with a text sequence to be forgotten, MASC dynamically narrows the logit gap between the original next token and the most probable alternative tokens. The algorithm concludes the unlearning process once this gap averages out to a small value across a substantial majority of token positions within the forget sequences.

Experimental results on the TOFU, MUSE News, and MUSE Books benchmarks demonstrate that MASC delivers a forget-retain balance comparable to existing baselines but at a significantly lower computational expense. Furthermore, our analysis reveals that increasing the model size (i.e., the number of parameters) enhances trade-offs for both MASC and SimNPO; specifically, forget metrics remain consistent while the utility retained by the model improves.


Source: arXiv Generated at: 2026-06-03 00:00:00 UTC

Related Articles

TikTok Billionaire Tops Ambani as Asia’s Second-Richest
Bloomberg

TikTok Billionaire Tops Ambani as Asia’s Second-Richest

TikTok founder surpasses Mukesh Ambani to become Asia’s second-richest person, marking a significant shift in the region...

Publishers in UK can opt out of Google AI search results
BBC News

Publishers in UK can opt out of Google AI search results

UK publishers can now opt out of Google’s AI search summaries, a CMA ruling designed to boost their bargaining power and...

Kioxia Edges Nearer Toyota’s Market Cap in Shakeup to Japan Inc.
Bloomberg

Kioxia Edges Nearer Toyota’s Market Cap in Shakeup to Japan Inc.

Kioxia’s market cap nears Toyota’s, signaling a major shift in Japan’s corporate hierarchy. This narrowing gap highlight...

Reuters

Morning Bid: Marvell, a fitting name for the latest AI darling

Reuters highlights Marvell as a top AI stock, noting its name perfectly suits its status as the newest market darling.

Financial Times

Tim Hayward: I built the Jaguar E-Type of computer keyboards

Tim Hayward compares his bespoke keyboard designs to the Jaguar E-Type. He explores high-end customization for personal ...

Financial Times

AI Labs: Zuckerberg’s $100bn gamble

Meta’s $100 billion AI investment aims to secure AI dominance, but questions remain whether sheer spending can outpace c...