arXiv

ReLoRA: Knowledge-Reusing Adaptation for Fast Rollout of Evolving LLM Services

Title: ReLoRA: Efficiently Reusing Knowledge for Rapid Deployment of Evolving Large Language Model Services

Abstract

Large Language Models (LLMs) are increasingly utilized as dynamic services that undergo continuous evolution. Consequently, frequent updates to the foundational model often render previously deployed task-specific Low-Rank Adaptation (LoRA) adapters obsolete. For service providers overseeing a vast portfolio of downstream model services, the standard practice of retraining each LoRA adapter from scratch following every base model update is computationally expensive and significantly hinders service rollout timelines. Conversely, the more straightforward approach of simply applying the original LoRA adapter to the newly updated base model frequently results in diminished service quality, primarily due to incompatibility between the adapter and the new backbone.

To resolve this challenge, we introduce ReLoRA, a framework designed for knowledge-reusing re-adaptation. This approach efficiently restores LoRA adapters to a service-ready state for evolving LLMs, ensuring that task performance is either maintained or enhanced. ReLoRA relies on two primary optimization stages:

  1. Adaptive LoRA Initialization: This step employs Bayesian optimization to establish a starting point that is aware of compatibility issues. It achieves this by integrating data from the previously deployed task adapter with information regarding the evolution of the base model.
  2. Scheduled Regularization Fine-Tuning: The process begins with strong regularization to rapidly guide the adapter toward a high-quality solution space. This is followed by a phase of relaxed regularization to allow for precise, task-specific refinement.

This strategic design facilitates the swift recovery of service quality while minimizing the overhead associated with re-adaptation. Our extensive experimental results indicate that, relative to baseline methods, ReLoRA accelerates time-to-readiness by as much as 8.9$\times$ and boosts accuracy by up to 4.6%.


Source: arXiv Generated at: 2026-06-03 00:00:00 UTC

Related Articles

TikTok Billionaire Tops Ambani as Asia’s Second-Richest
Bloomberg

TikTok Billionaire Tops Ambani as Asia’s Second-Richest

TikTok founder surpasses Mukesh Ambani to become Asia’s second-richest person, marking a significant shift in the region...

Publishers in UK can opt out of Google AI search results
BBC News

Publishers in UK can opt out of Google AI search results

UK publishers can now opt out of Google’s AI search summaries, a CMA ruling designed to boost their bargaining power and...

Kioxia Edges Nearer Toyota’s Market Cap in Shakeup to Japan Inc.
Bloomberg

Kioxia Edges Nearer Toyota’s Market Cap in Shakeup to Japan Inc.

Kioxia’s market cap nears Toyota’s, signaling a major shift in Japan’s corporate hierarchy. This narrowing gap highlight...

Reuters

Morning Bid: Marvell, a fitting name for the latest AI darling

Reuters highlights Marvell as a top AI stock, noting its name perfectly suits its status as the newest market darling.

Financial Times

Tim Hayward: I built the Jaguar E-Type of computer keyboards

Tim Hayward compares his bespoke keyboard designs to the Jaguar E-Type. He explores high-end customization for personal ...

Financial Times

AI Labs: Zuckerberg’s $100bn gamble

Meta’s $100 billion AI investment aims to secure AI dominance, but questions remain whether sheer spending can outpace c...