arXiv

FIRM: Federated In-client Regularized Multi-objective Alignment for Large Language Models

Title: FIRM: Federated In-client Regularized Multi-objective Alignment for Large Language Models

Abstract:

Aligning Large Language Models (LLMs) with human preferences typically requires navigating the tension between competing goals, such as ensuring helpfulness while maintaining harmlessness. This training process is not only computationally demanding but also raises substantial data privacy issues when centralized. While Federated Learning (FL) presents a viable solution, current Federated Multi-Objective Optimization (FMOO) techniques are hindered by significant communication bottlenecks; their dependence on sending multiple gradients to a central server does not scale effectively for large models.

To address these challenges, we present FIRM (Federated In-client Regularized Multi-objective alignment), a new algorithm designed to enhance communication efficiency while mitigating client disagreement drift. FIRM operates by having each client resolve a regularized multi-objective optimization problem locally. This approach removes the necessity for the multi-gradient transmissions characteristic of previous methods, as in-client regularization directly addresses client disagreement drift. As a result, clients are required to send only one set of adapted parameters, thereby preserving high communication efficiency.

We demonstrate that our algorithm converges to Pareto-stationary points and, to the best of our knowledge, offer the first finite-time convergence guarantees within this specific federated multi-objective alignment context. Our empirical results indicate that FIRM yields smoother training dynamics, less client disagreement drift, and better reward trade-offs relative to baseline methods. Additionally, we introduce a technique to embed preferences among objectives, supported by empirical Pareto plots that illustrate FIRM’s ability to smoothly adjust objective trade-offs in accordance with specified preferences.


Source: arXiv Generated at: 2026-06-02 00:00:00 UTC

Related Articles

Law’s Billable Hour Is Being Shredded by AI
Bloomberg

Law’s Billable Hour Is Being Shredded by AI

AI is dismantling the billable hour by automating routine legal tasks. This technological shift threatens the traditiona...

Iran War: Trump Tries to Stop Israel’s Lebanon Push | The Opening Trade 6/2/2026
Bloomberg

Iran War: Trump Tries to Stop Israel’s Lebanon Push | The Opening Trade 6/2/2026

SoftBank in Early Talks to Back $800 Million Agile Robots Round
Bloomberg

SoftBank in Early Talks to Back $800 Million Agile Robots Round

SoftBank is in early talks to back Agile Robots’ $800 million funding round. The Japanese tech giant is currently in pre...

Amundi Is Diversifying Risk Via Commodity Currencies, Gold
Bloomberg

Amundi Is Diversifying Risk Via Commodity Currencies, Gold

Amundi diversifies risk by investing in commodity-linked currencies and gold. This strategy hedges against market volati...

Reuters

Marvell Technology surges after Nvidia's Huang calls it 'next trillion-dollar company'

Marvell Technology shares surged after Nvidia CEO Jensen Huang labeled the firm the “next trillion-dollar company.”

Russia Says It Found Foreign Spyware on Top Officials’ Phones
Bloomberg

Russia Says It Found Foreign Spyware on Top Officials’ Phones

Russia’s FSB claims to have discovered foreign spyware on senior officials’ phones. Moscow attributes the intrusion to h...