Global News Digest

arXiv

Coordination Graphs for Constrained Multi-Agent Reinforcement Learning

Title: Coordination Graphs for Constrained Multi-Agent Reinforcement Learning

Abstract: Constrained Multi-Agent Reinforcement Learning (CMARL) is hindered by two interconnected difficulties: the exponential expansion of the joint action space as agent count increases, and the complex coupling of agents imposed by constraints that go beyond simple reward structures. To tackle these issues, we propose Coordination Graphs for Constrained Multi-Agent Reinforcement Learning (CG-CMARL), a novel framework that integrates coordination graphs with Lagrangian duality. This approach breaks down the joint decision-making problem into pairwise interactions, managed by a set of shared Q-functions—one dedicated to the main objective and others to individual constraints. Consequently, the quantity of models required for learning remains constant, regardless of the number of agents. During inference, the Max-Sum message passing algorithm facilitates action coordination across the factor graph, while Lagrangian multipliers manage the balance between objectives and constraints. This mechanism enables a single trained model to explore the entire Pareto front without the need for retraining. We establish convergence guarantees under reasonable assumptions and derive a compositional error bound that isolates distinct, interpretable error sources, each linked to specific design elements and independently adjustable. Empirical evaluations on cooperative navigation scenarios, involving teams of up to 10 agents tasked with reaching target locations while adhering to pairwise constraints, demonstrate that our method generates Pareto fronts that outperform established baselines, which are typically trained at fixed reward-shaping ratios. Furthermore, the approach scales effectively to team sizes where centralized methods are computationally infeasible.


Source: arXiv Generated at: 2026-06-02 00:00:00 UTC

Related Articles

Schroders Renewable Unit Targets AI Assets as Power Demand Soars
Bloomberg

Schroders Renewable Unit Targets AI Assets as Power Demand Soars

Schroders’ renewable unit targets AI infrastructure, pivoting to meet soaring energy demand from artificial intelligence...

State Street's Paglia on SBI Group Partnership, ETFs
Bloomberg

State Street's Paglia on SBI Group Partnership, ETFs

State Street's Paglia discusses the SBI Group partnership and ETFs, but the source text is missing. Please provide the a...

Nvidia Boss Says Workers Should Be Paid ā€˜as Much as Possible’
Bloomberg

Nvidia Boss Says Workers Should Be Paid ā€˜as Much as Possible’

Nvidia CEO Jensen Huang advocates for paying workers ā€œas much as possible,ā€ emphasizing maximum compensation. This stanc...

TSE Talking With Regulator For Easing ETF Listing Rules
Bloomberg

TSE Talking With Regulator For Easing ETF Listing Rules

The Tokyo Stock Exchange is discussing with regulators to ease ETF listing rules. This aims to simplify market access an...

S&P DJI CEO on Japan Markets, Mega IPOs
Bloomberg

S&P DJI CEO on Japan Markets, Mega IPOs

S&P DJI CEO discusses Japan's financial markets and major IPOs.