arXiv

SAGE: A Quantitative Evaluation of Socialized Evolution in Agent Ecosystems

Title: SAGE: A Quantitative Assessment of Socialized Evolution Within Agent Ecosystems

Abstract:

Traditional evaluations of self-improving language agents typically isolate the subject: an agent tackles a task, obtains feedback, and iteratively adjusts its approach. However, as agents increasingly operate in environments where peers’ strategies and results are publicly observable, a critical, under-researched question emerges: under what circumstances does shared experience yield performance gains that self-improvement alone cannot produce?

To address this, we present SAGE (Social Agent Group Evolution), an evaluation framework designed to compare two conditions matched for computational resources. In the "SocialEvo" condition, agents from five distinct model families co-evolve with access to the complete history of all peers. In the "SelfEvo" condition, each agent undergoes the same number of task attempts but is restricted to viewing only its own past—a standard practice in current self-improvement research.

We applied SAGE across three domains: open-ended machine learning research, long-horizon economic planning, and strategic multiplayer gaming, assessing performance over multiple evolutionary rounds. Our results indicate that group history is not a universal performance booster; the top-performing agents do not surpass the limits set by their own self-evolution. Nevertheless, agents that hit a plateau through self-improvement alone can achieve substantial breakthroughs when peer experiences are accessible.

In competitive scenarios, counterfactual controls demonstrate that improvements are general rather than specific to particular opponents. Furthermore, the study of different shared history formats reveals that filtered peer traces and reflective summaries frequently outperform raw logs. This suggests that social gains rely more on the ability to abstract and transfer knowledge from public records than on the sheer volume of exposure. Ultimately, these findings highlight that the benefits of peer history are contingent upon the agent, the specific arena, and the capacity to distill transferable insights from public traces.


Source: arXiv Generated at: 2026-06-03 00:00:00 UTC

Related Articles

TikTok Billionaire Tops Ambani as Asia’s Second-Richest
Bloomberg

TikTok Billionaire Tops Ambani as Asia’s Second-Richest

TikTok founder surpasses Mukesh Ambani to become Asia’s second-richest person, marking a significant shift in the region...

Publishers in UK can opt out of Google AI search results
BBC News

Publishers in UK can opt out of Google AI search results

UK publishers can now opt out of Google’s AI search summaries, a CMA ruling designed to boost their bargaining power and...

Kioxia Edges Nearer Toyota’s Market Cap in Shakeup to Japan Inc.
Bloomberg

Kioxia Edges Nearer Toyota’s Market Cap in Shakeup to Japan Inc.

Kioxia’s market cap nears Toyota’s, signaling a major shift in Japan’s corporate hierarchy. This narrowing gap highlight...

Reuters

Morning Bid: Marvell, a fitting name for the latest AI darling

Reuters highlights Marvell as a top AI stock, noting its name perfectly suits its status as the newest market darling.

Financial Times

Tim Hayward: I built the Jaguar E-Type of computer keyboards

Tim Hayward compares his bespoke keyboard designs to the Jaguar E-Type. He explores high-end customization for personal ...

Financial Times

AI Labs: Zuckerberg’s $100bn gamble

Meta’s $100 billion AI investment aims to secure AI dominance, but questions remain whether sheer spending can outpace c...