arXiv

Scaling Self-Evolving Agents via Parametric Memory

Title: Scaling Self-Evolving Agents via Parametric Memory

Abstract: Current memory-enhanced LLM agents typically restrict past experiences to the prompt space, utilizing either textual summaries or retrieved passages, while maintaining frozen model parameters during a rollout. Consequently, these systems are limited to looking up prior information rather than learning from it; their decision-making policy remains static regardless of experience, and any details omitted from the context window are lost forever. To address this, we present TMEM, a self-evolving parametric memory framework that enables agents to not only condense history into explicit memory structures but also integrate distilled supervision into fast LoRA weights ($\Delta_t$) through lightweight online updates. This mechanism genuinely modifies the agent’s future behavior within a single episode. We formalize this approach as an agentic decision process characterized by fast-weight rollout dynamics, where actions are drawn from $\pi_{\theta_0+\Delta_t}$, and extraction actions generate supervision signals that update $\Delta_t$ for subsequent decisions. This perspective renders the extraction policy directly optimizable via reinforcement learning: training the base parameters $\theta_0$ enhances not only task-specific actions but also the quality of the data utilized for online LoRA adaptation. Additionally, we introduce SVD-based initialization for the LoRA subspace to expedite online convergence. Evaluations on LoCoMo, LongMemEval-S, multi-objective search, and CL-Bench demonstrate that TMEM consistently surpasses summary-based and retrieval-based baselines across various model scales.


Source: arXiv Generated at: 2026-06-04 00:00:00 UTC

Related Articles

AI Concentration Risk Is the Problem: 3-Minutes MLIV
Bloomberg

AI Concentration Risk Is the Problem: 3-Minutes MLIV

The article argues that AI concentration risk, rather than the technology itself, is the primary concern. It highlights ...

Reuters

Foxconn announces strategic collaboration with Intel on next-gen AI infrastructure

Foxconn and Intel announced a strategic partnership to develop next-generation AI infrastructure. This collaboration aim...

SpaceX Seeks to Raise $75 Billion in Record IPO (Video)
Bloomberg

SpaceX Seeks to Raise $75 Billion in Record IPO (Video)

SpaceX aims for a record $75 billion valuation through an initial public offering. This historic IPO marks a significant...

Broadcom AI Chip Outlook Disappoints Investors
Bloomberg

Broadcom AI Chip Outlook Disappoints Investors

Broadcom’s AI chip projections disappointed investors, dampening market sentiment. The outlook fell short of expectation...

Reuters

Europe's tech 'liberation day'? Computer says not yet

Europe’s expected tech breakthrough remains unrealized, as current systems indicate that a true "liberation day" has not...

Hiranandani Group CEO on Powering India's Digital Future
Bloomberg

Hiranandani Group CEO on Powering India's Digital Future

Hiranandani Group CEO discusses driving India's digital transformation.