arXiv

AURA: Action-Gated Memory for Robot Policies at Constant VRAM

Title: AURA: Action-Gated Memory for Robot Policies at Constant VRAM

Abstract:

While the KV-cache serves as the optimal memory solution for datacenter environments, it is ill-suited for robotic applications. Datacenter inference typically involves batching numerous short requests that are subsequently reset, allowing the attention cache to be amortized across a large user base. In contrast, embodied agents operate through single, lengthy, non-resetting episodes on edge hardware constrained by bandwidth. In these settings, high-bandwidth memory and flash storage are limited resources; flash has restricted write endurance, and memory writes often become the primary bottleneck rather than computational power.

AURA-Mem (Action-Utility Recurrent Adaptive Memory) is designed to address this specific operational regime. It encases a frozen vision-language-action backbone with a recurrent memory of fixed size and a learned gating mechanism. This gate ensures that data is written only when the current observation is likely to alter the subsequent action, effectively creating a memory system that remains silent when unnecessary. Unlike memory models based on reconstruction, this gate is trained directly against a closed-loop action-error signal.

The inference state of AURA-Mem remains constant at 4,224 bytes, irrespective of the time horizon. By comparison, a standard KV-cache expands to 6,061 times its size when processing 100,000 steps. Tests on a controlled synthetic benchmark show that AURA-Mem achieves accuracy comparable to the best O(1) baselines while requiring 5.19 to 6.13 times fewer writes. On simpler configurations, write reduction reaches up to 9.19 times. Random and periodic scheduling strategies, when matched for budget, fail to replicate these gains, confirming that the performance benefit is isolated to the action-surprise signal.

In closed-loop evaluations using a trained OpenVLA-OFT 7B model on LIBERO-Long (with n=60 episodes per arm), the gating mechanism did not compromise success rates. AURA-Mem matched the success rate of the ungated base policy (0.233) and slightly outperformed the always-write KV arm (0.217), all while utilizing 7.0 times fewer writes and maintaining constant memory usage. Additionally, an approximate-information-state value-loss bound was instantiated to demonstrate methodology; at this scale, the bound serves as a theoretical illustration rather than a strict guarantee.


Source: arXiv Generated at: 2026-06-03 00:00:00 UTC

Related Articles

TikTok Billionaire Tops Ambani as Asia’s Second-Richest
Bloomberg

TikTok Billionaire Tops Ambani as Asia’s Second-Richest

TikTok founder surpasses Mukesh Ambani to become Asia’s second-richest person, marking a significant shift in the region...

Publishers in UK can opt out of Google AI search results
BBC News

Publishers in UK can opt out of Google AI search results

UK publishers can now opt out of Google’s AI search summaries, a CMA ruling designed to boost their bargaining power and...

Kioxia Edges Nearer Toyota’s Market Cap in Shakeup to Japan Inc.
Bloomberg

Kioxia Edges Nearer Toyota’s Market Cap in Shakeup to Japan Inc.

Kioxia’s market cap nears Toyota’s, signaling a major shift in Japan’s corporate hierarchy. This narrowing gap highlight...

Reuters

Morning Bid: Marvell, a fitting name for the latest AI darling

Reuters highlights Marvell as a top AI stock, noting its name perfectly suits its status as the newest market darling.

Financial Times

Tim Hayward: I built the Jaguar E-Type of computer keyboards

Tim Hayward compares his bespoke keyboard designs to the Jaguar E-Type. He explores high-end customization for personal ...

Financial Times

AI Labs: Zuckerberg’s $100bn gamble

Meta’s $100 billion AI investment aims to secure AI dominance, but questions remain whether sheer spending can outpace c...