arXiv

LifeSide: Benchmarking Agents as Lifelong Digital Companions

Title: LifeSide: Evaluating Agents as Enduring Digital Companions

Abstract: To function as effective lifelong digital companions, AI agents must synthesize cues across multiple sessions, continuously refine their user models, and navigate evolving privacy constraints. Current assessment frameworks fall short in this regard, typically isolating tests for memory retention and short-term empathy. We address this limitation by presenting \benchmark, a new evaluation framework focused on multi-session loops involving Memory, Emotion, and Environment. This benchmark treats users as persistent entities within layered worlds, utilizing multi-agent simulations to integrate environmental shifts into conversational contexts while maintaining the distinction between internal states and outward expressions. Through extensive testing involving 2,000 distinct personas and 111,000 tasks, our results assess capabilities in memory tracking, user comprehension, privacy management, and emotional support. The findings highlight a significant performance gap: despite excelling in standard memory benchmarks, current models struggle to maintain accurate user understanding or provide genuine companionship over extended periods.


Source: arXiv Generated at: 2026-06-04 00:00:00 UTC

Related Articles

TechCrunch

Meta’s Oversight Board says account bans lack due process, transparency

Meta’s Oversight Board criticized account bans for lacking due process and transparency, citing inconsistent enforcement...

TechCrunch

Meta rolls out a new AI creator assistant on Facebook

Meta launched an AI creator assistant on Facebook to streamline analytics and content brainstorming. Initially available...

TechCrunch

What to expect from WWDC 2026: Siri’s highly anticipated revamp and Apple Intelligence updates

WWDC 2026 promises a Siri revamp powered by Google’s Gemini and standalone app, plus AI agents in the App Store and Came...

TechCrunch

A burglar used a Waymo to steal yoga clothes in San Francisco — and got away with it

A thief stole yoga clothes using a Waymo, but police failed to catch them because the car’s video data was deleted and b...

Goldman Sachs CEO David Solomon on the Coming Mega IPOs
Bloomberg

Goldman Sachs CEO David Solomon on the Coming Mega IPOs

Goldman Sachs CEO David Solomon anticipates a surge in major IPOs, signaling renewed market confidence and significant o...

What Are A.I. Agents Actually Doing?
New York Times

What Are A.I. Agents Actually Doing?

Arena research shows tech professionals are most likely to use AI agents at work, highlighting a strong industry trend i...