arXiv

Value Entanglement: Conflation Between Different Kinds of Good In (Some) Large Language Models

Title: Value Entanglement: The Blurring of Distinct Value Categories in Certain Large Language Models

Abstract: Achieving value alignment in Large Language Models (LLMs) necessitates the empirical assessment of the values these systems have actually internalized. A key feature of human value representation is the ability to differentiate between various types of worth. This study examines whether LLMs similarly distinguish among three specific categories of good: moral, grammatical, and economic. Through an analysis of model behavior, embeddings, and residual stream activations, we identify widespread instances of "value entanglement"—a merging of these otherwise distinct value representations. Our findings indicate that, compared to human standards, both grammatical and economic judgments are disproportionately swayed by moral considerations. However, this conflation can be rectified by selectively removing the activation vectors linked to moral reasoning.


Source: arXiv Generated at: 2026-06-04 00:00:00 UTC

Related Articles

HPE Sponsor Spotlight
Bloomberg

HPE Sponsor Spotlight

HPE Sponsor Spotlight highlights key partners driving innovation. Discover how their solutions enhance enterprise infras...

TechCrunch

Meta steals a tactic from Tesla and builds data centers in tents

Meta builds six large tents in Ohio to cut data center construction time by 50%, mirroring Tesla and xAI’s strategies. T...

Bruce Springsteen’s Anti-Trump Message Isn’t Hurting Business
Bloomberg

Bruce Springsteen’s Anti-Trump Message Isn’t Hurting Business

Stephen Colbert’s anti-Trump stance hasn’t hurt his business, mirroring Bruce Springsteen’s sustained commercial success...

Ciena CEO Rejects Dot-Com Bubble Comparisons
Bloomberg

Ciena CEO Rejects Dot-Com Bubble Comparisons

Ciena’s CEO rejects comparisons to the dot-com bubble, dismissing parallels to that era’s market volatility.

Verizon CEO on Using Tech to Transform Telecom
Bloomberg

Verizon CEO on Using Tech to Transform Telecom

Verizon’s CEO discusses leveraging technology to revolutionize the telecommunications sector, highlighting transformativ...

TechCrunch

Apple approves Poke as the first AI agent on its Messages for Business platform

Apple approved Poke as the first AI agent on its Messages for Business platform, enabling text-based AI interactions via...