arXiv

Value Entanglement: Conflation Between Different Kinds of Good In (Some) Large Language Models

June 4, 2026 · Seong Hah Cho, Junyi Li, Anna Leshinskaya

Abstract: Achieving value alignment in Large Language Models (LLMs) necessitates the empirical assessment of the values these systems have actually internalized. A key feature of human value representation is the ability to differentiate between various types of worth. This study examines whether LLMs similarly distinguish among three specific categories of good: moral, grammatical, and economic. Through an analysis of model behavior, embeddings, and residual stream activations, we identify widespread instances of "value entanglement"—a merging of these otherwise distinct value representations. Our findings indicate that, compared to human standards, both grammatical and economic judgments are disproportionately swayed by moral considerations. However, this conflation can be rectified by selectively removing the activation vectors linked to moral reasoning.
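The abstract does not say how the moral activation vectors are removed. One common way to remove a direction from a residual stream activation is to project it out. The snippet below is a minimal sketch of that idea under stated assumptions, not the authors' implementation: the function name `ablate_direction`, the toy three-dimensional activation, and the `moral_direction` vector are all hypothetical and chosen only for illustration.

```python
import math


def ablate_direction(activation, direction):
    """Return `activation` with its component along `direction` removed.

    Illustrative projection-based ablation; the paper's actual procedure
    is not described in the abstract and may differ.
    """
    norm = math.sqrt(sum(d * d for d in direction))
    if norm == 0:
        raise ValueError("direction must be non-zero")
    # Normalize so the projection coefficient is a plain dot product.
    unit = [d / norm for d in direction]
    coeff = sum(a * u for a, u in zip(activation, unit))
    # Subtract the projection onto the direction from the activation.
    return [a - coeff * u for a, u in zip(activation, unit)]


# Toy residual-stream vector and a hypothetical "moral" direction.
activation = [2.0, 1.0, -1.0]
moral_direction = [1.0, 0.0, 0.0]
ablated = ablate_direction(activation, moral_direction)
```

After the projection, the ablated vector has no component along the chosen direction, while its components orthogonal to that direction are unchanged. In a real model the direction would have to be estimated from data, which this sketch does not attempt.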

Source: arXiv
