arXiv

Distributional Open-Ended Evaluation of LLM Cultural Value Alignment Based on Value Codebook

Title: Evaluating LLM Cultural Value Alignment Through a Distributional Lens Using a Value Codebook

As large language models (LLMs) are increasingly deployed on a global scale, ensuring their cultural value orientations are properly aligned has become essential for both safety and user engagement. However, current benchmarking methods struggle with the Construct-Composition-Context ($C^3$) challenge. These existing approaches typically rely on discriminative, multiple-choice formats that assess value knowledge rather than genuine orientations, ignore subcultural heterogeneity, and do not align with the nature of real-world open-ended generation.

To address these limitations, we present DOVE, a distributional evaluation framework that directly compares the distributions of human-written text against LLM-generated outputs. DOVE employs a rate-distortion variational optimization objective to build a compact value codebook from 10,000 documents. This process maps text into a structured value space, effectively filtering out semantic noise. To measure alignment, the framework utilizes unbalanced optimal transport, which captures intra-cultural distributional structures and subgroup diversity.

Experiments conducted across 12 different LLMs demonstrate that DOVE offers superior predictive validity, achieving a 31.56% correlation with downstream tasks. Furthermore, the framework maintains high reliability even with small sample sizes, requiring as few as 500 samples per culture.


Source: arXiv Generated at: 2026-06-02 00:00:00 UTC

Related Articles

Law’s Billable Hour Is Being Shredded by AI
Bloomberg

Law’s Billable Hour Is Being Shredded by AI

AI is dismantling the billable hour by automating routine legal tasks. This technological shift threatens the traditiona...

Iran War: Trump Tries to Stop Israel’s Lebanon Push | The Opening Trade 6/2/2026
Bloomberg

Iran War: Trump Tries to Stop Israel’s Lebanon Push | The Opening Trade 6/2/2026

SoftBank in Early Talks to Back $800 Million Agile Robots Round
Bloomberg

SoftBank in Early Talks to Back $800 Million Agile Robots Round

SoftBank is in early talks to back Agile Robots’ $800 million funding round. The Japanese tech giant is currently in pre...

Amundi Is Diversifying Risk Via Commodity Currencies, Gold
Bloomberg

Amundi Is Diversifying Risk Via Commodity Currencies, Gold

Amundi diversifies risk by investing in commodity-linked currencies and gold. This strategy hedges against market volati...

Reuters

Marvell Technology surges after Nvidia's Huang calls it 'next trillion-dollar company'

Marvell Technology shares surged after Nvidia CEO Jensen Huang labeled the firm the “next trillion-dollar company.”

Russia Says It Found Foreign Spyware on Top Officials’ Phones
Bloomberg

Russia Says It Found Foreign Spyware on Top Officials’ Phones

Russia’s FSB claims to have discovered foreign spyware on senior officials’ phones. Moscow attributes the intrusion to h...