arXiv

Probing Outcome-Level Resemblance and Mechanism-Level Alignment in LLM Risk Decisions: Evidence from the St. Petersburg Game

Title: Examining the Gap Between Outcome Similarity and Mechanism Consistency in LLM Risk Assessment: Insights from the St. Petersburg Paradox

Abstract:

Large language models (LLMs) may appear cautious in risk-related tasks, but outwardly prudent outputs do not necessarily reflect alignment with human decision-making processes. To examine this distinction, we used the St. Petersburg game, a classic paradox in which the expected value is infinite yet human willingness to pay is low, as a controlled experimental framework. We assessed 28 LLMs with a prompt suite comprising the original game and controlled variants that manipulated truncation, repeated play, numeric endowments, and occupational identity. The suite also included prompts designed to elicit human-perspective reasoning, as well as paired comparisons between base models and their instruction-tuned counterparts.

In the baseline St. Petersburg task, the majority of models produced finite bids, superficially mimicking human risk aversion. However, this outcome-level similarity obscured significant divergences in the underlying mechanisms. Analysis of the controlled variants indicated that models frequently abandoned human-like behavior in favor of conditional and computational rationality when specific constraints were introduced. Although instruction tuning and human-cue prompting generally lowered bids and mitigated certain observable anomalies, the mechanism-level response patterns remained largely stable. These results suggest that behavioral alignment in risk decision-making can be merely superficial: LLMs can generate human-like decisions without adopting human-consistent reasoning processes. Consequently, high-stakes evaluations of LLM decision-making must go beyond outcome similarity and assess mechanism-level consistency.
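The "infinite expected value" of the game and the effect of a truncation variant can be illustrated with a short calculation. The sketch below is not taken from the paper. It assumes the common textbook payoff convention (a fair coin is tossed until the first head, and a head on toss k pays 2^k), assumes a truncated game pays nothing if no head appears within the limit, and uses hypothetical function names. It also computes the certainty equivalent under Bernoulli's logarithmic utility with zero initial wealth, one classic account of why human bids stay small, which is not necessarily the mechanism the authors test.

```python
import math
from fractions import Fraction


def truncated_expected_value(max_tosses: int) -> Fraction:
    """Expected payoff when the game stops after at most `max_tosses` tosses.

    Assumed convention: first head on toss k (probability 1/2**k) pays 2**k;
    no head within the limit pays 0.
    """
    if max_tosses < 0:
        raise ValueError("max_tosses must be non-negative")
    # Each toss k contributes (1/2**k) * 2**k = 1, so the sum equals max_tosses.
    return sum(
        (Fraction(1, 2**k) * 2**k for k in range(1, max_tosses + 1)),
        Fraction(0),
    )


def log_utility_certainty_equivalent(terms: int = 200) -> float:
    """Certainty equivalent of the untruncated game under u(x) = ln(x).

    Assumes zero initial wealth; `terms` truncates the series numerically.
    """
    expected_log_payoff = sum(
        (0.5**k) * math.log(2**k) for k in range(1, terms + 1)
    )
    return math.exp(expected_log_payoff)


if __name__ == "__main__":
    for n in (1, 10, 100):
        print(n, truncated_expected_value(n))
    print(log_utility_certainty_equivalent())
```

Under these assumptions the truncated expected value equals the number of allowed tosses, so it grows without bound as the limit is lifted, while the log-utility certainty equivalent stays close to 4. An expected-value maximizer and a log-utility agent therefore respond very differently to the truncation limit, which shows how a controlled variant of this kind can separate outcome-level similarity from mechanism-level consistency.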


Source: arXiv Generated at: 2026-06-04 00:00:00 UTC
