arXiv

Simulate, Reason, Decide: Scientific Reasoning with LLMs for Simulation-Driven Decision Making

Title: Simulate, Reason, Decide: Scientific Reasoning with LLMs for Simulation-Driven Decision Making

Abstract:

High-stakes decision-making processes are increasingly leveraging LLM-driven systems that incorporate scientific simulators. However, current frameworks typically treat these simulators as opaque black-box interfaces, employing LLMs merely to generate, calibrate, or execute them. This approach fails to treat simulators as structured mechanistic systems capable of being reasoned about, thereby preventing the identification, representation, and analysis of the assumptions and mechanisms that drive simulator behavior. Consequently, existing methods suffer from limited transparency, auditability, and the ability to justify decisions.

To address these limitations, we present MechSim, a neuro-symbolic reasoning framework grounded in mechanisms designed for executable scientific simulators. In contrast to previous neuro-symbolic models that focus on static symbolic structures, MechSim empowers LLM agents to reason directly about the mechanisms, underlying assumptions, and execution dynamics of scientific simulators. The framework utilizes a shared structured schema to represent simulators, capturing critical elements such as assumptions, variables, mechanism dependencies, and execution traces. Building upon this representation, LLM agents function as constrained reasoning engines, producing structured explanations that are grounded in evidence and explicitly link simulator outcomes to their foundational mechanisms. Our evaluation across several high-stakes domains demonstrates that this approach enhances the quality of mechanism-level explanations, improves simulator analysis, and increases the reliability of downstream decision-making.


Source: arXiv Generated at: 2026-06-04 00:00:00 UTC

Related Articles

SpaceX Seeks to Raise $75 Billion in Record IPO (Video)
Bloomberg

SpaceX Seeks to Raise $75 Billion in Record IPO (Video)

SpaceX aims for a record $75 billion valuation through an initial public offering. This historic IPO marks a significant...

Broadcom AI Chip Outlook Disappoints Investors
Bloomberg

Broadcom AI Chip Outlook Disappoints Investors

Broadcom’s AI chip projections disappointed investors, dampening market sentiment. The outlook fell short of expectation...

Hiranandani Group CEO on Powering India's Digital Future
Bloomberg

Hiranandani Group CEO on Powering India's Digital Future

Hiranandani Group CEO discusses driving India's digital transformation.

Cerebras Says It’s Working With All AI Gear Makers Except Nvidia
Bloomberg

Cerebras Says It’s Working With All AI Gear Makers Except Nvidia

Cerebras confirmed partnerships with all major AI hardware vendors except Nvidia. This broad engagement positions Cerebr...

Putin Turns Russia’s AI Future Into a Kremlin Family Business
Bloomberg

Putin Turns Russia’s AI Future Into a Kremlin Family Business

Putin is consolidating Russia’s AI ambitions into a Kremlin family business, effectively turning the sector into a dynas...

Reuters

Meta repeatedly pushes back new AI model release for developers, WSJ says

Meta has repeatedly delayed the release of its new AI model for developers, according to the WSJ. This ongoing postponem...