arXiv

DeliChess: A Multi-party Dialogue Dataset for Deliberation in Chess Puzzle Solving

Title: DeliChess: A Multi-party Dialogue Dataset for Deliberation in Chess Puzzle Solving

Abstract: Multi-party dialogue serves as a vital environment for investigating collaborative reasoning and decision-making; however, current datasets seldom address structured, complex reasoning tasks that require depth. To bridge this gap, we present DeliChess, a new dataset comprising group deliberation dialogues where participants work together to solve multiple-choice chess puzzles. In this framework, each member first attempts the puzzle independently, followed by a multi-party discussion, after which the group submits a revised collective answer. The dataset comprises 107 dialogues, complete with full transcripts, individual choices made before and after the discussion, and metadata detailing puzzle difficulty and move quality.

We assess performance through three metrics derived from chess engine evaluations, revealing that deliberation notably enhances group accuracy. Additionally, we examine the impact of probing utterances—messages designed to elicit proposals, justifications, or strategic reflection—using a classifier trained on previous deliberation data. Our analysis indicates that while probing increases the variability of group performance post-discussion, it does not consistently result in improved outcomes. Ultimately, our dataset provides a robust testbed for modeling group reasoning, dialogue dynamics, and the resolution of conflicting perspectives and opinions within a clearly defined strategic domain.


Source: arXiv Generated at: 2026-06-04 00:00:00 UTC

Related Articles

TechCrunch

A burglar used a Waymo to steal yoga clothes in San Francisco — and got away with it

A thief stole yoga clothes using a Waymo, but police failed to catch them because the car’s video data was deleted and b...

Goldman Sachs CEO David Solomon on the Coming Mega IPOs
Bloomberg

Goldman Sachs CEO David Solomon on the Coming Mega IPOs

Goldman Sachs CEO David Solomon anticipates a surge in major IPOs, signaling renewed market confidence and significant o...

What Are A.I. Agents Actually Doing?
New York Times

What Are A.I. Agents Actually Doing?

Arena research shows tech professionals are most likely to use AI agents at work, highlighting a strong industry trend i...

TechCrunch

Cash App launches a wand for tap-and-pay

Cash App launched a $25 NFC "Magic Wand" for tap-and-pay, blending viral novelty with practical contactless payments. It...

Databricks CEO Plans to Avoid IPO During Year of Huge Offerings
Bloomberg

Databricks CEO Plans to Avoid IPO During Year of Huge Offerings

Databricks CEO plans to avoid an IPO in 2021, despite a surge in public offerings. This contrasts with earlier reports t...

TechCrunch

Waymo’s spent robotaxi batteries will be used as grid storage

Waymo partners with B2U to repurpose retired robotaxi batteries for grid storage in California and Texas, aligning with ...