arXiv

Learning Empirically Admissible Neural Heuristics for Combinatorial Search

Title: Learning Empirically Admissible Neural Heuristics for Combinatorial Search

Abstract:

Achieving optimal solution paths for classic combinatorial puzzles, including the Rubik’s Cube, Lights Out, and sliding tile variants, continues to pose a significant challenge within the field of artificial intelligence. While heuristic search algorithms like A* ensure optimality, they rely strictly on admissible heuristics—those that do not overestimate the actual remaining cost to reach the goal. Although deep reinforcement learning approaches, such as DeepCubeA, utilize deep neural networks to estimate these cost-to-go values, standard training via mean-squared error (MSE) frequently results in overestimations. Such violations of admissibility undermine the guarantee of optimal solutions.

To address this, we present a robust framework for learning neural heuristics that are calibrated to be admissible. Our method employs an underestimating Admissible Bellman Operator alongside an Asymmetric Loss function designed specifically to penalize overestimation. Furthermore, to mitigate residual errors inherent in neural function approximation, we introduce a post-hoc calibration safety offset derived from validation scrambles. Evaluation results indicate that our calibrated neural heuristics exhibit no admissibility violations under the testing protocol and successfully maintain path optimality. Compared to standard analytical baselines, this approach reduces search node expansions by up to 83.0% on the 2x2 Rubik's Cube, 19.9% on the 3x3 Lights Out grid, and 1.9% on the 8-Puzzle.


Source: arXiv Generated at: 2026-06-04 00:00:00 UTC

Related Articles

China’s Robotaxi Dilemma Shows AI Policy Tension Between Growth and Jobs
Bloomberg

China’s Robotaxi Dilemma Shows AI Policy Tension Between Growth and Jobs

China’s robotaxi expansion highlights the policy tension between driving economic growth through AI and protecting emplo...

Exams watchdog warns of rise in high-tech cheating
BBC News

Exams watchdog warns of rise in high-tech cheating

Ofqual warns of rising high-tech cheating, with smart devices involved in 44% of misconduct cases. Invigilators are trai...

Thailand’s Richest Man Plans $4.3 Billion Expansion Amid AI Boom
Bloomberg

Thailand’s Richest Man Plans $4.3 Billion Expansion Amid AI Boom

Thailand’s wealthiest individual is investing $4.3 billion in expansion, capitalizing on the booming artificial intellig...

Reuters

Amazon unveils new AI warehouse robot in $12 billion Europe push

Amazon unveiled a new AI warehouse robot, marking a key step in its $12 billion European expansion strategy to enhance l...

US Tech Sector Announces Most Job Cuts in Nearly Two Years
Bloomberg

US Tech Sector Announces Most Job Cuts in Nearly Two Years

The US tech sector recorded its highest wave of layoffs in nearly two years, signaling a significant downturn for the in...

Iran Says No Progress in US Talks | The Opening Trade 6/4/2026
Bloomberg

Iran Says No Progress in US Talks | The Opening Trade 6/4/2026

Iran reports no progress in US talks on June 4, 2026. The Opening Trade highlights the ongoing diplomatic impasse betwee...