arXiv

CAPER: Clause-Aligned Process Supervision for Text-to-SQL

Abstract:

Standard evaluations of Text-to-SQL systems rely on query-level execution correctness. This final signal offers limited insight into which intermediate SQL decisions caused a success or a failure. Dense token-level supervision is also ineffective: individual SQL tokens rarely correspond to complete semantic decisions, token-level labels can unfairly penalize execution-equivalent queries, and such labels are hard to produce reliably at scale.

To address these issues, we introduce CAPER, which automatically generates clause-level supervision by applying counterfactual interventions to the SQL abstract syntax tree (AST), enabling root-cause error localization for reward modeling. We use the generated data to train CAPER-9B, a lightweight clause-level process reward model (Clause-PRM) that provides feedback at clause boundaries for both policy optimization and candidate verification.
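The idea of labeling clauses through counterfactual interventions can be illustrated with a minimal sketch. It is not the CAPER pipeline: the abstract does not describe the actual procedure, so everything here is an assumption made for illustration. In particular, the query is represented as a flat dictionary of clauses rather than a real AST, the intervention swaps one gold clause at a time into the predicted query, and the names `render`, `run`, and `label_clauses` are hypothetical.

```python
import sqlite3

# Assumption: a flat clause dictionary stands in for the SQL AST.
CLAUSE_ORDER = ["SELECT", "FROM", "WHERE", "GROUP BY", "ORDER BY"]

def render(clauses):
    """Join a clause dictionary into a SQL string in canonical clause order."""
    return " ".join(f"{k} {clauses[k]}" for k in CLAUSE_ORDER if clauses.get(k))

def run(conn, sql):
    """Execute a query; return its rows in a canonical order, or None on error.

    Simplification: rows are compared order-insensitively, so ORDER BY
    mistakes are not detected by this sketch.
    """
    try:
        return sorted(conn.execute(sql).fetchall(), key=repr)
    except sqlite3.Error:
        return None

def label_clauses(conn, predicted, gold):
    """Label each clause of `predicted` as 'ok', 'differs', or 'root_cause'."""
    gold_rows = run(conn, render(gold))
    # Execution-equivalent predictions are not penalized at all.
    if run(conn, render(predicted)) == gold_rows:
        return {name: "ok" for name in CLAUSE_ORDER}
    labels = {}
    for name in CLAUSE_ORDER:
        if predicted.get(name) == gold.get(name):
            labels[name] = "ok"
            continue
        # Counterfactual intervention: replace only this clause with gold.
        patched = dict(predicted)
        patched[name] = gold.get(name)
        fixed = run(conn, render(patched)) == gold_rows
        labels[name] = "root_cause" if fixed else "differs"
    return labels

# Toy database and queries (illustrative data only).
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE emp (name TEXT, dept TEXT, salary INTEGER)")
conn.executemany(
    "INSERT INTO emp VALUES (?, ?, ?)",
    [("Ann", "eng", 120), ("Bob", "eng", 90), ("Cy", "ops", 100)],
)
gold = {"SELECT": "name", "FROM": "emp", "WHERE": "salary > 95"}
predicted = {"SELECT": "name", "FROM": "emp", "WHERE": "salary > 125"}
labels = label_clauses(conn, predicted, gold)
print(labels)
```

In this toy case only the `WHERE` clause is marked as the root cause, because swapping in the gold `WHERE` clause alone restores the gold result. A prediction such as `salary >= 100`, which differs textually but returns the same rows, is labeled `ok` throughout, which mirrors the abstract's concern about penalizing execution-equivalent queries.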

Evaluations on the BIRD and Spider datasets show that clause-aligned supervision significantly improves execution accuracy (EX), with a relative improvement of up to 15.3% in EX compared to GPT-5.4. It also improves failure localization, reaching 84.53% accuracy and an MRR of 90.60% on held-out failures. Further details are available on our project page: https://github.com/banrichard/RL-NL2SQL.
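For readers unfamiliar with the two localization metrics, the sketch below shows how they are commonly computed. It assumes that "accuracy" means top-1 accuracy (the highest-ranked clause is the faulty one) and that MRR is the standard mean reciprocal rank; the abstract does not spell out either definition. The function name `localization_metrics` and the toy rankings are hypothetical and are not results from the paper.

```python
def localization_metrics(rankings, true_clauses):
    """Return (top-1 accuracy, mean reciprocal rank) for ranked clause lists.

    rankings[i] is a list of clause names ordered from most to least suspect;
    true_clauses[i] is the clause that actually caused failure i.
    """
    hits = 0
    reciprocal_sum = 0.0
    for ranked, truth in zip(rankings, true_clauses):
        if ranked and ranked[0] == truth:
            hits += 1
        if truth in ranked:
            # Rank is 1-based; a missing truth contributes 0.
            reciprocal_sum += 1.0 / (ranked.index(truth) + 1)
    n = len(true_clauses)
    return hits / n, reciprocal_sum / n

# Toy example: the faulty clause is ranked 1st, 2nd, and 3rd respectively.
rankings = [
    ["WHERE", "SELECT", "FROM"],
    ["SELECT", "WHERE", "FROM"],
    ["FROM", "SELECT", "WHERE"],
]
true_clauses = ["WHERE", "WHERE", "WHERE"]
accuracy, mrr = localization_metrics(rankings, true_clauses)
print(f"accuracy={accuracy:.4f} mrr={mrr:.4f}")
```

MRR is never lower than top-1 accuracy under these definitions, since every top-1 hit contributes a full point and near misses still earn partial credit. That is consistent with the reported MRR exceeding the reported accuracy.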


Source: arXiv
Generated at: 2026-06-03 00:00:00 UTC
