arXiv

SANE Schema-aware Natural-language Evaluation of Biological Data

Title: SANE: Schema-Grounded Natural Language Assessment for Biological Databases

Abstract: While high-throughput microscopy produces vast, organized datasets detailing cellular reactions to drug treatments, retrieving information from these collections usually demands proficiency in SQL. Although large language models (LLMs) present a natural-language interface as an alternative, their propensity for hallucination casts doubt on the trustworthiness of their outputs. To address this, we introduce SANE (Schema-Aware Natural-language Evaluation), a new framework for domain-specific text-to-SQL assessment. SANE utilizes benchmarks that are automatically generated and anchored in real-world, specific experimental structures, thereby making the evaluation process more scalable, systematic, and reproducible.

Our study employs SANE to assess a few-shot LLM, demonstrating that precise query generation is possible without any model training or fine-tuning, provided that constrained schemas are used alongside structured prompting and guardrails. We found that most errors do not result from faulty SQL syntax but rather from ambiguous or underspecified user inputs. These issues typically lead to excessive clarification requests or responses to queries that require disambiguation before answering. Consequently, our findings suggest that few-shot LLMs can deliver reliable database access in well-defined domains when paired with schema-aware prompting techniques.


Source: arXiv Generated at: 2026-06-04 00:00:00 UTC

Related Articles

TechCrunch

Meta’s Oversight Board says account bans lack due process, transparency

Meta’s Oversight Board criticized account bans for lacking due process and transparency, citing inconsistent enforcement...

Fed's Daly Says Forward Guidance Could Be Misleading
Bloomberg

Fed's Daly Says Forward Guidance Could Be Misleading

Fed’s Daly warns forward guidance may be misleading or lack clarity.

TechCrunch

Meta rolls out a new AI creator assistant on Facebook

Meta launched an AI creator assistant on Facebook to streamline analytics and content brainstorming. Initially available...

TechCrunch

What to expect from WWDC 2026: Siri’s highly anticipated revamp and Apple Intelligence updates

WWDC 2026 promises a Siri revamp powered by Google’s Gemini and standalone app, plus AI agents in the App Store and Came...

TechCrunch

A burglar used a Waymo to steal yoga clothes in San Francisco — and got away with it

A thief stole yoga clothes using a Waymo, but police failed to catch them because the car’s video data was deleted and b...

Goldman Sachs CEO David Solomon on the Coming Mega IPOs
Bloomberg

Goldman Sachs CEO David Solomon on the Coming Mega IPOs

Goldman Sachs CEO David Solomon anticipates a surge in major IPOs, signaling renewed market confidence and significant o...