arXiv

Need to Know: Contextual-Integrity-Grounded Query Rewriting for Privacy-Conscious LLM Delegation

Title: Need to Know: Contextual-Integrity-Grounded Query Rewriting for Privacy-Conscious LLM Delegation

Abstract:

As large language models (LLMs) become deeply integrated into daily professional and personal workflows, queries transmitted to cloud-hosted LLMs frequently contain a mix of information critical to the task and sensitive data that is not. Traditional privacy-preserving methods, which rely on type-based Personally Identifiable Information (PII) redaction, are often context-agnostic. This approach can lead to two significant drawbacks: the unnecessary exposure of untyped sensitive context and the excessive removal of text segments that are essential for generating an answer. To address these challenges, we reframe privacy-preserving query rewriting through the lens of Contextual Integrity, positing that a data span should only be transmitted if it is strictly necessary for the specific task at hand.

We introduce DelegateCI-Bench, the inaugural task-based Contextual Integrity benchmark designed for privacy-aware delegation. This benchmark consists of 3,167 samples, combining high-quality synthetic data across 11 tasks and 20 task types, real-world user queries derived from WildChat, and a specialized medical challenge set featuring dense sensitive information. Leveraging this benchmark, we propose a Contextual Integrity (CI)-guided reinforcement learning framework. This framework transforms essential and non-essential sensitive spans into verifiable optimization signals, enabling the training of a query rewriter that retains task-critical information while suppressing unnecessary sensitive disclosures. Our experimental results demonstrate that the learned rewriter delivers the optimal balance between privacy and utility, achieving an average utility improvement of up to +10.1 over on-device baselines.


Source: arXiv Generated at: 2026-06-04 00:00:00 UTC

Related Articles

TechCrunch

Benchmark raises its first-ever growth fund as part of $2B capital raise

Benchmark Capital launches its first growth fund, raising $2 billion to target later-stage AI deals. This marks a strate...

Netflix Aims to Use AI to Help Viewers Manage Content Overload
Bloomberg

Netflix Aims to Use AI to Help Viewers Manage Content Overload

Netflix uses AI to help viewers manage content overload, tackling the challenge of too many choices.

TSMC CEO Warns Chip Supply Won’t Meet AI-Fueled Demand for Years
Bloomberg

TSMC CEO Warns Chip Supply Won’t Meet AI-Fueled Demand for Years

TSMC CEO warns that chip supply will lag behind surging AI demand for years. This multi-year shortfall highlights the in...

Reuters

TSMC boss upbeat on outlook as AI boom shows no sign of easing

TSMC executives remain optimistic as sustained AI demand shows no signs of slowing, driving strong confidence in the com...

Bitcoin Falls to Pre-Iran Conflict Low as Crypto Slide Extends
Bloomberg

Bitcoin Falls to Pre-Iran Conflict Low as Crypto Slide Extends

Bitcoin drops to its lowest level before the Iran conflict, extending a broader cryptocurrency decline.

Why Amazon Has Struggled to Crack India
Bloomberg

Why Amazon Has Struggled to Crack India

Amazon’s aggressive push for dominance in India has stalled, marking the end of its ambitious expansion efforts. The 202...