arXiv

SharedRequest: Privacy-Preserving Model-Agnostic Inference for Large Language Models

Abstract

As public large language models (LLMs) like ChatGPT become ubiquitous, safeguarding the privacy of user prompts has emerged as a pressing concern. Current approaches to privacy-preserving inference typically force a trade-off between utility and efficiency, and they frequently demand model-specific adjustments that hinder broad compatibility. To address these limitations, we introduce SharedRequest, a framework that enables privacy-preserving LLM inference without being tied to a specific model architecture. Unlike previous methods that focus on individual prompts, SharedRequest shifts the privacy protection mechanism to the batch level. Its core strategy involves masking sensitive data by blending original prompts with noisy versions and clustering semantically similar instructions. This approach allows the system to distribute inference costs across a substantial number of queries, thereby minimizing any negative effect on the quality of LLM responses. Because it operates independently of LLM architecture, SharedRequest does not require access to model parameters or structural modifications. Our empirical evaluations show that SharedRequest delivers more than 20% greater utility than existing differential privacy baselines. Furthermore, its shared-prompt mechanism cuts query costs by as much as 5 times when compared to standard non-batched inference.
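
The batch-level idea can be illustrated with a toy sketch. This is not the paper's algorithm: the abstract does not specify how noise is generated or how instructions are clustered, so everything below is an assumption made for illustration. Exact matching on a normalized instruction string stands in for semantic clustering, random word substitution stands in for the noisy prompt versions, and the names `make_decoy`, `build_shared_requests`, and `render_prompt` are hypothetical.

```python
import random
from collections import defaultdict


def make_decoy(text, rng, vocab):
    """Return a noisy copy of text (assumed noise model: each word is
    replaced by a random vocabulary word with probability 0.5)."""
    return " ".join(
        rng.choice(vocab) if rng.random() < 0.5 else w for w in text.split()
    )


def build_shared_requests(queries, decoys_per_query=1, seed=0):
    """Group (instruction, payload) queries by instruction and mix in decoys.

    Returns a list of (instruction, items). Each item is (owner, text), where
    owner is the index of the original query, or None for a decoy. The owner
    field is meant to stay on the client side; only the texts would be sent.
    """
    rng = random.Random(seed)
    vocab = sorted({w for _, payload in queries for w in payload.split()})
    groups = defaultdict(list)
    for idx, (instruction, payload) in enumerate(queries):
        # Stand-in for semantic clustering: exact match after normalization.
        key = instruction.strip().lower()
        groups[key].append((idx, payload))
        for _ in range(decoys_per_query):
            groups[key].append((None, make_decoy(payload, rng, vocab)))
    requests = []
    for instruction, items in groups.items():
        rng.shuffle(items)  # hide which positions hold real inputs
        requests.append((instruction, items))
    return requests


def render_prompt(instruction, items):
    """Render one shared prompt: the instruction once, then numbered inputs."""
    lines = [instruction]
    lines += [f"{i + 1}. {text}" for i, (_, text) in enumerate(items)]
    return "\n".join(lines)


queries = [
    ("Summarize:", "patient reports chest pain after exercise"),
    ("summarize:", "invoice is thirty days overdue"),
    ("Translate to French:", "good morning everyone"),
]
requests = build_shared_requests(queries)
prompts = [render_prompt(instr, items) for instr, items in requests]
```

In this sketch the three queries collapse into two shared requests, since the two summarization queries share one instruction. That is the sense in which the cost of a request can be spread over several queries.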


Source: arXiv
Generated at: 2026-06-04 00:00:00 UTC
