arXiv

Description-Code Inconsistency in Real-world MCP Servers: Measurement, Detection, and Security Implications

Title: Description-Code Inconsistency in Real-world MCP Servers: Measurement, Detection, and Security Implications

Abstract:

The Model Context Protocol (MCP) has established itself as a vital standard, enabling Large Language Models (LLMs) to leverage external tools. Within this framework, LLMs depend on natural language descriptions supplied by MCP servers to identify and execute specific functions. This process rests on the implicit assumption that these descriptions accurately mirror the underlying code implementations; however, this alignment is not rigorously verified in practical deployments. Consequently, MCP systems are vulnerable to a phenomenon known as Description-Code Inconsistency (DCI), wherein a tool’s stated capabilities and security parameters diverge from its actual code behavior.

This study offers a thorough examination of DCI within live MCP server environments. We provide a formal definition of the issue and introduce a comprehensive taxonomy that categorizes inconsistencies into functional discrepancies and undeclared side effects. Leveraging this taxonomy, we engineered DCIChecker, an automated framework that integrates structure-aware static analysis with a Direct-Reverse-Arbitration prompting technique to cross-verify tool descriptions against their corresponding code.

We evaluated this framework using a substantial dataset of 19,200 description-code pairs sourced from 2,214 real-world MCP servers. Our findings indicate that DCI is prevalent, affecting 9.93% of the analyzed pairs. Furthermore, we show that DCI establishes a significant defense blind spot, enabling a spectrum of risks ranging from operational failures to covert malicious activities. To address these challenges, we outline mitigation strategies aimed at enforcing semantic consistency and bolstering the reliability of the burgeoning agentic ecosystem.


Source: arXiv Generated at: 2026-06-04 00:00:00 UTC

Related Articles

SpaceX Seeks to Raise $75 Billion in Record IPO (Video)
Bloomberg

SpaceX Seeks to Raise $75 Billion in Record IPO (Video)

SpaceX aims for a record $75 billion valuation through an initial public offering. This historic IPO marks a significant...

Broadcom AI Chip Outlook Disappoints Investors
Bloomberg

Broadcom AI Chip Outlook Disappoints Investors

Broadcom’s AI chip projections disappointed investors, dampening market sentiment. The outlook fell short of expectation...

Hiranandani Group CEO on Powering India's Digital Future
Bloomberg

Hiranandani Group CEO on Powering India's Digital Future

Hiranandani Group CEO discusses driving India's digital transformation.

Cerebras Says It’s Working With All AI Gear Makers Except Nvidia
Bloomberg

Cerebras Says It’s Working With All AI Gear Makers Except Nvidia

Cerebras confirmed partnerships with all major AI hardware vendors except Nvidia. This broad engagement positions Cerebr...

Putin Turns Russia’s AI Future Into a Kremlin Family Business
Bloomberg

Putin Turns Russia’s AI Future Into a Kremlin Family Business

Putin is consolidating Russia’s AI ambitions into a Kremlin family business, effectively turning the sector into a dynas...

Reuters

Meta repeatedly pushes back new AI model release for developers, WSJ says

Meta has repeatedly delayed the release of its new AI model for developers, according to the WSJ. This ongoing postponem...