arXiv

Beyond Prompt-Based Planning: MCP-Native Graph Planning-based Biomedical Agent System

Title: Moving Past Prompt-Driven Scheduling: A Graph-Based Biomedical Agent Framework Native to MCP

Abstract:

While biomedical agents hold the potential to streamline intricate biological processes, existing frameworks are hindered by two primary constraints. First, bioinformatics utilities exhibit significant diversity in their interfaces and execution environments. Second, agent scheduling continues to depend on flat, prompt-retrieved tool descriptions. As the ecosystem of biomedical software expands, the link between the volume of available tools and context window limits results in inefficient execution, unstable planning, and tool confusion.

To address these challenges, we present BioManus, a biomedical agent designed natively for the Model Context Protocol (MCP). BioManus utilizes graph-scaffolded planning to manage structured biological capabilities. The system begins with the BioinfoMCP Compiler, a component that transforms disparate bioinformatics software into standardized MCP servers, thereby generating a comprehensive executable MCP ecosystem. Subsequently, this ecosystem is organized as a typed, heterogeneous MCP graph encompassing tools, operations, data types, and workflow stages.

During inference, BioManus identifies and retrieves compact, task-specific subgraphs, which are then used to synthesize scaffolds for operation-level workflows. This architectural approach separates the complexity of planning from the sheer size of the tool inventory. It achieves a context compression ratio of $\Theta(N / (h \cdot \bar{m}))$ under high-recall retrieval conditions, where $N$ represents the total number of tools, $h$ denotes the workflow horizon, and $\bar{m}$ (significantly smaller than $N$) indicates the average number of candidate tools per operation.

Evaluations on the BioAgentBench and LAB-Bench datasets demonstrate that BioManus outperforms advanced biomedical agent baselines in terms of context efficiency, workflow validity, and execution accuracy. These findings indicate a necessary paradigm shift: achieving scalable biomedical reasoning depends on utilizing structured, executable capability graphs rather than relying on increasingly large prompt-level tool retrievals.


Source: arXiv Generated at: 2026-06-04 00:00:00 UTC

Related Articles

AI Concentration Risk Is the Problem: 3-Minutes MLIV
Bloomberg

AI Concentration Risk Is the Problem: 3-Minutes MLIV

The article argues that AI concentration risk, rather than the technology itself, is the primary concern. It highlights ...

Reuters

Foxconn announces strategic collaboration with Intel on next-gen AI infrastructure

Foxconn and Intel announced a strategic partnership to develop next-generation AI infrastructure. This collaboration aim...

SpaceX Seeks to Raise $75 Billion in Record IPO (Video)
Bloomberg

SpaceX Seeks to Raise $75 Billion in Record IPO (Video)

SpaceX aims for a record $75 billion valuation through an initial public offering. This historic IPO marks a significant...

Broadcom AI Chip Outlook Disappoints Investors
Bloomberg

Broadcom AI Chip Outlook Disappoints Investors

Broadcom’s AI chip projections disappointed investors, dampening market sentiment. The outlook fell short of expectation...

Reuters

Europe's tech 'liberation day'? Computer says not yet

Europe’s expected tech breakthrough remains unrealized, as current systems indicate that a true "liberation day" has not...

Hiranandani Group CEO on Powering India's Digital Future
Bloomberg

Hiranandani Group CEO on Powering India's Digital Future

Hiranandani Group CEO discusses driving India's digital transformation.