arXiv

Agent Skills for Large Language Models: Architecture, Acquisition, Security, and the Path Forward

Title: Agent Skills for Large Language Models: Architecture, Acquisition, Security, and the Path Forward

Original: arXiv:2602.12430v4 Announce Type: replace-cross Abstract: The transition from monolithic language models to modular, skill-equipped agents marks a defining shift in how large language models (LLMs) are deployed in practice. Rather than encoding all procedural knowledge within model weights, agent skills -- composable packages of instructions, code, and resources that agents load on demand -- enable dynamic capability extension without retraining. It is formalized in a paradigm of progressive disclosure, portable skill definitions, and integration with the Model Context Protocol (MCP). This survey provides a comprehensive treatment of the agent skills landscape, as it has rapidly evolved during the last few months. We organize the field along four axes: (i) architectural foundations, examining the SKILL$.$md specification, progressive context loading, and the complementary roles of skills and MCP; (ii) skill acquisition, covering reinforcement learning with skill libraries, autonomous skill discovery (SEAgent), and compositional skill synthesis; (iii) deployment at scale, including the computer-use agent (CUA) stack, GUI grounding advances, and benchmark progress on OSWorld and SWE-bench; and (iv) security, where recent empirical analyses reveal that 26.1% of community-contributed skills contain vulnerabilities, motivating our proposed Skill Trust and Lifecycle Governance Framework -- a four-tier, gate-based permission model that maps skill provenance to graduated deployment capabilities. We identify seven open challenges -- from cross-platform skill portability to capability-based permission models -- and propose a research agenda for realizing trustworthy, self-improving skill ecosystems. Unlike prior surveys that broadly cover LLM agents or tool use, this work focuses specifically on the emerging skill abstraction layer and its implications for the next generation of agentic systems. Project repo: https://github.com/scienceaix/agentskills

Rewrite: Title: Agent Skills for Large Language Models: Architecture, Acquisition, Security, and the Path Forward

Original: arXiv:2602.12430v4 Announce Type: replace-cross Abstract: The shift from monolithic language models to modular, skill-equipped agents represents a pivotal change in the practical deployment of large language models (LLMs). Instead of embedding all procedural knowledge directly into model weights, agent skills—modular bundles of instructions, code, and resources loaded on demand—allow for dynamic capability expansion without the need for retraining. This approach is structured around progressive disclosure, portable skill definitions, and integration with the Model Context Protocol (MCP). This survey offers a thorough examination of the rapidly evolving agent skills landscape. We structure the discussion across four key dimensions: (i) architectural foundations, focusing on the SKILL$.$md specification, progressive context loading, and the synergistic relationship between skills and MCP; (ii) skill acquisition, encompassing reinforcement learning with skill libraries, autonomous skill discovery via SEAgent, and compositional skill synthesis; (iii) large-scale deployment, highlighting the computer-use agent (CUA) stack, advancements in GUI grounding, and performance on benchmarks like OSWorld and SWE-bench; and (iv) security, where empirical studies indicate that 26.1% of community-contributed skills harbor vulnerabilities. This finding underscores the need for our proposed Skill Trust and Lifecycle Governance Framework—a four-tier, gate-based permission system that aligns skill provenance with graduated deployment permissions. We outline seven open challenges, ranging from cross-platform skill portability to capability-based permission models, and present a research agenda aimed at fostering trustworthy, self-improving skill ecosystems. Distinct from earlier surveys that broadly address LLM agents or tool use, this study zeroes in on the emerging skill abstraction layer and its impact on future agentic systems. Project repo: https://github.com/scienceaix/agentskills


Source: arXiv Generated at: 2026-06-03 00:00:00 UTC

Related Articles

TikTok Billionaire Tops Ambani as Asia’s Second-Richest
Bloomberg

TikTok Billionaire Tops Ambani as Asia’s Second-Richest

TikTok founder surpasses Mukesh Ambani to become Asia’s second-richest person, marking a significant shift in the region...

Publishers in UK can opt out of Google AI search results
BBC News

Publishers in UK can opt out of Google AI search results

UK publishers can now opt out of Google’s AI search summaries, a CMA ruling designed to boost their bargaining power and...

Kioxia Edges Nearer Toyota’s Market Cap in Shakeup to Japan Inc.
Bloomberg

Kioxia Edges Nearer Toyota’s Market Cap in Shakeup to Japan Inc.

Kioxia’s market cap nears Toyota’s, signaling a major shift in Japan’s corporate hierarchy. This narrowing gap highlight...

Reuters

Morning Bid: Marvell, a fitting name for the latest AI darling

Reuters highlights Marvell as a top AI stock, noting its name perfectly suits its status as the newest market darling.

Financial Times

Tim Hayward: I built the Jaguar E-Type of computer keyboards

Tim Hayward compares his bespoke keyboard designs to the Jaguar E-Type. He explores high-end customization for personal ...

Financial Times

AI Labs: Zuckerberg’s $100bn gamble

Meta’s $100 billion AI investment aims to secure AI dominance, but questions remain whether sheer spending can outpace c...