arXiv

SUSD: Structured Unsupervised Skill Discovery through State Factorization

Title: SUSD: Structured Unsupervised Skill Discovery through State Factorization

Abstract:

Unsupervised Skill Discovery (USD) seeks to autonomously acquire a varied repertoire of skills without the need for external reward signals. While maximizing Mutual Information (MI) between latent skill variables and environmental states is a prevalent USD strategy, these methods often converge on simple, static behaviors due to their invariance characteristics. This tendency restricts the identification of dynamic, task-critical actions. Although Distance-Maximizing Skill Discovery (DSD) attempts to foster more dynamic capabilities by utilizing state-space distances, it still fails to ensure a comprehensive skill set that addresses all controllable factors or entities within the environment.

To address these limitations, we propose SUSD, a new framework that exploits the compositional nature of environments by decomposing the state space into independent components, such as distinct objects or controllable entities. By assigning separate skill variables to each factor, SUSD allows for more granular control over the skill discovery process. Furthermore, a dynamic model monitors progress across these factors, directing the agent’s attention toward areas that have been less explored. This structured methodology not only encourages the emergence of richer, more diverse skills but also produces a factorized skill representation. This representation supports fine-grained, disentangled control over individual entities, thereby facilitating efficient training for compositional downstream tasks through Hierarchical Reinforcement Learning (HRL).

Our experiments, conducted across three environments featuring between one and ten factors, show that SUSD can identify diverse and complex skills without supervision. The results indicate that our method significantly surpasses current unsupervised skill discovery techniques, particularly in complex and factorized settings. The code for this work is publicly accessible at: https://github.com/hadi-hosseini/SUSD.


Source: arXiv Generated at: 2026-06-04 00:00:00 UTC

Related Articles

Glazer Family Members Said to Study Manchester United Stake Sale
Bloomberg

Glazer Family Members Said to Study Manchester United Stake Sale

Reports indicate the Glazer family is evaluating a potential sale of their Manchester United stake, with family members ...

Ares' Blair Jacbobson: Disconnect Over Private Credit Headlines
Bloomberg

Ares' Blair Jacbobson: Disconnect Over Private Credit Headlines

Ares’ Blair Jacobson argues that private credit headlines misrepresent reality, highlighting a disconnect between media ...

Nvidia-Backed Robotics Startup Generalist AI Valued at $2 Billion
Bloomberg

Nvidia-Backed Robotics Startup Generalist AI Valued at $2 Billion

Nvidia-backed robotics startup Generalist AI has reached a $2 billion valuation. Founders Pete Florence, Andy Zeng, and ...

TechCrunch

Oura Ring 5 review: Thinner, lighter, better

The Oura Ring 5 is 40% smaller and lighter than its predecessor, offering superior comfort and a discreet, jewelry-like ...

Financial Times

How AI has de-skilled translation

AI fragments specialist translation into routine tasks, effectively de-skilling the profession. This shift reduces compl...

Zurich Insurance Expands Data-Center Offering Beyond the US
Bloomberg

Zurich Insurance Expands Data-Center Offering Beyond the US

Zurich Insurance Group is expanding its data center insurance products internationally, extending coverage beyond the Un...