arXiv

ZeroWBC: Learning Natural Whole-Body Humanoid Interaction from Human Egocentric Data

Title: ZeroWBC: Learning Natural Whole-Body Humanoid Interaction from Human Egocentric Data

Abstract:

Controlling humanoid robots for versatile and natural whole-body interactions remains a significant challenge, largely due to the prohibitive costs associated with collecting whole-body teleoperation data. To address this, we introduce ZeroWBC, a novel framework that eliminates the need for teleoperation by learning interaction capabilities directly from human egocentric videos. These videos are paired with synchronized whole-body motion capture and text annotations.

ZeroWBC utilizes a "generation-then-tracking" approach to manage whole-body interactions within static scenes. The process begins with an initial egocentric image and a language instruction. A fine-tuned Vision-Language Model then predicts future human whole-body motion tokens. These tokens are decoded into continuous motion sequences and subsequently retargeted to the humanoid robot. The system then employs a general interactive motion tracking policy to execute these reference motions, alongside specific root and key body-part trajectories.

To enhance interaction quality, we propose an interaction-oriented tracking reward. This reward mechanism emphasizes the alignment of global root and key body-part trajectories while ensuring the preservation of natural whole-body dynamics. Evaluations conducted on the Unitree G1 humanoid robot demonstrate that ZeroWBC facilitates diverse, scene-aware behaviors without requiring any robot teleoperation demonstrations. These findings indicate a scalable new paradigm for teaching natural whole-body interaction to humanoids using human egocentric data.


Source: arXiv Generated at: 2026-06-04 00:00:00 UTC

Related Articles

TechCrunch

A burglar used a Waymo to steal yoga clothes in San Francisco — and got away with it

A thief stole yoga clothes using a Waymo, but police failed to catch them because the car’s video data was deleted and b...

Goldman Sachs CEO David Solomon on the Coming Mega IPOs
Bloomberg

Goldman Sachs CEO David Solomon on the Coming Mega IPOs

Goldman Sachs CEO David Solomon anticipates a surge in major IPOs, signaling renewed market confidence and significant o...

What Are A.I. Agents Actually Doing?
New York Times

What Are A.I. Agents Actually Doing?

Arena research shows tech professionals are most likely to use AI agents at work, highlighting a strong industry trend i...

TechCrunch

Cash App launches a wand for tap-and-pay

Cash App launched a $25 NFC "Magic Wand" for tap-and-pay, blending viral novelty with practical contactless payments. It...

Databricks CEO Plans to Avoid IPO During Year of Huge Offerings
Bloomberg

Databricks CEO Plans to Avoid IPO During Year of Huge Offerings

Databricks CEO plans to avoid an IPO in 2021, despite a surge in public offerings. This contrasts with earlier reports t...

TechCrunch

Waymo’s spent robotaxi batteries will be used as grid storage

Waymo partners with B2U to repurpose retired robotaxi batteries for grid storage in California and Texas, aligning with ...