arXiv

Interpretable Multimodal Gesture Recognition for Drone and Mobile Robot Teleoperation via Log-Likelihood Ratio Fusion

Title: Interpretable Multimodal Gesture Recognition for Drone and Mobile Robot Teleoperation via Log-Likelihood Ratio Fusion

Abstract:

In hazardous settings like industrial plants and disaster sites, the intuitive and dependable remote control of Unmanned Aerial Vehicles (UAVs) and mobile robots is critical, as human operators often remain in unsafe conditions. Hands-free teleoperation addresses this challenge by boosting operator mobility and situational awareness, ultimately enhancing safety. Although vision-based gesture recognition has been investigated for hands-free control, its reliability frequently drops due to lighting changes, background clutter, and occlusions, restricting its practical use in real-world scenarios.

To address these vulnerabilities, we present a multimodal gesture recognition system that combines inertial measurement data—specifically accelerometer, gyroscope, and orientation readings—from Apple Watches worn on both wrists, with capacitive sensing signals captured by specialized gloves. Our approach employs a late fusion mechanism grounded in the log-likelihood ratio (LLR). This strategy not only boosts recognition accuracy but also offers interpretability by measuring the specific contribution of each data modality.

To facilitate this study, we developed a novel dataset featuring 20 unique gestures modeled after aircraft marshalling signals, which includes synchronized recordings of RGB video, IMU data, and capacitive sensor inputs. Our experiments show that the proposed framework matches the performance of current state-of-the-art vision-based baselines. However, it does so with substantially lower computational demands, a smaller model footprint, and reduced training times, rendering it ideal for real-time robot control. Consequently, we highlight the promise of sensor-based multimodal fusion as a robust and interpretable approach for gesture-driven teleoperation of drones and mobile robots.


Source: arXiv Generated at: 2026-06-02 00:00:00 UTC

Related Articles

Advantech's Tsai on Nvidia Collaboration, AI Strategy
Bloomberg

Advantech's Tsai on Nvidia Collaboration, AI Strategy

Advantech's Tsai discusses the Nvidia partnership and AI strategy.

SK Hynix to Double Wafer Capacity to Ease Memory Chip Crunch
Bloomberg

SK Hynix to Double Wafer Capacity to Ease Memory Chip Crunch

SK Hynix plans to double its wafer capacity to alleviate the ongoing global memory chip shortage. This expansion aims to...

AI Productivity Boost Is Overhyped | 3-Minute MLIV
Bloomberg

AI Productivity Boost Is Overhyped | 3-Minute MLIV

The video argues that AI’s productivity boost is overhyped, challenging the assumption that it will significantly enhanc...

Intel's Lip-Bu Tan on Agentic AI & Partner Networks
Bloomberg

Intel's Lip-Bu Tan on Agentic AI & Partner Networks

Intel’s Lip-Bu Tan discusses Agentic AI and the vital role of partner networks in driving innovation.

Haas Says Arm May Hit $15 Billion AI Chip Revenue Goal Early
Bloomberg

Haas Says Arm May Hit $15 Billion AI Chip Revenue Goal Early

Haas suggests Arm may achieve its $15 billion AI chip revenue target sooner than expected. This indicates strong market ...

Arm May Hit $15 Billion AI Chip Revenue Goal Early, CEO Says
Bloomberg

Arm May Hit $15 Billion AI Chip Revenue Goal Early, CEO Says

Arm’s CEO predicts the company could hit its $15 billion AI chip revenue target ahead of schedule. This optimistic outlo...