arXiv

Mixed-Modality Dual Face-Hair Retrieval

Title: Mixed-Modality Dual Face-Hair Retrieval

Abstract:

This paper presents Dual Face-Hair Retrieval (DFHR), a novel image retrieval task that operates as a mixed-modality dual-reference system. In this framework, a query is defined by two inputs: a face image that establishes identity and a hairstyle reference provided either as an image or as text. DFHR diverges from previous retrieval paradigms by necessitating cross-component reasoning between two semantically distinct attributes—identity and hairstyle—that stem from heterogeneous modalities. To address this complexity, the formulation requires localized feature disentanglement, alignment of semantics across modalities, and the composition of mixed modalities within a single embedding space.

To support this new task, we introduce DFHR-Bench, the inaugural benchmark for mixed-modality face-hair retrieval. This dataset contains more than 180,000 annotated triplets covering both dual-image and image-text scenarios. The data was generated using a multi-stage annotation protocol designed to preserve both semantic accuracy and identity integrity. Additionally, we propose MFHC (Multimodal Face-Hair Combiner), a comprehensive framework that integrates disentangled identity and hairstyle embeddings via token injection and multi-view supervision. Together, DFHR and DFHR-Bench define a new standard for visual retrieval that is both identity-aware and capable of attribute control across different modalities.


Source: arXiv Generated at: 2026-06-03 00:00:00 UTC

Related Articles

TechCrunch

The world’s largest privately owned laser just turned on

Xcimer Energy activated the Phoenix laser, the world’s largest privately owned laser, aiming to commercialize fusion pow...

Uber Targets Doubling Its Fleet of Electric Motorcycles in Kenya
Bloomberg

Uber Targets Doubling Its Fleet of Electric Motorcycles in Kenya

Uber plans to double its electric motorcycle fleet in Kenya. This expansion aims to enhance sustainable transport option...

AI Saves Time But Most Companies Waste the Gain, Study Shows
Bloomberg

AI Saves Time But Most Companies Waste the Gain, Study Shows

A study reveals that while AI saves employee time, most companies fail to capitalize on these gains, squandering potenti...

JPMorgan Lifts S&P Target on Earnings 'Supercycle'
Bloomberg

JPMorgan Lifts S&P Target on Earnings 'Supercycle'

JPMorgan raised its S&P 500 target, citing an earnings “supercycle” that reflects heightened confidence in corporate pro...

Europe Sleepwalking Into Economic Ruin, Serb Leader Says
Bloomberg

Europe Sleepwalking Into Economic Ruin, Serb Leader Says

Serbian leader warns Europe is sleepwalking into economic ruin.

Delta Electronics Flags Power Crunch
Bloomberg

Delta Electronics Flags Power Crunch

Delta Electronics warns of a looming power deficit due to surging demand and constrained production, predicting serious ...