arXiv

T2AV-Compass: Towards Unified Evaluation for Text-to-Audio-Video Generation

Title: T2AV-Compass: Towards Unified Evaluation for Text-to-Audio-Video Generation

Abstract:

The field of Text-to-Audio-Video (T2AV) generation seeks to produce videos that are temporally coherent and accompanied by audio that is semantically aligned, all derived from natural language inputs. However, assessing these systems remains a fragmented endeavor, typically depending on isolated unimodal metrics or limited benchmarks that overlook critical aspects such as cross-modal alignment, adherence to instructions, and perceptual fidelity when handling intricate prompts. To bridge this gap, we introduce T2AV-Compass, a comprehensive benchmark designed for the holistic evaluation of T2AV models. This benchmark comprises 500 varied and complex prompts, developed through a taxonomy-driven pipeline to guarantee both semantic depth and physical plausibility.

T2AV-Compass employs a dual-tier evaluation framework. This approach combines objective signal-level metrics—covering video quality, audio quality, and cross-modal synchronization—with a subjective "MLLM-as-a-Judge" protocol to assess instruction following and overall realism. Our extensive testing of 11 prominent T2AV systems demonstrates that even the most advanced models significantly lag behind human-level standards in terms of realism and cross-modal consistency. Persistent issues were observed in areas such as audio authenticity, fine-grained synchronization, and prompt adherence. These findings underscore the substantial potential for future improvements and position T2AV-Compass as a rigorous diagnostic platform essential for advancing the state of text-to-audio-video generation.


Source: arXiv Generated at: 2026-06-03 00:00:00 UTC

Related Articles

23andMe Is Back as Nonprofit Aiming to Reach 100 Million Users
Bloomberg

23andMe Is Back as Nonprofit Aiming to Reach 100 Million Users

23andMe has transitioned into a nonprofit, aiming to onboard 100 million users to democratize genetic access and advance...

Trump Officials Held Millions of Dollars of SpaceX Ahead of IPO
Bloomberg

Trump Officials Held Millions of Dollars of SpaceX Ahead of IPO

Reports indicate Trump administration officials withheld millions in SpaceX payments ahead of its IPO. The delay occurre...

AI Jitters Fuel Biggest Swings in India’s IT Stocks Since 2020
Bloomberg

AI Jitters Fuel Biggest Swings in India’s IT Stocks Since 2020

AI uncertainty is driving the largest volatility in Indian IT stocks since 2020, causing significant market swings.

SpaceX IPO Terms Due & Trump's New Tariffs | The Pulse 6/3/2026
Bloomberg

SpaceX IPO Terms Due & Trump's New Tariffs | The Pulse 6/3/2026

Spacecraft giant SpaceX nears finalizing its IPO structure, while former President Trump announces new tariffs, reshapin...

News Publishers Weigh Whether AI is Industry Killer or Savior
Bloomberg

News Publishers Weigh Whether AI is Industry Killer or Savior

NYT shares fell after missing financial forecasts, following a tech staff strike. This occurs amid industry debates on A...

Reuters

When IPOs go wrong: SpaceX, AI firms face a delicate process

SpaceX and AI firms face a delicate IPO process amid complex markets. Their transition to public offerings is fraught wi...