arXiv

When Seeing Is Not Believing -- A Benchmark for Search-Grounded Video Misinformation Detection

Title: Visual Evidence Is Not Enough: A New Benchmark for Detecting Search-Grounded Video Misinformation

Video-based disinformation has evolved beyond simple fabrication and now operates at the level of semantics and evidentiary context. Authentic clips are increasingly manipulated through selective editing, temporal reordering, cross-source splicing, or the insertion of AI-generated elements to construct misleading narratives. Because these manipulations depend on evidence that is missing, reordered, or recontextualized, and that evidence exists outside the video file itself, such content cannot be verified by analyzing the input video in isolation.

To address this challenge, we present EVID-Bench, a benchmark for search-grounded video misinformation detection. In this setting, systems must search the open web for related footage and identify inaccuracies by comparing multiple video sources. The benchmark includes 222 videos that exhibit nine distinct types of manipulation across three primary categories: AI generation, single-source editing, and multi-source editing. Crucially, every sample has been confirmed to be undetectable by current frontier models when those models rely solely on visual inspection.

We tested nine leading multimodal models using a retrieval-augmented verification baseline. The results indicate significant limitations: the top-performing system attained only 61.43% accuracy at the point level and 43.24% at the video level. Manipulations involving AI generation proved particularly difficult to detect. Our error analysis highlights persistent issues: models focus on irrelevant cues, incorrectly attribute synthetic artifacts to editorial cuts, and stop searching before they have established the nature of the manipulation.
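The gap between point-level and video-level accuracy can be illustrated with a small sketch. The summary above does not define either metric, so the code assumes (hypothetically) that each video contains one or more annotated manipulation points, that point-level accuracy is the fraction of points judged correctly, and that a video counts as correct only when all of its points are judged correctly. The function name, the input format, and the toy data are illustrative and do not come from the benchmark.

```python
from collections import defaultdict


def point_and_video_accuracy(judgments):
    """Compute (point_accuracy, video_accuracy) from per-point judgments.

    judgments: iterable of (video_id, is_correct) pairs, one per annotated point.
    Assumption (not from the paper): a video is correct only if every one of
    its points is judged correctly.
    """
    per_video = defaultdict(list)
    for video_id, is_correct in judgments:
        per_video[video_id].append(bool(is_correct))

    total_points = sum(len(points) for points in per_video.values())
    if total_points == 0:
        raise ValueError("no judgments given")

    # Point level: every annotated point counts equally.
    correct_points = sum(sum(points) for points in per_video.values())
    point_acc = correct_points / total_points

    # Video level: one mistake anywhere in a video makes the whole video wrong.
    correct_videos = sum(all(points) for points in per_video.values())
    video_acc = correct_videos / len(per_video)
    return point_acc, video_acc


# Hypothetical toy data: three videos with five annotated points in total.
toy = [("v1", True), ("v1", True), ("v2", True), ("v2", False), ("v3", False)]
point_acc, video_acc = point_and_video_accuracy(toy)
print(f"point-level: {point_acc:.2%}, video-level: {video_acc:.2%}")
```

Under this assumed definition, video-level accuracy can never exceed what a system achieves on its weakest point in each video, which is one plausible reason the video-level figure reported above is lower than the point-level one.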


Source: arXiv
Generated at: 2026-06-04 00:00:00 UTC
