TextFake: Benchmarking AI-Generated Image Detection on Text-Rich Images
Title: TextFake: Evaluating the Detection of AI-Generated Images in Text-Heavy Contexts
Abstract
While current detectors for AI-generated images (AIGI) demonstrate strong performance on standard natural-image datasets, their effectiveness against text-rich forgeries—such as fake screenshots, documents, and news pages that are common in misinformation campaigns—has not been adequately assessed. To address this gap, we present TextFake, a new benchmark comprising 20,000 images designed for text-rich AIGI detection. This dataset covers 28 languages, four topic categories, and two scene modalities.
Our methodology for creating fake images involves a four-stage pipeline. We begin by annotating real images across three controlled dimensions, then generate corresponding fake versions using structured prompting that aligns with the underlying data distribution. This approach effectively eliminates covariate shortcuts.
When we conducted zero-shot evaluations on 14 specialized detectors and three state-of-the-art Vision-Language Model (VLM) APIs, we observed a significant systematic performance gap. No model achieved accuracy above 80%, and several saw their performance drop by more than 60% compared to their results on natural-image benchmarks. Our diagnostic analysis highlights three primary failure modes:
- The Text Density Curse: High concentrations of glyphs overwhelm detectors that rely on low-level features.
- Cloaking via Rendering Fidelity: High-quality text rendering masks generative artifacts, making them harder to detect.
- Threshold Collapse: Routine perturbations cause detectors to revert to chance-level performance.
Source: arXiv Generated at: 2026-06-02 00:00:00 UTC





