arXiv

GGT-100K: Generative Ground Truth for Generalizable Real-World Image Restoration

Title: GGT-100K: Establishing Generative Ground Truth for Robust Real-World Image Restoration

The field of real-world image restoration (IR) is currently hindered by a critical shortage of high-quality paired training data. While synthetic datasets are plentiful, they frequently fall short in accurately simulating actual real-world degradations. Conversely, acquiring real-world paired datasets is both costly and technically challenging. Consequently, IR models developed using these existing resources often exhibit poor generalization when deployed in real-world contexts.

To address this challenge, this study introduces Generative Ground Truth (GGT), a method that leverages generative multimodal foundation models (MFMs) to generate high-quality (HQ) targets directly from real-world low-quality (LQ) images. The research begins with a comprehensive assessment of nine leading state-of-the-art MFMs, such as GPT-Image-2 and Nano-Banana-2, across various scenes and degradation types. The findings indicate that Nano-Banana-2, particularly when utilizing VLM-based adaptive prompting, possesses the superior ability to create HQ targets that are both perceptually realistic and faithful to the original content. These generated targets effectively serve as the GGT for their corresponding LQ inputs.

Building on this finding, the authors utilize Nano-Banana-2 to develop a GGT synthesis pipeline featuring multi-stage quality control mechanisms to guarantee data reliability. This process yielded GGT-100K, a substantial paired dataset containing 103,707 LQ-HQ training pairs. This dataset encompasses a wide variety of scenes and complex real-world degradation patterns. Additionally, a test set comprising 500 image pairs was created for evaluation.

Extensive experimental results demonstrate that GGT-100K consistently enhances the real-world generalization capabilities of numerous IR models. The benefits are especially pronounced when fine-tuning generative models for IR tasks. These outcomes suggest that MFMs are viable tools for generating restoration-oriented data, and that GGT-100K serves as a valuable resource for extending the generalization limits of real-world IR models.


Source: arXiv Generated at: 2026-06-04 00:00:00 UTC

Related Articles

TechCrunch

A burglar used a Waymo to steal yoga clothes in San Francisco — and got away with it

A thief stole yoga clothes using a Waymo, but police failed to catch them because the car’s video data was deleted and b...

Goldman Sachs CEO David Solomon on the Coming Mega IPOs
Bloomberg

Goldman Sachs CEO David Solomon on the Coming Mega IPOs

Goldman Sachs CEO David Solomon anticipates a surge in major IPOs, signaling renewed market confidence and significant o...

What Are A.I. Agents Actually Doing?
New York Times

What Are A.I. Agents Actually Doing?

Arena research shows tech professionals are most likely to use AI agents at work, highlighting a strong industry trend i...

TechCrunch

Cash App launches a wand for tap-and-pay

Cash App launched a $25 NFC "Magic Wand" for tap-and-pay, blending viral novelty with practical contactless payments. It...

Databricks CEO Plans to Avoid IPO During Year of Huge Offerings
Bloomberg

Databricks CEO Plans to Avoid IPO During Year of Huge Offerings

Databricks CEO plans to avoid an IPO in 2021, despite a surge in public offerings. This contrasts with earlier reports t...

TechCrunch

Waymo’s spent robotaxi batteries will be used as grid storage

Waymo partners with B2U to repurpose retired robotaxi batteries for grid storage in California and Texas, aligning with ...