Pinterest Canvas: Large-Scale Image Generation at Pinterest

Abstract: Although contemporary image generation models exhibit impressive versatility across numerous tasks, this broad flexibility often complicates control through prompting or basic inference adjustments. Consequently, these general-purpose models are frequently ill-suited for applications demanding strict product standards. To address this challenge, we present Pinterest Canvas, a robust large-scale image generation infrastructure designed specifically to facilitate image editing and enhancement within the Pinterest ecosystem.

Our approach begins by training a foundational diffusion model on a heterogeneous, multimodal dataset, endowing it with extensive image-editing capabilities. Rather than attempting to force a single generic model to manage every downstream objective, we employ a strategy of rapidly fine-tuning specialized variants of this base architecture on task-specific datasets. This method yields dedicated models tailored to individual use cases. This paper outlines the core components of the Canvas system and shares our established best practices regarding dataset curation, training protocols, and inference procedures.

Through case studies on background enhancement and aspect-ratio outpainting, we illustrate how the system addresses distinct product requirements. Online A/B tests show that these enhancements lift user engagement by 18.0% and 12.5%, respectively. Evaluations by human raters further indicate that our models outperform third-party alternatives on these tasks. Finally, we showcase other Canvas variants, such as multi-image scene synthesis and image-to-video generation, demonstrating that our methodology generalizes across a wide range of downstream applications.


Source: arXiv Generated at: 2026-06-02 00:00:00 UTC
