arXiv

Heterogeneous Decentralized Diffusion Models

Abstract:

Training frontier-scale diffusion models typically demands immense computational power and tightly coupled clusters, which confines participation to well-funded institutions. Although Decentralized Diffusion Models (DDM) allow multiple experts to be trained in isolation, current methods are constrained by homogeneous training objectives and high resource costs, such as the 1176 GPU-days required by existing approaches. We introduce a streamlined framework that significantly lowers these barriers while accommodating heterogeneous training objectives. Our solution integrates three primary innovations: first, a decentralized training paradigm that permits experts to use distinct objectives (specifically DDPM and Flow Matching), which are unified during inference without retraining; second, a method for converting pretrained ImageNet-DDPM checkpoints to the Flow Matching objective, which accelerates convergence and removes the need for objective-specific pretraining; and third, the adoption of PixArt-$\alpha$’s efficient AdaLN-Single architecture, which lowers the parameter count without compromising output quality. Evaluations on the LAION-Aesthetics dataset show that, relative to the training scale of previous DDM studies, our method reduces compute requirements 16-fold and data usage 14-fold. Furthermore, under consistent inference conditions, our heterogeneous setup outperforms the homogeneous baseline in both FID and intra-prompt diversity. By removing synchronization dependencies and supporting a mix of DDPM and Flow Matching objectives, our framework democratizes access to decentralized generative model training, allowing contributors with a single GPU with just 24–48GB of VRAM to participate.
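The abstract does not spell out how DDPM and Flow Matching experts are unified at inference. One plausible reading, offered here only as a minimal sketch and not as the authors' implementation, is to map each noise prediction into a velocity on a shared interpolation path and then average the per-expert velocities. The linear path `x_t = (1 - t) * x0 + t * eps`, the fixed mixing weights, and the function names `eps_to_velocity` and `combine_velocities` are all assumptions made for illustration.

```python
def eps_to_velocity(x_t, eps_pred, alpha, sigma, d_alpha, d_sigma):
    """Convert a noise prediction into a velocity for the path
    x_t = alpha * x0 + sigma * eps (schedule values are assumed inputs)."""
    velocity = []
    for x, e in zip(x_t, eps_pred):
        x0_hat = (x - sigma * e) / alpha                   # clean sample implied by eps
        velocity.append(d_alpha * x0_hat + d_sigma * e)    # d/dt of the path
    return velocity


def combine_velocities(velocities, weights):
    """Weighted average of per-expert velocities (hypothetical mixing rule)."""
    total = sum(weights)
    dim = len(velocities[0])
    return [
        sum(w * v[i] for w, v in zip(weights, velocities)) / total
        for i in range(dim)
    ]


# Toy check on the assumed linear path x_t = (1 - t) * x0 + t * eps.
t = 0.25
x0 = [1.0, -2.0]
eps = [0.5, 0.5]
x_t = [(1 - t) * a + t * b for a, b in zip(x0, eps)]

# A perfect DDPM-style expert returns eps; convert its output to a velocity.
v_from_ddpm = eps_to_velocity(
    x_t, eps, alpha=1 - t, sigma=t, d_alpha=-1.0, d_sigma=1.0
)
# A perfect flow expert on the same path returns eps - x0 directly.
v_from_flow = [b - a for a, b in zip(x0, eps)]

# Illustrative weights; the source does not specify how experts are weighted.
v_mixed = combine_velocities([v_from_ddpm, v_from_flow], weights=[0.3, 0.7])
```

In this toy setting both experts are exact, so the converted DDPM velocity matches the flow velocity and the mixture equals either one. With real experts the two would differ, and the choice of weights and noise schedule would matter.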


Source: arXiv Generated at: 2026-06-02 00:00:00 UTC
