arXiv

Pixel Cube: Diffusion-based Portrait Video Relighting Through Realistic Lighting Reproduction

Abstract:

This paper introduces a novel diffusion-based approach for relighting dynamic portrait videos that delivers both photorealistic quality and temporal stability. The method is trained on a hybrid dataset of real captured and computationally rendered dynamic portrait videos, covering a wide variety of subject appearances, facial movements, head orientations, and explicit lighting scenarios. To support this, we engineered a specialized LED-based lighting rig that emulates realistic lighting conditions while enabling high-speed acquisition of relighting data.

By capitalizing on the inherent image priors within pre-trained video diffusion models, we developed a high-performance generative framework. This framework utilizes per-frame high dynamic range (HDR) environment maps as primary lighting controls to achieve realistic, identity-preserving relighting. Furthermore, the model incorporates a synthesized background image, providing additional control over the camera’s color tone and exposure levels. The resulting output is a temporally consistent relit video that appears natural and harmonious within the new lighting environment. Crucially, the system faithfully retains the subject’s expressions and intricate facial details, such as skin texture, wrinkles, and facial hair.

The model generalizes well to unseen data, handling variations in subject appearance, motion patterns, and lighting conditions. We conducted extensive experiments relighting in-the-wild videos under diverse environment maps and showcased practical applications in portrait photography. Our results indicate that the method achieves state-of-the-art photorealism, lighting harmony, and temporal consistency.


Source: arXiv Generated at: 2026-06-03 00:00:00 UTC
