Global News Digest

arXiv

StressDream: Steering Video World Models for Robust Policy Evaluation and Improvement

Title: StressDream: Directing Video World Models for Resilient Policy Assessment and Enhancement

Abstract:

Video world models (WMs) have emerged as a promising tool for evaluating and enhancing policies by generating realistic future observations based on an ego-robot’s actions. Although WMs are capable of modeling distributions over potential futures, standard policy evaluation and improvement strategies often depend on nominal imaginations. This reliance can cause critical, high-impact consequences of robot actions to be overlooked unless an impractically large number of samples are generated. To facilitate robust policy evaluation and improvement through WM simulations, we introduce StressDream. This method guides imaginations toward high-impact but plausible outcomes—defined at inference time—by optimizing the initial noise inputs of diffusion-based WMs.

Optimizing high-dimensional noise presents significant challenges, as the process must account for subtle, scene-specific target events within generated videos while preventing the noise from drifting into out-of-distribution (OOD) territory, which results in unrealistic simulations. We tackle these issues using two complementary objectives: a semantic objective, which leverages a Vision-Language Model to provide informative gradients by analyzing the generated video content, and a plausibility objective, which ensures the optimized noise remains within realistic bounds. Utilizing state-of-the-art video world models for autonomous driving and robotic manipulation, we demonstrate that StressDream successfully directs simulations toward high-impact, plausible outcomes specified via text at inference time, such as task failures. This capability enables robust policy evaluation and improvement by pinpointing actions whose plausible futures involve undesirable results. Video demonstrations are accessible at https://junwon.me/StressDream/.


Source: arXiv Generated at: 2026-06-02 00:00:00 UTC

Related Articles

Schroders Renewable Unit Targets AI Assets as Power Demand Soars
Bloomberg

Schroders Renewable Unit Targets AI Assets as Power Demand Soars

Schroders’ renewable unit targets AI infrastructure, pivoting to meet soaring energy demand from artificial intelligence...

State Street's Paglia on SBI Group Partnership, ETFs
Bloomberg

State Street's Paglia on SBI Group Partnership, ETFs

State Street's Paglia discusses the SBI Group partnership and ETFs, but the source text is missing. Please provide the a...

Nvidia Boss Says Workers Should Be Paid ‘as Much as Possible’
Bloomberg

Nvidia Boss Says Workers Should Be Paid ‘as Much as Possible’

Nvidia CEO Jensen Huang advocates for paying workers “as much as possible,” emphasizing maximum compensation. This stanc...

TSE Talking With Regulator For Easing ETF Listing Rules
Bloomberg

TSE Talking With Regulator For Easing ETF Listing Rules

The Tokyo Stock Exchange is discussing with regulators to ease ETF listing rules. This aims to simplify market access an...

S&P DJI CEO on Japan Markets, Mega IPOs
Bloomberg

S&P DJI CEO on Japan Markets, Mega IPOs

S&P DJI CEO discusses Japan's financial markets and major IPOs.