arXiv

Beyond Pixel Histories: World Models with Persistent 3D State

Title: Moving Past Pixel-Based Histories: World Models with Enduring 3D States

Abstract: Interactive world models facilitate open-ended video generation by dynamically responding to user inputs. Yet, most current approaches fail to incorporate an explicit 3D environmental representation. Consequently, these models must implicitly deduce 3D consistency from data, while their spatial memory is confined to short temporal windows. This limitation leads to unnatural interactions and hinders applications like agent training. To overcome these challenges, we introduce PERSIST, a novel world model framework that simulates the progression of a latent 3D scene, encompassing the environment, camera movements, and rendering processes. This approach enables the synthesis of new frames endowed with persistent spatial memory and geometric consistency. Our evaluation, comprising both quantitative metrics and a qualitative user study, reveals significant enhancements in spatial memory, 3D consistency, and long-horizon stability compared to existing methods, thereby supporting the creation of coherent, evolving 3D worlds. Additionally, we showcase new functionalities, such as generating varied 3D environments from a single image and allowing for precise, geometry-aware control through direct 3D space editing and specification. Project page: https://francelico.github.io/persist.github.io


Source: arXiv Generated at: 2026-06-04 00:00:00 UTC

Related Articles

Reuters

Foxconn announces strategic collaboration with Intel on next-gen AI infrastructure

Foxconn and Intel announced a strategic partnership to develop next-generation AI infrastructure. This collaboration aim...

SpaceX Seeks to Raise $75 Billion in Record IPO (Video)
Bloomberg

SpaceX Seeks to Raise $75 Billion in Record IPO (Video)

SpaceX aims for a record $75 billion valuation through an initial public offering. This historic IPO marks a significant...

Broadcom AI Chip Outlook Disappoints Investors
Bloomberg

Broadcom AI Chip Outlook Disappoints Investors

Broadcom’s AI chip projections disappointed investors, dampening market sentiment. The outlook fell short of expectation...

Reuters

Europe's tech 'liberation day'? Computer says not yet

Europe’s expected tech breakthrough remains unrealized, as current systems indicate that a true "liberation day" has not...

Hiranandani Group CEO on Powering India's Digital Future
Bloomberg

Hiranandani Group CEO on Powering India's Digital Future

Hiranandani Group CEO discusses driving India's digital transformation.

Cerebras Says It’s Working With All AI Gear Makers Except Nvidia
Bloomberg

Cerebras Says It’s Working With All AI Gear Makers Except Nvidia

Cerebras confirmed partnerships with all major AI hardware vendors except Nvidia. This broad engagement positions Cerebr...