arXiv

Family Matters: A Systematic Study of Spatial vs. Frequency Masking for Continual Test-Time Adaptation

Title: Family Matters: A Systematic Study of Spatial vs. Frequency Masking for Continual Test-Time Adaptation

Abstract:

While recent approaches to continual test-time adaptation (CTTA) utilize masked image modeling to mitigate learning instability caused by distribution shifts, they typically treat the masking family ($F$) as a static design choice. Consequently, innovation has been concentrated exclusively on the selection strategy ($S$), leaving the family dimension largely underexplored. This paper presents a systematic empirical investigation designed to isolate the impact of this axis. We employ a controlled CTTA instantiation, Mask to Adapt (M2A), which standardizes the selection strategy to random sampling and employs standard loss functions. Within this framework, we vary only the masking family—comparing spatial approaches (pixel, patch) against frequency-based methods (all-band, low-band, high-band)—while holding all other components constant.

Our findings yield specific design guidance for CTTA contexts:

  1. The masking family dictates whether adaptation reinforces useful structure or amplifies errors. On architectures utilizing patch-tokenization, spatial masking facilitates the accumulation of stable representations over extended data streams, whereas frequency masking leads to catastrophic collapse. We attribute this instability to a structural-preservation mechanism: spatial coherence preserves the broad-spectrum redundancy required to prevent terminal overlap with a corruption’s spectral signature.
  2. The optimal family is contingent upon the alignment between architecture and task. The disparity between families disappears in Convolutional Neural Networks (CNNs), where overlapping receptive fields mitigate the effects of patch occlusion. Conversely, in tasks requiring fine-grained global cues and large-capacity Vision Transformers (ViTs), frequency masking proves to be a competitive alternative.

In system-level comparisons that are confounded by differences in losses and auxiliary components, M2A’s random selection strategy performs on par with heuristic methods. However, we interpret this finding as suggestive context rather than a controlled quantification of the relative importance of the selection strategy $S$.


Source: arXiv Generated at: 2026-06-02 00:00:00 UTC

Related Articles

Law’s Billable Hour Is Being Shredded by AI
Bloomberg

Law’s Billable Hour Is Being Shredded by AI

AI is dismantling the billable hour by automating routine legal tasks. This technological shift threatens the traditiona...

Iran War: Trump Tries to Stop Israel’s Lebanon Push | The Opening Trade 6/2/2026
Bloomberg

Iran War: Trump Tries to Stop Israel’s Lebanon Push | The Opening Trade 6/2/2026

SoftBank in Early Talks to Back $800 Million Agile Robots Round
Bloomberg

SoftBank in Early Talks to Back $800 Million Agile Robots Round

SoftBank is in early talks to back Agile Robots’ $800 million funding round. The Japanese tech giant is currently in pre...

Amundi Is Diversifying Risk Via Commodity Currencies, Gold
Bloomberg

Amundi Is Diversifying Risk Via Commodity Currencies, Gold

Amundi diversifies risk by investing in commodity-linked currencies and gold. This strategy hedges against market volati...

Reuters

Marvell Technology surges after Nvidia's Huang calls it 'next trillion-dollar company'

Marvell Technology shares surged after Nvidia CEO Jensen Huang labeled the firm the “next trillion-dollar company.”

Russia Says It Found Foreign Spyware on Top Officials’ Phones
Bloomberg

Russia Says It Found Foreign Spyware on Top Officials’ Phones

Russia’s FSB claims to have discovered foreign spyware on senior officials’ phones. Moscow attributes the intrusion to h...