arXiv

The Variance Brain Foundation Models Forgot: Third-Order Statistics Predict Cognition Where Billion-Parameter Models Fail

Title: The Variance Brain Foundation Models Forgot: Third-Order Statistics Predict Cognition Where Billion-Parameter Models Fail

Abstract: Brain foundation models (BFMs), which are self-supervised Transformers trained on functional magnetic resonance imaging (fMRI) data, are theoretically capable of extracting individual cognitive performance metrics from these signals. However, our evaluation reveals a significant shortfall: across three leading BFM architectures and various readout methods, their predictive accuracy for cognition falls below that of a simple linear regression model utilizing only the ~80K parameters of the functional connectivity (FC) matrix. This performance deficit intensifies as model scale increases; for instance, BrainLM’s larger 650M-parameter model performs worse than its smaller 111M-parameter counterpart.

We identify this discrepancy as a variance allocation problem. While BFM pretraining effectively captures the dominant variance components within fMRI data, it fails to retain the higher-order structural information essential for predicting cognition. Our analysis of reconstructed signals via per-cumulant examination indicates that although second-order covariance is partially maintained, the third-order co-skewness tensor is substantially erased.

To address this loss, we developed a linear pipeline that projects fMRI signals into a subspace optimized for preserving co-skewness and derives FC within this space. This approach surpasses both raw FC and all tested pretrained BFMs across every dataset and parcellation scheme examined, achieving superior performance to previous state-of-the-art methods under controlled conditions, despite requiring no pretraining and no GPU. Furthermore, we demonstrate that the primary limitation lies in the pretraining objective rather than model architecture or size, as we restore the raw-FC performance ceiling on BrainLM’s forward pass by fine-tuning with a loss function specifically targeted at this co-skewness-preserving subspace.


Source: arXiv Generated at: 2026-06-04 00:00:00 UTC

Related Articles

AI Concentration Risk Is the Problem: 3-Minutes MLIV
Bloomberg

AI Concentration Risk Is the Problem: 3-Minutes MLIV

The article argues that AI concentration risk, rather than the technology itself, is the primary concern. It highlights ...

Reuters

Foxconn announces strategic collaboration with Intel on next-gen AI infrastructure

Foxconn and Intel announced a strategic partnership to develop next-generation AI infrastructure. This collaboration aim...

SpaceX Seeks to Raise $75 Billion in Record IPO (Video)
Bloomberg

SpaceX Seeks to Raise $75 Billion in Record IPO (Video)

SpaceX aims for a record $75 billion valuation through an initial public offering. This historic IPO marks a significant...

Broadcom AI Chip Outlook Disappoints Investors
Bloomberg

Broadcom AI Chip Outlook Disappoints Investors

Broadcom’s AI chip projections disappointed investors, dampening market sentiment. The outlook fell short of expectation...

Reuters

Europe's tech 'liberation day'? Computer says not yet

Europe’s expected tech breakthrough remains unrealized, as current systems indicate that a true "liberation day" has not...

Hiranandani Group CEO on Powering India's Digital Future
Bloomberg

Hiranandani Group CEO on Powering India's Digital Future

Hiranandani Group CEO discusses driving India's digital transformation.