arXiv

The Loss Is Not Enough: Sampling Conditions and Inductive Bias in Contrastive Representation Learning

Title: Beyond the Loss: Sampling Constraints and Inductive Bias in Contrastive Representation Learning

Abstract:

While contrastive learning stands as a dominant approach in self-supervised representation learning, the precise conditions required for it to successfully recover meaningful latent geometry are not yet fully elucidated. This study establishes a measure-theoretic framework to formalize the "diversity condition," a critical support requirement for positive-pair sampling that is essential for achieving isometric latent recovery. We demonstrate that the conventional full-support von Mises-Fisher distribution satisfies this diversity condition, ensuring that global minimizers of the contrastive loss recover the latent geometry up to an orthogonal transformation. Conversely, when conditional distributions are restricted, non-orthogonal mappings can achieve a strictly lower asymptotic contrastive loss. To address this, we propose a support-corrected variant of Information Noise Contrastive Estimation (InfoNCE). This theoretical adjustment renders orthogonal latent space recovery feasible, although it does not uniquely enforce it. Our experimental validation on synthetic benchmarks confirms these identifiability predictions, while results from CIFAR-10 align with the qualitative hypothesis that architectural inductive bias plays a more pivotal role when sampling diversity is constrained. Collectively, these findings shed light on the interplay between sampling mechanisms and encoder inductive bias within contrastive representation learning.


Source: arXiv Generated at: 2026-06-04 00:00:00 UTC

Related Articles

TechCrunch

Meta’s Oversight Board says account bans lack due process, transparency

Meta’s Oversight Board criticized account bans for lacking due process and transparency, citing inconsistent enforcement...

Fed's Daly Says Forward Guidance Could Be Misleading
Bloomberg

Fed's Daly Says Forward Guidance Could Be Misleading

Fed’s Daly warns forward guidance may be misleading or lack clarity.

TechCrunch

Meta rolls out a new AI creator assistant on Facebook

Meta launched an AI creator assistant on Facebook to streamline analytics and content brainstorming. Initially available...

TechCrunch

What to expect from WWDC 2026: Siri’s highly anticipated revamp and Apple Intelligence updates

WWDC 2026 promises a Siri revamp powered by Google’s Gemini and standalone app, plus AI agents in the App Store and Came...

TechCrunch

A burglar used a Waymo to steal yoga clothes in San Francisco — and got away with it

A thief stole yoga clothes using a Waymo, but police failed to catch them because the car’s video data was deleted and b...

Goldman Sachs CEO David Solomon on the Coming Mega IPOs
Bloomberg

Goldman Sachs CEO David Solomon on the Coming Mega IPOs

Goldman Sachs CEO David Solomon anticipates a surge in major IPOs, signaling renewed market confidence and significant o...