arXiv

Edge of Stability Selectively Shapes Learning Across the Data Distribution

Title: Edge of Stability Selectively Shapes Learning Across the Data Distribution

Abstract:

Current research typically characterizes the edge of stability (EoS) as a universal feature of the optimization process. In contrast, we demonstrate that this phenomenon is selective: the stability constraint actively redistributes learning efforts across different segments of the training data, thereby enhancing advancement for certain groups while hindering it for others. By employing a branching intervention that allows the model to either enter or exit the EoS regime from an identical initial state, we provide causal evidence of this trade-off and pinpoint two critical prerequisites for a specific group to gain an advantage. The first condition requires that the group’s aggregate gradient aligns with the principal Hessian eigenvector. We isolate this mechanism through a controlled perturbation that maintains distance but randomizes direction; this disruption of alignment effectively nullifies the benefit. The second condition is that the group must maintain a non-vanishing gradient magnitude throughout training. Under cross-entropy loss, gradient saturation causes confidently classified groups to decouple, thereby transferring the advantage to output-outliers, which sustain persistent gradients. Collectively, these findings reveal that the EoS serves not merely as a boundary for stability, but as a governing mechanism for how learning is allocated across the entire data distribution.


Source: arXiv Generated at: 2026-06-04 00:00:00 UTC

Related Articles

TechCrunch

A burglar used a Waymo to steal yoga clothes in San Francisco — and got away with it

A thief stole yoga clothes using a Waymo, but police failed to catch them because the car’s video data was deleted and b...

Goldman Sachs CEO David Solomon on the Coming Mega IPOs
Bloomberg

Goldman Sachs CEO David Solomon on the Coming Mega IPOs

Goldman Sachs CEO David Solomon anticipates a surge in major IPOs, signaling renewed market confidence and significant o...

What Are A.I. Agents Actually Doing?
New York Times

What Are A.I. Agents Actually Doing?

Arena research shows tech professionals are most likely to use AI agents at work, highlighting a strong industry trend i...

TechCrunch

Cash App launches a wand for tap-and-pay

Cash App launched a $25 NFC "Magic Wand" for tap-and-pay, blending viral novelty with practical contactless payments. It...

Databricks CEO Plans to Avoid IPO During Year of Huge Offerings
Bloomberg

Databricks CEO Plans to Avoid IPO During Year of Huge Offerings

Databricks CEO plans to avoid an IPO in 2021, despite a surge in public offerings. This contrasts with earlier reports t...

TechCrunch

Waymo’s spent robotaxi batteries will be used as grid storage

Waymo partners with B2U to repurpose retired robotaxi batteries for grid storage in California and Texas, aligning with ...