arXiv

Invariant Gradient Alignment for Robust Reasoning Distillation

Title: Enhancing Robust Reasoning Distillation through Invariant Gradient Alignment

Abstract: Large language models (LLMs) are prone to shortcut learning, a phenomenon where they systematically struggle with out-of-distribution (OOD) inputs that present different semantic surfaces than the training data, even if the underlying logical structure remains the same. This limitation hampers knowledge distillation pipelines that aim to transfer chain-of-thought reasoning capabilities to smaller student models. To address this, we propose Invariant Gradient Alignment (IGA), a novel training framework designed to synchronize gradient updates across examples that are semantically varied yet logically isomorphic. IGA relies on three key innovations: first, the use of Logical Isomer Sets, which comprise problem groups that share the same logical structure across disparate semantic fields such as mathematics, law, medicine, and science; second, a differentiable Continuous Gradient Conflict Mask that reduces parameter dimensions exhibiting high cross-domain gradient variance while safeguarding invariant directions; and third, a truncated Singular Value Decomposition (SVD) projection that maps the masked gradient back onto the LoRA low-rank manifold, thereby preserving parameter efficiency. Theoretical analysis demonstrates that IGA provides tighter OOD generalization bounds compared to Empirical Risk Minimization (ERM), with performance scaling alongside the number of isomer domains, and achieves convergence at the standard Stochastic Gradient Descent (SGD) rate under mild regularity conditions. Empirical evaluations show that IGA surpasses eight baseline methods across four distinct benchmarks, achieving accuracy improvements of up to 14.3 percentage points over ERM-SFT. Furthermore, it attains a Logical Consistency Score of 0.031, compared to 0.142 for the baseline, marking a fourfold enhancement in representational invariance.


Source: arXiv Generated at: 2026-06-04 00:00:00 UTC

Related Articles

Revolut Co-Founder, CTO Vlad Yatsenko to Step Down From Role
Bloomberg

Revolut Co-Founder, CTO Vlad Yatsenko to Step Down From Role

Revolut co-founder and CTO Vlad Yatsenko is stepping down from his executive role. The resignation marks a significant l...

Microsoft’s AI Chief Says Anthropic Models Are Too Expensive
Bloomberg

Microsoft’s AI Chief Says Anthropic Models Are Too Expensive

Microsoft AI CEO Mustafa Suleyman criticized Anthropic’s models as too expensive. Meanwhile, Microsoft plans to allow us...

Ramp Notches $44 Billion Valuation in New Funding Round
Bloomberg

Ramp Notches $44 Billion Valuation in New Funding Round

RAMP secured a $44 billion valuation in its latest funding round. CEO Eric Glyman attended the 2026 Reagan National Econ...

China’s Robotaxi Dilemma Shows AI Policy Tension Between Growth and Jobs
Bloomberg

China’s Robotaxi Dilemma Shows AI Policy Tension Between Growth and Jobs

China’s robotaxi expansion highlights the policy tension between driving economic growth through AI and protecting emplo...

Exams watchdog warns of rise in high-tech cheating
BBC News

Exams watchdog warns of rise in high-tech cheating

Ofqual warns of rising high-tech cheating, with smart devices involved in 44% of misconduct cases. Invigilators are trai...

Thailand’s Richest Man Plans $4.3 Billion Expansion Amid AI Boom
Bloomberg

Thailand’s Richest Man Plans $4.3 Billion Expansion Amid AI Boom

Thailand’s wealthiest individual is investing $4.3 billion in expansion, capitalizing on the booming artificial intellig...