arXiv

Trust Functions: Near-Lossless Weak-to-Strong Generalization by Learning When to Trust the Weak Teacher

Abstract: Weak-to-strong generalization studies how to improve a strong student model using supervision from a weaker teacher, particularly when accurate labels are scarce. We frame this challenge primarily as a data selection problem: the key requirement is to identify which weak labels are reliable enough to serve as training signal. To this end, we propose trust functions, which assign a scalar trust score to every weak label so that unreliable supervision can be filtered out. Across domains including world knowledge, quantitative reasoning, and strategy games, trust-based filtering produces student models that match, and occasionally exceed, those trained on ground-truth labels, thereby achieving near-lossless weak-to-strong generalization. Trust functions also enable iterative weak-to-strong training, in which a trained student is reused as the teacher for the next round, compounding performance gains. We identify several underlying mechanisms that explain the success of trust functions.
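The filtering step described in the abstract can be illustrated with a minimal sketch. The abstract does not say how a trust score is computed, so everything below is an assumption made for illustration: the `WeakExample` type, the `teacher_confidence` field, the `confidence_trust` function (which simply reuses the teacher's own confidence), and the 0.8 threshold are all hypothetical and are not the paper's method.

```python
from dataclasses import dataclass
from typing import Callable, List


@dataclass(frozen=True)
class WeakExample:
    """One weakly labeled training example (hypothetical structure)."""
    prompt: str
    weak_label: str
    teacher_confidence: float  # assumed to lie in [0, 1]


def confidence_trust(example: WeakExample) -> float:
    """Hypothetical trust function: reuse the teacher's confidence as the score."""
    return example.teacher_confidence


def filter_by_trust(
    examples: List[WeakExample],
    trust_fn: Callable[[WeakExample], float],
    threshold: float = 0.8,  # illustrative cutoff, not taken from the paper
) -> List[WeakExample]:
    """Keep only the weak labels whose scalar trust score meets the threshold."""
    return [ex for ex in examples if trust_fn(ex) >= threshold]


weak_data = [
    WeakExample("2 + 2 = ?", "4", 0.97),
    WeakExample("Capital of Australia?", "Sydney", 0.41),
    WeakExample("Is 17 prime?", "yes", 0.88),
]

# Only the trusted subset would be passed on to student training.
trusted = filter_by_trust(weak_data, confidence_trust, threshold=0.8)
print([ex.weak_label for ex in trusted])  # ['4', 'yes']
```

In the iterative setting the abstract mentions, the student trained on `trusted` would produce the weak labels for the next round, and the same filter would be applied again; how the trust function itself changes between rounds is not specified in the abstract.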


Source: arXiv
Generated at: 2026-06-02 00:00:00 UTC
