arXiv

Trust Functions: Near-Lossless Weak-to-Strong Generalization by Learning When to Trust the Weak Teacher

June 2, 2026 · Arda Uzunoglu, Alvin Zhang, Daniel Khashabi · Original Source

Title: Trust Functions: Achieving Near-Lossless Weak-to-Strong Generalization Through Strategic Reliance on Weak Teachers

Abstract: Weak-to-strong generalization explores methods for enhancing a robust student model using guidance from a less capable teacher, particularly in scenarios where accurate labels are limited. We frame this challenge primarily as a data selection task, focusing on the critical need to distinguish which weak labels possess sufficient reliability to act as effective training signals. To solve this, we propose trust functions that calculate a scalar trust score for every weak label, allowing the system to filter out unreliable supervision. In diverse areas such as world knowledge, quantitative reasoning, and strategy games, this trust-based filtering produces student models that perform on par with, and occasionally exceed, those trained on ground-truth data, thereby realizing near-lossless weak-to-strong generalization. Furthermore, trust functions facilitate an iterative weak-to-strong progression, where a trained student is recycled as the teacher for the next cycle, compounding performance gains. We identify several underlying mechanisms that explain the success of trust functions.

Source: arXiv Generated at: 2026-06-02 00:00:00 UTC

Bloomberg

Law’s Billable Hour Is Being Shredded by AI

June 2, 2026

AI is dismantling the billable hour by automating routine legal tasks. This technological shift threatens the traditiona...

Bloomberg

Iran War: Trump Tries to Stop Israel’s Lebanon Push | The Opening Trade 6/2/2026

June 2, 2026

Bloomberg

SoftBank in Early Talks to Back $800 Million Agile Robots Round

June 2, 2026

SoftBank is in early talks to back Agile Robots’ $800 million funding round. The Japanese tech giant is currently in pre...

Bloomberg

Amundi Is Diversifying Risk Via Commodity Currencies, Gold

June 2, 2026

Amundi diversifies risk by investing in commodity-linked currencies and gold. This strategy hedges against market volati...

Reuters

Marvell Technology surges after Nvidia's Huang calls it 'next trillion-dollar company'

June 2, 2026

Marvell Technology shares surged after Nvidia CEO Jensen Huang labeled the firm the “next trillion-dollar company.”

Bloomberg

Russia Says It Found Foreign Spyware on Top Officials’ Phones

June 2, 2026

Russia’s FSB claims to have discovered foreign spyware on senior officials’ phones. Moscow attributes the intrusion to h...

Global News Digest

Trust Functions: Near-Lossless Weak-to-Strong Generalization by Learning When to Trust the Weak Teacher

Related Articles

Law’s Billable Hour Is Being Shredded by AI

Iran War: Trump Tries to Stop Israel’s Lebanon Push | The Opening Trade 6/2/2026

SoftBank in Early Talks to Back $800 Million Agile Robots Round

Amundi Is Diversifying Risk Via Commodity Currencies, Gold

Marvell Technology surges after Nvidia's Huang calls it 'next trillion-dollar company'

Russia Says It Found Foreign Spyware on Top Officials’ Phones