arXiv

Training Prompt Matters: State-Adaptive Optimization for Robust Fine-Tuning

Abstract:

Although prompt engineering is essential for unlocking the full potential of Large Language Models (LLMs) at inference time, the role of prompts during training has been largely overlooked. Current fine-tuning approaches generally treat training prompts as superficial variations, assuming that semantically identical instructions produce the same learning outcomes. We demonstrate that this perceived equivalence is misleading. While paraphrased prompts may yield similar in-task performance, they have markedly different effects on cross-task capabilities, particularly catastrophic forgetting and generalization. Importantly, we find that these cross-task effects are positively correlated, suggesting the existence of "superior" prompts that consistently enhance overall performance. Moreover, we find that these optimal prompts can be reliably identified by analyzing task loss before learning begins. Drawing on these findings, we propose State-Adaptive Prompt Optimization (SAPO), a lightweight yet powerful training framework that transforms task formulation from a fixed input into a dynamic, state-dependent variable. Extensive experiments across various benchmarks validate SAPO’s efficacy, showing that it significantly reduces forgetting and improves generalization, outperforming current state-of-the-art methods. These findings shed light on how training prompts influence learning dynamics and provide a practical methodology for robust fine-tuning. Our code is available at https://github.com/Eric8932/SAPO.
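
To make the idea of identifying a prompt from task loss before learning begins more concrete, here is a minimal sketch. It is not the SAPO implementation from the repository above. The names `mean_task_loss`, `select_prompt`, and `loss_fn` are hypothetical, and the selection rule (pick the paraphrase with the lowest mean loss under the current model state) is an assumption, since the abstract does not state which direction or statistic of the loss is used.

```python
from typing import Callable, Sequence, Tuple

# Hypothetical signature: loss_fn(prompt, input_text, target_text) -> float.
# In practice it would wrap a forward pass of the model at its current
# state (no gradient step), so the score reflects the pre-update loss.
LossFn = Callable[[str, str, str], float]


def mean_task_loss(prompt: str,
                   examples: Sequence[Tuple[str, str]],
                   loss_fn: LossFn) -> float:
    """Average loss of one candidate prompt over a small probe set."""
    if not examples:
        raise ValueError("examples must be non-empty")
    return sum(loss_fn(prompt, x, y) for x, y in examples) / len(examples)


def select_prompt(candidates: Sequence[str],
                  examples: Sequence[Tuple[str, str]],
                  loss_fn: LossFn) -> Tuple[str, float]:
    """Return the candidate with the lowest mean loss and that loss.

    ASSUMPTION: 'lowest loss wins' is an illustrative criterion only;
    the source abstract does not specify the actual rule.
    """
    if not candidates:
        raise ValueError("candidates must be non-empty")
    scored = [(mean_task_loss(p, examples, loss_fn), p) for p in candidates]
    best_loss, best_prompt = min(scored, key=lambda pair: pair[0])
    return best_prompt, best_loss
```

Because `loss_fn` is evaluated against whatever state the model is in when it is called, calling `select_prompt` again at a later checkpoint may return a different paraphrase. That is one way to read "state-dependent" in the abstract, though the actual schedule SAPO uses is not described here.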


Source: arXiv Generated at: 2026-06-02 00:00:00 UTC
