arXiv

Offline-to-Online Learning in Linear Bandits

Title: Bridging Offline and Online Learning in Linear Bandit Frameworks

Abstract: This paper studies online learning augmented by a pre-existing offline dataset in stochastic linear bandits. Although such scenarios are common in real-world applications, the trade-off between offline and online learning strategies in structured settings has not been thoroughly explored. To address this, we introduce a novel linear bandit algorithm designed to balance the two. The method relies on the offline data during the early phases of interaction and progressively shifts toward exploration as the time horizon grows. We derive regret bounds showing that the approach is competitive with both purely online and purely offline baselines. Specifically, the algorithm guarantees regret with respect to the optimal action that is sublinear in the number of online interactions, while its regret with respect to an offline reference metric diminishes as the number of offline samples grows. Our empirical evaluations confirm the robustness and effectiveness of the method across a diverse range of problem parameters.
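To make the offline-versus-exploration trade-off concrete, here is a minimal sketch of a standard LinUCB-style learner whose ridge-regression statistics are warm-started from logged data. It is not the algorithm proposed in the paper, which the abstract does not specify. The class name `WarmStartLinUCB`, the parameters `reg` and `beta`, the two arms, and the logged rewards are all hypothetical and chosen only for illustration.

```python
import math


def dot(u, v):
    """Inner product of two equal-length vectors."""
    return sum(a * b for a, b in zip(u, v))


def solve(A, b):
    """Solve A x = b by Gauss-Jordan elimination with partial pivoting."""
    n = len(b)
    M = [row[:] + [b[i]] for i, row in enumerate(A)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(M[r][col]))
        M[col], M[pivot] = M[pivot], M[col]
        for r in range(n):
            if r != col:
                f = M[r][col] / M[col][col]
                M[r] = [x - f * y for x, y in zip(M[r], M[col])]
    return [M[i][n] / M[i][i] for i in range(n)]


class WarmStartLinUCB:
    """Hypothetical LinUCB-style learner (an assumption, not the paper's method)."""

    def __init__(self, dim, reg=1.0, beta=1.0):
        # V = reg * I + sum of x x^T; b = sum of reward * x
        self.V = [[reg if i == j else 0.0 for j in range(dim)] for i in range(dim)]
        self.b = [0.0] * dim
        self.beta = beta  # weight on the exploration bonus

    def update(self, x, reward):
        """Fold one (feature vector, reward) pair into the statistics.

        The same call handles logged offline samples and online feedback.
        """
        for i, xi in enumerate(x):
            self.b[i] += reward * xi
            for j, xj in enumerate(x):
                self.V[i][j] += xi * xj

    def select(self, arms):
        """Return the index of the arm with the largest upper confidence bound."""
        theta = solve(self.V, self.b)  # ridge-regression estimate

        def ucb(x):
            width = math.sqrt(max(0.0, dot(x, solve(self.V, x))))
            return dot(x, theta) + self.beta * width

        return max(range(len(arms)), key=lambda k: ucb(arms[k]))


arms = [[1.0, 0.0], [0.0, 1.0]]
# Made-up offline log that only ever observed the second arm.
offline_log = [([0.0, 1.0], 0.5), ([0.0, 1.0], 0.5)]

greedy = WarmStartLinUCB(dim=2, beta=0.0)    # trusts the offline estimate only
explorer = WarmStartLinUCB(dim=2, beta=1.0)  # adds an uncertainty bonus
for learner in (greedy, explorer):
    for x, r in offline_log:
        learner.update(x, r)

greedy_choice = greedy.select(arms)
explorer_choice = explorer.select(arms)
```

With this toy log, the learner with `beta=0.0` keeps choosing the arm the offline data covers, while the learner with `beta=1.0` tries the arm the log never observed. An approach of the kind the abstract describes would need some rule for moving between these two behaviours as the horizon grows; that rule is not given in the abstract and is not reproduced here.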


Source: arXiv
Generated at: 2026-06-04 00:00:00 UTC
