arXiv

Adaptive Exploration for Latent-State Bandits

June 2, 2026 · Jikai Jin, Kenneth Hung, Sanath Kumar Krishnamurthy, Baoyi Shi, Congshan Zhang · Original Source

Title: Adaptive Exploration Strategies for Latent-State Bandit Problems

Abstract: This research investigates bandit scenarios where reward distributions are governed by an unobserved Markov state that transitions independently of the learner’s decisions. Consequently, the best-performing arm may shift over time, even though the learner’s information is limited to historical actions and outcomes. To address this, we introduce algorithms that enhance LinUCB by incorporating two specific summaries of the hidden state: a lagged action-reward pair and, when feasible, a probe fingerprint derived from the rewards of multiple arms. The adaptive versions of these algorithms dynamically update the fingerprint by applying tests for residuals, margins, and staleness. Synthetic evaluations assessing state cardinality, transition rates, noise levels, and time horizons demonstrate that these approaches significantly lower dynamic regret compared to standard, adversarial, and non-stationary bandit baselines, provided that the summaries effectively differentiate states and are refreshed with sufficient frequency. Furthermore, ablation studies and misspecification tests highlight primary failure points, including insufficient fingerprint separation, excessive noise, and state transitions occurring during sequential probing phases.

Source: arXiv Generated at: 2026-06-02 00:00:00 UTC

Bloomberg

Withings Debuts New Smart Scale Marketed Toward GLP-1 Users

June 2, 2026

Withings launched a new smart scale targeting GLP-1 users, offering advanced body composition analysis. This device help...

TechCrunch

Rocket engine startup Impulse raises $500 million to hire people, not AI

June 2, 2026

Rocket engine startup Impulse Space raised $500 million to hire 200 engineers, prioritizing human expertise over AI for ...

Bloomberg

Startup Impulse Space Raises $500 Million, Valued at $4 Billion

June 2, 2026

Impulse Space secured $500 million in funding, achieving a $4 billion valuation. This investment supports the developmen...

Bloomberg

Walmart’s Answer to Apple Pay Wants to Be Your Favorite Financial App

June 2, 2026

Walmart’s new financial app aims to rival Apple Pay, positioning itself as a preferred digital payment and banking solut...

Bloomberg

Nvidia Is Bigger, Stronger, and Trying to Slay the Laptop Dragon Again

June 2, 2026

Nvidia unveiled the RTX Spark Superchip at Computex 2026, aiming to challenge Intel’s PC dominance and modernize hardwar...

TechCrunch

Pacific Fusion’s latest prototype packs 440 gigawatts into an 80-nanosecond burst

June 2, 2026

Pacific Fusion’s new prototype delivers 440 gigawatts in 80 nanoseconds, securing over $1 billion in funding and enablin...

Global News Digest

Adaptive Exploration for Latent-State Bandits

Related Articles

Withings Debuts New Smart Scale Marketed Toward GLP-1 Users

Rocket engine startup Impulse raises $500 million to hire people, not AI

Startup Impulse Space Raises $500 Million, Valued at $4 Billion

Walmart’s Answer to Apple Pay Wants to Be Your Favorite Financial App

Nvidia Is Bigger, Stronger, and Trying to Slay the Laptop Dragon Again

Pacific Fusion’s latest prototype packs 440 gigawatts into an 80-nanosecond burst