arXiv

Neural Variance-aware Dueling Bandits with Deep Representation and Shallow Exploration

June 4, 2026 · Youngmin Oh, Jinje Park, Taejin Paik · Original Source

Title: Neural Variance-aware Dueling Bandits with Deep Representation and Shallow Exploration

Abstract: This paper presents the inaugural variance-aware algorithms for contextual dueling bandits, utilizing neural networks to approximate nonlinear utilities alongside shallow exploration techniques. A primary theoretical hurdle in prior research was the lack of a closed-form estimator, necessitating an excessively wide network with a width of $m = \widetilde{\Omega}(T^{14})$. To overcome this limitation, we employ a new analytical framework that integrates spectral analysis with iterative self-improvement. This approach lowers the network width requirement to $m = \widetilde{\Omega}(T^{6})$ and demonstrates that our methods attain a sublinear regret bound of $\widetilde{\mathcal{O}}(d\sqrt{\sum_{t=1}^{T} \sigma_t^2} + \sqrt{dT})$ within both Upper Confidence Bound (UCB) and Thompson Sampling (TS) contexts. Experimental findings confirm that the proposed algorithms deliver state-of-the-art results on both synthetic and real-world benchmarks, while maintaining computational efficiency and exhibiting sublinear regret in practical applications.

Source: arXiv Generated at: 2026-06-04 00:00:00 UTC

TechCrunch

Meta’s Oversight Board says account bans lack due process, transparency

June 4, 2026

Meta’s Oversight Board criticized account bans for lacking due process and transparency, citing inconsistent enforcement...

Bloomberg

Fed's Daly Says Forward Guidance Could Be Misleading

June 4, 2026

Fed’s Daly warns forward guidance may be misleading or lack clarity.

TechCrunch

Meta rolls out a new AI creator assistant on Facebook

June 4, 2026

Meta launched an AI creator assistant on Facebook to streamline analytics and content brainstorming. Initially available...

TechCrunch

What to expect from WWDC 2026: Siri’s highly anticipated revamp and Apple Intelligence updates

June 4, 2026

WWDC 2026 promises a Siri revamp powered by Google’s Gemini and standalone app, plus AI agents in the App Store and Came...

TechCrunch

A burglar used a Waymo to steal yoga clothes in San Francisco — and got away with it

June 4, 2026

A thief stole yoga clothes using a Waymo, but police failed to catch them because the car’s video data was deleted and b...

Bloomberg

Goldman Sachs CEO David Solomon on the Coming Mega IPOs

June 4, 2026

Goldman Sachs CEO David Solomon anticipates a surge in major IPOs, signaling renewed market confidence and significant o...

Top international news

Neural Variance-aware Dueling Bandits with Deep Representation and Shallow Exploration

Related Articles

Meta’s Oversight Board says account bans lack due process, transparency

Fed's Daly Says Forward Guidance Could Be Misleading

Meta rolls out a new AI creator assistant on Facebook

What to expect from WWDC 2026: Siri’s highly anticipated revamp and Apple Intelligence updates

A burglar used a Waymo to steal yoga clothes in San Francisco — and got away with it

Goldman Sachs CEO David Solomon on the Coming Mega IPOs