@dblp

Smooth Contextual Bandits: Bridging the Parametric and Non-differentiable Regret Regimes.

, , and . COLT, volume 125 of Proceedings of Machine Learning Research, page 2007-2010. PMLR, (2020)

Links and resources

Tags