@dblp

An algorithm with nearly optimal pseudo-regret for both stochastic and adversarial bandits.

, and . CoRR, (2016)

Links and resources

Tags