Author of the publication

UBEV - A More Practical Algorithm for Episodic RL with Near-Optimal PAC and Regret Guarantees.

CoRR, (2017)


Other publications of authors with the same name

No Free Lunch versus Occam's Razor in Supervised Learning. Algorithmic Probability and Friends, volume 7070 of Lecture Notes in Computer Science, pages 223-235. Springer, (2011)

Soft-Bayes: Prod for Mixtures of Experts with Log-Loss. ALT, volume 76 of Proceedings of Machine Learning Research, pages 372-399. PMLR, (2017)

Garbage In, Reward Out: Bootstrapping Exploration in Multi-Armed Bandits. ICML, volume 97 of Proceedings of Machine Learning Research, pages 3601-3610. PMLR, (2019)

Online Learning to Rank with Features. ICML, volume 97 of Proceedings of Machine Learning Research, pages 3856-3865. PMLR, (2019)

Refining the Confidence Level for Optimistic Bandit Strategies. J. Mach. Learn. Res., (2018)

Free Lunch for Optimisation under the Universal Distribution. CoRR, (2016)

Learning with Good Feature Representations in Bandits and in RL with a Generative Model. CoRR, (2019)

Bandit Algorithms. (2019)

Following the Leader and Fast Rates in Online Linear Prediction: Curved Constraint Sets and Other Regularities. J. Mach. Learn. Res., (2017)

Iterative Budgeted Exponential Search. IJCAI, pages 1249-1257. ijcai.org, (2019)