Author of the publication

Efficient Model-Based Concave Utility Reinforcement Learning through Greedy Mirror Descent.

, , , and . AISTATS, volume 238 of Proceedings of Machine Learning Research, page 2206-2214. PMLR, (2024)

Other publications of authors with the same name

Detection and classification of seismic events with progressive multi-channel correlation and hidden Markov models., , , and . Comput. Geosci., (2015)

Target Tracking for Contextual Bandits: Application to Demand Side Management., , , and . ICML, volume 97 of Proceedings of Machine Learning Research, page 754-763. PMLR, (2019)

Apprentissage statistique de la topologie d'un ensemble de données étiquetées., , and . EGC, volume RNTI-E-9 of Revue des Nouvelles Technologies de l'Information, page 455-460. Cépaduès-Éditions, (2007)

Improved Sleeping Bandits with Stochastic Action Sets and Adversarial Rewards., , and . ICML, volume 119 of Proceedings of Machine Learning Research, page 8357-8366. PMLR, (2020)

Versatile Dueling Bandits: Best-of-both World Analyses for Learning from Relative Preferences., and . ICML, volume 162 of Proceedings of Machine Learning Research, page 19011-19026. PMLR, (2022)

One Arrow, Two Kills: A Unified Framework for Achieving Optimal Regret Guarantees in Sleeping Bandits., , and . AISTATS, volume 206 of Proceedings of Machine Learning Research, page 7755-7773. PMLR, (2023)

Efficient online learning with kernels for adversarial large scale problems., , and . NeurIPS, page 9427-9436. (2019)

Le Graphe Génératif Gaussien., , and . Monde des Util. Anal. Données, (2009)

Uniform regret bounds over R^d for the sequential linear regression problem with the square loss., , , and . ALT, volume 98 of Proceedings of Machine Learning Research, page 404-432. PMLR, (2019)

Online nonparametric regression with Sobolev kernels., , , and . CoRR, (2021)