Author of the publication

Guest editorial: special issue on reinforcement learning for real life.

, , , , and . Mach. Learn., 110 (9): 2291-2293 (2021)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

Exploration by Optimisation in Partial Monitoring., and . COLT, volume 125 of Proceedings of Machine Learning Research, page 2488-2515. PMLR, (2020)Online Learning in Markov Decision Processes with Changing Cost Sequences., , and . ICML, volume 32 of JMLR Workshop and Conference Proceedings, page 512-520. JMLR.org, (2014)Shifting Regret, Mirror Descent, and Matrices., and . ICML, volume 48 of JMLR Workshop and Conference Proceedings, page 2943-2951. JMLR.org, (2016)Continuous Time Associative Bandit Problems., , , and . IJCAI, page 830-835. (2007)Structured Best Arm Identification with Fixed Confidence., , , and . ALT, volume 76 of Proceedings of Machine Learning Research, page 593-616. PMLR, (2017)Gradient Descent for Sparse Rank-One Matrix Completion for Crowd-Sourced Aggregation of Sparsely Interacting Workers., , , and . CoRR, (2019)Learning to segment from a few well-selected training images., , and . ICML, volume 382 of ACM International Conference Proceeding Series, page 305-312. ACM, (2009)Manifold-adaptive dimension estimation., , and . ICML, volume 227 of ACM International Conference Proceeding Series, page 265-272. ACM, (2007)Margin Maximizing Discriminant Analysis., , and . ECML, volume 3201 of Lecture Notes in Computer Science, page 227-238. Springer, (2004)On the Sample Complexity of Batch Reinforcement Learning with Policy-Induced Data., , , , and . CoRR, (2021)