Author of the publication

Reinforcement Learning Algorithms for MDPs -- A Survey

. TR09-13. Department of Computing Science, University of Alberta, (2009)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

Policy Error Bounds for Model-Based Reinforcement Learning with Factored Linear Models, and . COLT, page 121--151. (2016)Cleaning up the neighborhood: A full classification for adversarial partial monitoring, and . ALT, (February 2019)An Information-Theoretic Approach to Minimax Regret in Partial Monitoring, and . COLT, (April 2019)Multi-view Matrix Factorization for Linear Dynamical System Estimation, , , and . NIPS, page 7092--7101. (2017)Randomized Exploration in Generalized Linear Bandits, , , , , and . AISTATS, (March 2020)Structured Best Arm Identification with Fixed Confidence, , , and . ALT, 76, page 593--616. PMLR, (October 2017)Conservative Bandits, , , and . ICML, page 1254--1262. (2016)Mixing Time Estimation in Reversible Markov Chains from a Single Sample Path, , , , , and . Annals of Applied Probability, 29 (4): 2439--2480 (July 2019)PAC-Bayes bounds for stable algorithms with instance-dependent priors, , , , and . NIPS, (September 2018)Uncertainty and Performance of Adaptive Controllers for Functionally Uncertain Output Feedback Systems, , and . CDC, page 4515--4520. Tampa, Florida, IEEE, (December 1998)