Author of the publication

copy delete add this publication to your clipboard
community post
history of this post
URL
DOI
BibTeX
EndNote
APA
Chicago
DIN 1505
Harvard
MSOffice XML

Reinforcement Learning Algorithms for MDPs -- A Survey

{. Szepesvári. TR09-13. Department of Computing Science, University of Alberta, (2009)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

Éva SzuÌ cs

Bela Csáki

Jürgen Csomor

Attila Csemez

Katharina Csontos

Other publications of authors with the same name

Policy Error Bounds for Model-Based Reinforcement Learning with Factored Linear ModelsB. Pires, and {. Szepesvári. COLT, page 121--151. (2016)Cleaning up the neighborhood: A full classification for adversarial partial monitoringT. Lattimore, and {. Szepesvári. ALT, (February 2019)An Information-Theoretic Approach to Minimax Regret in Partial MonitoringT. Lattimore, and {. Szepesvári. COLT, (April 2019)Multi-view Matrix Factorization for Linear Dynamical System EstimationM. Karami, M. White, D. Schuurmans, and {. Szepesvári. NIPS, page 7092--7101. (2017)Randomized Exploration in Generalized Linear BanditsB. Kveton, M. and Zaheer, {. Szepesvári, L. Li, M. Ghavamzadeh, and C. Boutilier. AISTATS, (March 2020)Structured Best Arm Identification with Fixed ConfidenceR. Huang, M. Ajallooeian, {. Szepesvári, and M. Müller. ALT, 76, page 593--616. PMLR, (October 2017)Conservative BanditsR. Shariff, Y. Wu, T. Lattimore, and {. Szepesvári. ICML, page 1254--1262. (2016)Mixing Time Estimation in Reversible Markov Chains from a Single Sample PathD. Hsu, A. Kontorovich, D. Levin, Y. Peres, {. Szepesvári, and G. Wolfer. Annals of Applied Probability, 29 (4): 2439--2480 (July 2019)PAC-Bayes bounds for stable algorithms with instance-dependent priorsO. Rivasplata, {. Szepesvári, J. Shawe-Taylor, E. Parrado-Hernandez, and S. Sun. NIPS, (September 2018)Uncertainty and Performance of Adaptive Controllers for Functionally Uncertain Output Feedback SystemsM. French, {. Szepesvári, and E. Rogers. CDC, page 4515--4520. Tampa, Florida, IEEE, (December 1998)

BibSonomy

Disambiguation of "Szepesvári, Cs."

copy delete add this publication to your clipboard
community post
history of this post
URL
DOI
BibTeX
EndNote
APA
Chicago
DIN 1505
Harvard
MSOffice XML

Reinforcement Learning Algorithms for MDPs -- A Survey

Please choose a person to relate this publication to

Éva SzuÌ cs

Bela Csáki

Jürgen Csomor

Attila Csemez

Katharina Csontos

Other publications of authors with the same name

Disambiguation

BibSonomy

Disambiguation of "Szepesvári, Cs."

copydeleteadd this publication to your clipboardcommunity posthistory of this postURLDOIBibTeXEndNoteAPAChicagoDIN 1505HarvardMSOffice XML Reinforcement Learning Algorithms for MDPs -- A Survey

Please choose a person to relate this publication to

Éva SzuÌ cs

Bela Csáki

Jürgen Csomor

Attila Csemez

Katharina Csontos

Other publications of authors with the same name

Disambiguation

copy delete add this publication to your clipboard
community post
history of this post
URL
DOI
BibTeX
EndNote
APA
Chicago
DIN 1505
Harvard
MSOffice XML

Reinforcement Learning Algorithms for MDPs -- A Survey