From post

Upper-Confidence-Bound Algorithms for Active Learning in Multi-armed Bandits.

A. Carpentier, A. Lazaric, M. Ghavamzadeh, R. Munos, and P. Auer. ALT, volume 6925 of Lecture Notes in Computer Science, pp. 189-203. Springer, (2011)

Please choose a person to relate this publication to

To distinguish between persons with the same name, the academic degree and the title of an important publication are displayed.

Mohammad Mohammad

Mohammad Sadeghi

Metallogenic modeling and mineral potential mapping in the Takab district, NW Iran. M. Sadeghi. Uni Halle-Wittenberg, (2009)

Mohammad Reza Mohammad Khorasani

Mohammad Sadeghian

Mohammad Naghdi

Other publications of persons with the same name

A multiagent reinforcement learning algorithm by dynamically merging markov decision processes. M. Ghavamzadeh, and S. Mahadevan. AAMAS, pp. 845-846. ACM, (2002)

Upper-Confidence-Bound Algorithms for Active Learning in Multi-armed Bandits. A. Carpentier, A. Lazaric, M. Ghavamzadeh, R. Munos, and P. Auer. ALT, volume 6925 of Lecture Notes in Computer Science, pp. 189-203. Springer, (2011)

Classification-Based Approximate Policy Iteration. A. massoud Farahmand, D. Precup, A. da Motta Salles Barreto, and M. Ghavamzadeh. IEEE Trans. Automat. Contr., 60 (11): 2989-2993 (2015)

Optimizing over a Restricted Policy Class in Markov Decision Processes. E. Banijamali, Y. Abbasi-Yadkori, M. Ghavamzadeh, and N. Vlassis. CoRR, (2018)

A Generalized Kernel Approach to Structured Output Learning. H. Kadri, M. Ghavamzadeh, and P. Preux. CoRR, (2012)

Prediction, Consistency, Curvature: Representation Learning for Locally-Linear Control. N. Levine, Y. Chow, R. Shu, A. Li, M. Ghavamzadeh, and H. Bui. CoRR, (2019)

Bayesian Policy Gradient and Actor-Critic Algorithms. M. Ghavamzadeh, Y. Engel, and M. Valko. J. Mach. Learn. Res., (2016)

Optimizing over a Restricted Policy Class in MDPs. E. Banijamali, Y. Abbasi-Yadkori, M. Ghavamzadeh, and N. Vlassis. AISTATS, volume 89 of Proceedings of Machine Learning Research, pp. 3042-3050. PMLR, (2019)

Natural actor-critic algorithms. S. Bhatnagar, R. Sutton, M. Ghavamzadeh, and M. Lee. Autom., 45 (11): 2471-2482 (2009)

Adaptive Sampling for Minimax Fair Classification. S. Shekhar, M. Ghavamzadeh, and T. Javidi. CoRR, (2021)

BibSonomy

Disambiguation