
Please choose a person to relate this publication to

To distinguish between persons with the same name, the academic degree and the title of an important publication are displayed.

 

Other publications of persons with the same name

A multiagent reinforcement learning algorithm by dynamically merging Markov decision processes. AAMAS, pp. 845-846. ACM, (2002)

Upper-Confidence-Bound Algorithms for Active Learning in Multi-armed Bandits. ALT, volume 6925 of Lecture Notes in Computer Science, pp. 189-203. Springer, (2011)

Classification-Based Approximate Policy Iteration. IEEE Trans. Automat. Contr., 60 (11): 2989-2993 (2015)

Optimizing over a Restricted Policy Class in Markov Decision Processes. CoRR, (2018)

A Generalized Kernel Approach to Structured Output Learning. CoRR, (2012)

Prediction, Consistency, Curvature: Representation Learning for Locally-Linear Control. CoRR, (2019)

Bayesian Policy Gradient and Actor-Critic Algorithms. J. Mach. Learn. Res., (2016)

Optimizing over a Restricted Policy Class in MDPs. AISTATS, volume 89 of Proceedings of Machine Learning Research, pp. 3042-3050. PMLR, (2019)

Natural actor-critic algorithms. Autom., 45 (11): 2471-2482 (2009)

Adaptive Sampling for Minimax Fair Classification. CoRR, (2021)