Please choose a person to relate this publication to

To distinguish between persons with the same name, the academic degree and the title of an important publication are displayed.

Other publications of persons with the same name

Behavior coordination for a mobile robot using modular reinforcement learning. IROS, pp. 1329-1336. IEEE, (1996)
Co-evolution of Shaping Rewards and Meta-Parameters in Reinforcement Learning. Adapt. Behav., 16 (6): 400-412 (2008)
A Generalized Natural Actor-Critic Algorithm. NIPS, pp. 1312-1320. Curran Associates, Inc., (2009)
The Team Description of Osaka University "Trackies-99". RoboCup, volume 1856 of Lecture Notes in Computer Science, pp. 750-753. Springer, (1999)
Efficient sample reuse in policy search by multiple importance sampling. GECCO, pp. 545-552. ACM, (2018)
Estimating cost function of expert players in differential games: A model-based method and its data-driven extension. Expert Syst. Appl., (2024)
Vision Based State Space Construction for Learning Mobile Robots in Multi-agent Environments. EWLR, volume 1545 of Lecture Notes in Computer Science, pp. 62-78. Springer, (1997)
Cooperative Behavior Acquisition in Multi Mobile Robots Environment by Reinforcement Learning Based on State Vector Estimation. ICRA, pp. 1558-1563. IEEE Computer Society, (1998)
A New Natural Policy Gradient by Stationary Distribution Metric. ECML/PKDD (2), volume 5212 of Lecture Notes in Computer Science, pp. 82-97. Springer, (2008)
Theoretical Analysis of Efficiency and Robustness of Softmax and Gap-Increasing Operators in Reinforcement Learning. AISTATS, volume 89 of Proceedings of Machine Learning Research, pp. 2995-3003. PMLR, (2019)