Behavior coordination for a mobile robot using modular reinforcement learning.

E. Uchibe, M. Asada, and K. Hosoda. IROS, pp. 1329-1336. IEEE, (1996)

Other publications by people with the same name

Co-evolution of Shaping Rewards and Meta-Parameters in Reinforcement Learning. S. Elfwing, E. Uchibe, K. Doya, and H. Christensen. Adapt. Behav., 16 (6): 400-412 (2008)

A Generalized Natural Actor-Critic Algorithm. T. Morimura, E. Uchibe, J. Yoshimoto, and K. Doya. NIPS, pp. 1312-1320. Curran Associates, Inc., (2009)

The Team Description of Osaka University "Trackies-99". S. Suzuki, T. Kato, H. Ishizuka, H. Kawanishi, T. Tamura, M. Yanase, Y. Takahashi, E. Uchibe, and M. Asada. RoboCup, volume 1856 of Lecture Notes in Computer Science, pp. 750-753. Springer, (1999)

Efficient sample reuse in policy search by multiple importance sampling. E. Uchibe. GECCO, pp. 545-552. ACM, (2018)

Estimating cost function of expert players in differential games: A model-based method and its data-driven extension. H. Asl and E. Uchibe. Expert Syst. Appl., (2024)

Vision Based State Space Construction for Learning Mobile Robots in Multi-agent Environments. E. Uchibe, M. Asada, and K. Hosoda. EWLR, volume 1545 of Lecture Notes in Computer Science, pp. 62-78. Springer, (1997)

Cooperative Behavior Acquisition in Multi Mobile Robots Environment by Reinforcement Learning Based on State Vector Estimation. E. Uchibe, M. Asada, and K. Hosoda. ICRA, pp. 1558-1563. IEEE Computer Society, (1998)

A New Natural Policy Gradient by Stationary Distribution Metric. T. Morimura, E. Uchibe, J. Yoshimoto, and K. Doya. ECML/PKDD (2), volume 5212 of Lecture Notes in Computer Science, pp. 82-97. Springer, (2008)

Theoretical Analysis of Efficiency and Robustness of Softmax and Gap-Increasing Operators in Reinforcement Learning. T. Kozuno, E. Uchibe, and K. Doya. AISTATS, volume 89 of Proceedings of Machine Learning Research, pp. 2995-3003. PMLR, (2019)
