From post

копировать удалить добавить публикацию в буфер
Запись сообщества
посмотреть историю данной записи
URL
DOI
BibTeX
EndNote
APA
Chicago
DIN 1505
Harvard
MSOffice XML

Reinforcement learning from human reward: Discounting in episodic tasks.

W. Knox, и P. Stone. RO-MAN, стр. 878-885. IEEE, (2012)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed.

Bradley Miller

William Bradley

Bradley Heins

Bradley Lushman

Bradley Malkovsky

Другие публикации лиц с тем же именем

Reward (Mis)design for autonomous driving.W. Knox, A. Allievi, H. Banzhaf, F. Schmitt, и P. Stone. Artif. Intell., (марта 2023)Reinforcement Learning with Human Feedback in Mountain Car.W. Knox, A. Setapen, и P. Stone. AAAI Spring Symposium: Help Me Help You: Bridging the Gaps in Human-Agent Collaboration, AAAI, (2011)Combining manual feedback with subsequent MDP reward signals for reinforcement learning.W. Knox, и P. Stone. AAMAS, стр. 5-12. IFAAMAS, (2010)Using informative behavior to increase engagement while learning from human reward.G. Li, S. Whiteson, W. Knox, и H. Hung. Auton. Agents Multi Agent Syst., 30 (5): 826-848 (2016)Models of human preference for learning reward functions.W. Knox, S. Hatgis-Kessell, S. Booth, S. Niekum, P. Stone, и A. Allievi. CoRR, (2022)Contrastive Preference Learning: Learning from Human Feedback without RL.J. Hejna, R. Rafailov, H. Sikchi, C. Finn, S. Niekum, W. Knox, и D. Sadigh. CoRR, (2023)Interactively shaping agents via human reinforcement: the TAMER framework.W. Knox, и P. Stone. K-CAP, стр. 9-16. ACM, (2009)Reinforcement learning from human reward: Discounting in episodic tasks.W. Knox, и P. Stone. RO-MAN, стр. 878-885. IEEE, (2012)Domestic Interaction on a Segway Base.W. Knox, J. Lee, и P. Stone. RoboCup, том 5399 из Lecture Notes in Computer Science, стр. 519-531. Springer, (2008)Person recognition on a Segway Robot: A video of UT Austin Villa Robocup@Home 2007 finals demonstration.W. Knox, J. Lee, и P. Stone. ICRA, стр. 1785-1786. IEEE, (2008)

Что такое BibSonomy?: С чего начать; Кнопки для браузера; Помощь
Разработчикам: Обзор; API-документация

Контакт и защита личных данных: о нас; Cookies; Сообщить о проблеме; BibSonomy Вики

Интеграция: PUMA; Расширение для TYPO3; Плагин для; Клиент Java REST; Поддерживаемые источники; далее

О BibSonomy: Команда; Блог; Список рассылки
Социальные сети: Наш Twitter