Author of the publication

Computational, Neuroscientific, and Lifespan Perspectives on the Exploration-Exploitation Dilemma.

, , , , , , , , , and . CogSci, cognitivesciencesociety.org, (2011)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

Combining manual feedback with subsequent MDP reward signals for reinforcement learning., and . AAMAS, page 5-12. IFAAMAS, (2010)Using informative behavior to increase engagement while learning from human reward., , , and . Auton. Agents Multi Agent Syst., 30 (5): 826-848 (2016)Reinforcement Learning with Human Feedback in Mountain Car., , and . AAAI Spring Symposium: Help Me Help You: Bridging the Gaps in Human-Agent Collaboration, AAAI, (2011)Reward (Mis)design for autonomous driving., , , , and . Artif. Intell., (March 2023)Person recognition on a Segway Robot: A video of UT Austin Villa Robocup@Home 2007 finals demonstration., , and . ICRA, page 1785-1786. IEEE, (2008)Interactively shaping agents via human reinforcement: the TAMER framework., and . K-CAP, page 9-16. ACM, (2009)Models of human preference for learning reward functions., , , , , and . CoRR, (2022)Contrastive Preference Learning: Learning from Human Feedback without RL., , , , , , and . CoRR, (2023)Domestic Interaction on a Segway Base., , and . RoboCup, volume 5399 of Lecture Notes in Computer Science, page 519-531. Springer, (2008)Reinforcement learning from human reward: Discounting in episodic tasks., and . RO-MAN, page 878-885. IEEE, (2012)