
Please choose a person to relate this publication to

To distinguish between persons with the same name, the academic degree and the title of an important publication are displayed.

 

Other publications of persons with the same name

Reward (Mis)design for autonomous driving. Artif. Intell., (March 2023)
Reinforcement Learning with Human Feedback in Mountain Car. AAAI Spring Symposium: Help Me Help You: Bridging the Gaps in Human-Agent Collaboration, AAAI, (2011)
Combining manual feedback with subsequent MDP reward signals for reinforcement learning. AAMAS, pp. 5-12. IFAAMAS, (2010)
Using informative behavior to increase engagement while learning from human reward. Auton. Agents Multi Agent Syst., 30 (5): 826-848 (2016)
Models of human preference for learning reward functions. CoRR, (2022)
Contrastive Preference Learning: Learning from Human Feedback without RL. CoRR, (2023)
Interactively shaping agents via human reinforcement: the TAMER framework. K-CAP, pp. 9-16. ACM, (2009)
Reinforcement learning from human reward: Discounting in episodic tasks. RO-MAN, pp. 878-885. IEEE, (2012)
Domestic Interaction on a Segway Base. RoboCup, volume 5399 of Lecture Notes in Computer Science, pp. 519-531. Springer, (2008)
Person recognition on a Segway Robot: A video of UT Austin Villa Robocup@Home 2007 finals demonstration. ICRA, pp. 1785-1786. IEEE, (2008)