Author of the publication

Identification and off-policy learning of multiple objectives using adaptive clustering.

, and . Neurocomputing, (2017)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

Balanced Q-learning: Combining the influence of optimistic and pessimistic targets., , , , , , and . Artif. Intell., (December 2023)Memory-Constrained Policy Optimization., , , , , , and . CoRR, (2022)Intuitive Physics Guided Exploration for Sample Efficient Sim2real Transfer., , , and . ICPR Workshops (2), volume 13644 of Lecture Notes in Computer Science, page 674-686. Springer, (2022)Sympathy-based Reinforcement Learning Agents., , , and . AAMAS, page 1164-1172. International Foundation for Autonomous Agents and Multiagent Systems (IFAAMAS), (2022)Learning Transferable Domain Priors for Safe Exploration in Reinforcement Learning., , , , and . CoRR, (2019)Sample-Efficient Co-Design of Robotic Agents Using Multi-fidelity Training on Universal Policy Network., , , , and . CoRR, (2023)Human-aligned reinforcement learning for autonomous agents and robots., , , , and . Neural Comput. Appl., 35 (23): 16689-16691 (August 2023)A New Representation of Successor Features for Transfer across Dissimilar Environments., , , , , and . ICML, volume 139 of Proceedings of Machine Learning Research, page 1-9. PMLR, (2021)A self-replication basis for designing complex agents.. GECCO (Companion), page 45-46. ACM, (2018)Experience Replay Using Transition Sequences., and . CoRR, (2017)