Author of the publication

Transfer of task representation in reinforcement learning using policy-based proto-value functions.

, , and . AAMAS (3), page 1329-1332. IFAAMAS, (2008)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

No-Regret Exploration in Goal-Oriented Reinforcement Learning., , , , and . CoRR, (2019)Parallel Higher Order Alternating Least Square for Tensor Recommender System., , and . AAAI Workshops, volume WS-17 of AAAI Technical Report, AAAI Press, (2017)Multi-Bandit Best Arm Identification., , , and . NIPS, page 2222-2230. (2011)Reinforcement Learning in Continuous Action Spaces through Sequential Monte Carlo Methods., , and . NIPS, page 833-840. Curran Associates, Inc., (2007)Limiting Extrapolation in Linear Approximate Value Iteration., , , and . NeurIPS, page 5616-5625. (2019)Exploiting easy data in online optimization., , and . NIPS, page 810-818. (2014)Efficient Bias-Span-Constrained Exploration-Exploitation in Reinforcement Learning., , , and . ICML, volume 80 of Proceedings of Machine Learning Research, page 1573-1581. PMLR, (2018)Incremental Skill Acquisition for Self-motivated Learning Animats., , and . SAB, volume 4095 of Lecture Notes in Computer Science, page 357-368. Springer, (2006)Reinforcement Learning in Linear MDPs: Constant Regret and Representation Selection., , , , , and . NeurIPS, page 16371-16383. (2021)Best Arm Identification: A Unified Approach to Fixed Budget and Fixed Confidence., , and . NIPS, page 3221-3229. (2012)